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Intel486™ MICROPROCESSOR 


INTRODUCTION 


Intel microprocessors and peripherals provide a broad 
range of time-saving, energy-efficient, high-perform- 
ance solutions to designers of both mobile and desktop 
microprocessor-based systems. Intel’s microprocessor/ 
peripheral interface delivers time and performance ad- 
vantages to the designers of microprocessor-based sys- 
tems, meeting their demand for greater performance, 
lower power consumption and a wider variety of built- 
in features for their customers. 


HIGH-PERFORMANCE 
ENTRY-LEVEL SYSTEMS 


Intel offers an entire product line of entry-level micro- 
processors for desktop and mobile systems, ranging 
from the 25 MHz version of the Intel486™ SX proces- 
sor to the high performance 100 MHz IntelDX4 proc- 
essor. The IntelDX4 processor is the world’s fastest 
486, providing a new level of affordable computing 
power to the business desktop and unparalled perform- 
ance to mobile computers. Intel couples superior per- 
formance with sophisticated energy-efficient SL tech- 
nology to meet the requirements of the EPA’s Energy 
Star guidelines. 


REDUCED TIME TO MARKET 


Intel’s universal motherboard for Intel486 microproces- 
sor-based desktop systems and scalable architecture for 
mobile systems reduces the development effort required 
to produce an entire product line of high-performance 
entry-level systems, thereby reducing time to market. 


December 1994 
Order Number: 2418 16-002 


Intel offers a wide variety of off-the-shelf components 
to fulfill the requirements of system designers while 
simplifying the implementation of their designs. Off- 
the-shelf system solutions greatly decrease the potential 
risk of costly project delays due to component incom- 
patibility. These system solutions greatly reduce the 
amount of time required to design, debug, manufacture 
and test microprocessor-based systems. 


INCREASED RELIABILITY 


High reliability is a tangible goal that translates to 
higher reliability for your product, reduced downtime, 
and reduced repair costs. As more and more functions 
are integrated into fewer components, the resulting sys- 
tem requires less power, produces less heat, and re- 
quires fewer mechanical connections—again resulting 
in greater system reliability. 3 


LOWER COSTS 


Using proven, reliable off-the-shelf components will re- 
duce design costs, manufacturing costs, and time to 
market while increasing project viability and product 
reliability. 
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IintelDX4™ Processor 

— Up to 100-MHz Operation 

— Speed-Multiplying Technology 
— 32-Bit Architecture 

— 16K-Byte On-Chip Cache 

— Integrated Floating-Point Unit 


— 3.3V Core Operation with 5V Tolerant 


I/O Buffers 

— SL Technology 

— Static Design 

— IEEE 1149.1 Boundary Scan 
Compatibility 

— Binary Compatible with Large 
Software Base 


Write-Back Enhanced IntelDX2™ 

Processor 

— Speed-Multiplying Technology 

— 32-Bit Architecture 

— 8K-Byte On-Chip Write-Back Cache 

— Integrated Floating-Point Unit 

— SL Technology 

— Static Design 

— IEEE 1149.1 Boundary Scan 
Compatibility 

— Binary Compatible with Large 
Software Base 


IintelDX2™ Processor 

— Speed-Multiplying Technology 

— 32-Bit Architecture 

— 8K-Byte On-Chip Cache 

— Integrated Floating-Point Unit 

— SL Technology 

— Static Design 

— IEEE 1149.1 Boundary Scan 
Compatibility 

— Binary Compatible with Large 
Software Base 
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IntelSX2T™ Processor 

— Speed-Multiplying Technology 

— 32-Bit Architecture 

— 8K-Byte On-Chip Cache 

— SL Technology 

— Static Design 

— IEEE 1149.1 Boundary Scan 

~ Compatibility | 

— Binary Compatible with Large 
Software Base 


Intel486™ DX Processor 

— 32-Bit Architecture 

— 8K-Byte On-Chip Cache 

— Integrated Floating-Point Unit 

— SL Technology 

— Static Design 

— IEEE 1149.1 Boundary Scan 
Compatibility 

— Binary Compatible with Large 
Software Base 


Intel486 SX Processor 

— 32-Bit Architecture 

— 8K-Byte On-Chip Cache 

— SL Technology 

— Static Design 

— IEEE 1149.1 Boundary Scan 
Compatibility | 

— Binary Compatible with Large 
Software Base 


*Other brands and names are the property of their respective owners. 
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DATA SHEET DESIGNATIONS 


Intel uses various data sheet markings to designate each phase of the document as it relates to the product. 
The marking appears in the lower, inside corner of the data sheet. The jorewing is the definition of these 
meies: | 


Product Preview | Contains information on products in the design phase of development. Do not 
finalize a design with this information. Revised information will be published when 
the product becomes available. 


Contains information on products being sampled or in the initial production phase of 
development.t 


Preliminary Contains preliminary information on new products in SraduCHON tT 
»No Marking | Contains information on products in full production. 


Tt Specifications within these data sheets are subject to change without notice. Verify with your local Intel sales office that 
= you nave the latest datasheet before finalizing a design. 
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intel. 
1.0 INTRODUCTION 


The Intel486 processor family enables a range of 
low-cost, high-performance entry-level system de- 
signs capable of running the entire installed base of 
DOS*, Windows*, OS/2*, and UNIX* applications 
written for the Intel architecture. This family includes 
the IntelDX4 processor, the fastest Intel486 proces- 
sor (up to 50% faster than an IntelDX2 processor). 
The IntelDX4 processor integrates a 16K unified 
cache and floating point hardware on-chip for 
improved performance. The IntelDX2 processor 
integrates an 8K unified cache and floating point 
hardware on chip. The IntelDX2 processor is also 
available with a write-back on-chip cache for im- 
proved entry-level performance. The IntelDX4 and 
IntelDX2 processors use Intel’s speed-multiplying 
technology, allowing the processor to operate at fre- 
quencies higher than the external memory bus. The 
Intel486 DX processor offers the features of the 
IntelDX2 processors without speed-multiplying. The 
Intel486 SX processor offers the features of the 
Intel486 DX processor without floating point hard- 
ware and the IntelSX2 processor adds speed-multi- 
plying to the Intel486 SX processor. The entire 
Intel486 processor family incorporates energy effi- 
cient “SL Technology” for mobile and desktop com- 
puting. 


SL Technology enables desktop system designs 
that exceed the Environment Protection Agency’s 
(EPA) Energy Star program guidelines without com- 
promising performance. It also increases system de- 
sign flexibility and improves battery life in ail Intel486 
processor-based notebooks. SLTechnology allows 
system designers to differentiate their power man- 
agement schemes with a variety of energy-efficient 
or battery-life preserving features. Intel486 proces- 
sors provide power management features that are 
transparent to application and operating system 
software. Stop Clock, Auto HALT Power Down, and 
Auto Idle power down allow software transparent 
control over processor power management. Equally 
important is the capability of the processor to man- 
age system power consumption. Intel486 processor 
System Management Mode (SMM) incorporates a 
non-maskable System Management Interrupt 
(SMI #), a corresponding Resume (RSM) instruction 
and a new memory space for system management 
code. Intel’s SMM ensures seamless power control 
of the processor core, system logic, main memory, 
and one or more peripheral devices, that is transpar- 
ent to any application or operating system. 
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Intel486 processors are available in a full range of 
speeds (25 MHz to 100 MHz), packages (PGA, 
SQFP PQFP), and voltages (5V, 3.3V) to meet any 

system design requirements. | 


1.1 Processor Features 


All of the Intel486 processors consist of a 32-bit inte- 
ger processing unit, an on-chip cache, and a memo- 
ry management unit. This ensures full binary com- 
patibility with the 8086, 8088, 80186, 80286, 
Intel386™ SX, Intel886 DX, and all versions of 
Intel486 processors. All of the Intel486 processors 
offer the following features: 


e 32-bit RISC integer core—The Intel486 processor 
performs a complete set of arithmetic and logical 
operations on 8-, 16-, and 32-bit data types using 
a full-width ALU and eight general purpose regis- 
ters. 


e Single Cycle Execution—Many instructions exe- 
cute in a single clock cycle. 


e Instruction Pipelining—The fetching, decoding, 
address translation and execution of. instructions 
are overlapped within the Intel486 processor. 


© On-Chip Floating Point Unit—\|ntel486 processors 
support the 32-, 64-, and 80-bit formats specified 
in IEEE standard 754. The unit is binary compati- 
ble with the 8087, Intel287™, Intel387™ coproc- 
essors, and Intel OverDrive™ processor. 


e On-Chip Cache with Cache Consistency Sup- 
port—An 8-Kbyte (16 Kbyte on the IntelDX4 proc- 
essor) internal cache is used for both data and 
instructions. Cache hits provide zero wait-state 
access times for data within the cache. Bus activ- 
ity is tracked to detect alterations in the memory 
represented by the internal cache. The internal 
cache can be invalidated or flushed so that an 
external cache controller can maintain cache 
consistency. 


e External Cache Control—Write-back and flush 
controls for an external cache are provided so 
the processor can maintain cache consistency. 


e On-Chip Memory Management Unit—Address 
management and memory space protection 
mechanisms maintain the integrity of memory ina 
multitasking and virtual memory environment. 
Both segmentation and paging are supported. 
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Burst Cycles—Burst transfers allow a new double 
word to be read from memory on each bus clock 
cycle. This capability is especially useful for in- 


struction prefetch and for filling the internal 


cache. 


Write Buffers—The processor contains four write 
buffers to enhance the performance of consecu- 
tive writes to memory. The processor can contin- 
ue internal operations after a write to these buff- 
ers, without waiting for the write to be completed 
“on the external bus. | 


Bus Backoff—\f another bus master needs con- 
trol of the bus during a processor initiated bus 
cycle, the Intel486 processor will float its bus sig- 
nals, then restart the cycle when the bus be- 
comes available again. 


Instruction Restart—Programs can continue exe- 
cution following an exception generated by an 
unsuccessful attempt to access memory. This 
feature is important for supporting demand-paged 
virtual memory applications. 


Dynamic Bus Sizing—External controllers can dy- 
- namically alter the effective width of the data bus. 
Bus widths of 8, 16, or 32 bits can be used. 


Boundary Scan (JTAG)—Boundary Scan pro- 
vides in-circuit testing of components on printed 
circuit boards. The Intel Boundary Scan imple- 
mentation conforms with the IEEE Standard Test 
Access Port and Boundary Scan Architecture. 
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SL Technology provides the following features: 


e /ntel System Management Mode—A unique Intel 
_ architecture operating mode provides a dedicat- 
ed special purpose interrupt and address space 
that can be used to implement intelligent power 
management and other enhanced functions in a 
manner that is completely transparent to the op- 
erating system and applications software. — 


e //O Restart—An |/O instruction interrupted by a 
System Management Interrupt (SMI#) can auto- 
matically be restarted following the execution of 
the RSM instruction. 


e Stop Clock—The Intel486 processor has a stop 
clock control mechanism that provides two low- 
power states: a “fast wake-up” Stop Grant state 
(~ 20 mA-100 mA) and a “slow wake-up” Stop 
Clock state with CLK frequency at 0 MHz 
(100 »A-1000 pA). 

e Auto HALT Power Down—After the execution of 
a HALT instruction, the Intel486 processor issues 
a normal Halt bus cycle and the clock input to the 
Intel486 processor core is automatically stopped, 
causing the processor to enter the Auto HALT 
Power Down state (~ 20 mA-100 mA). 


e Upgrade Power Down Mode—When a Intel486 
processor upgrade is installed, the upgrade pow- 
er down mode detects the presence of the up- 
grade, powers down the core, and tri-states all 
outputs of the original processor, so the Intel486 
processor enters a very low current mode. | 


e Auto Idle Power Down —This function allows the 
processor to reduce the core frequency to the 
bus frequency when both the core and bus are 
idle. Auto Idle Power Down is software transpar- 
ent and does not affect processor performance. 
Auto Idle Power Down provides an average pow- 
er savings of 10% and is only applicable to clock 
multiplied processors. 
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Enhanced Bus Mode Features (for the Write- Back e Write Bursting—Data written from the processor 


Enhanced IntelDX2 Processor only): - to memory can be bursted (zero wait state trans- 
e Write Back Internal Cache—The Write-Back En- fer). 


hanced IntelDX2 processor adds write-back sup- 


port to the 8-Kbyte unified cache. The on-chip 
cache is configurable to be write-back or write- 1. 2 Intel486™™ Processor Product 


through on a line by line basis. The internal cache Family | | 
implements a modified MESI protocol, which is og See 
abe applicable to i systems. Table 1-1 shows the Intel486 processors available 


oe by Clock Mode, Supply Voltage, Maximum Frequen- 
e Enhanced Bus Mode—The definitions of some — cy, and Package. Likewise, an individual product will 


signals have been changed to support the new have either a 5V supply voltage or a 3.3V supply 

Enhanced Bus mode (write-back mode). voltage, but not both. An individual product will have 
either a 1X clock or a 2X clock, but not both. Please 
contact Intel for the latest Ere rene and 
specifications. | 


Table 1-1. Product Options 


ae 168- 
Intel486T™ ae (MHz) Pin 208-Lead 196-Lead 


Provessors | [as [ss] ao[s0 | 66] 75 [100] 6A] SOFP | POF 


intel4e6Sx Processor | 3av|~ |» | | | | | | | | 
peeve) al 


Fiasxew Processor [svt | | [| | | |e] | 4 
inie496 OXProcessr [eav| [»| | 1 | | | |» 

Tw? tet [ep | | pet | 
mabe Process Laat | te[»f | | | |» | 


rv{ || [el=t | fe]. —| |: 
witeBackennanced fa! | lel] | | | |» || 
inebxeProcessr sv | | | [»l>| | |» || 
neon prooessr fev] | | | | lel»t-| » [| 
axceck —SOSCSC—“CSCSC“‘s*‘CStC*S 


nieta6 SxProcesor®™ [sav[> [>] | | | | [|] > | 
rv[-f-| T 7 >]. |]. [+ 

mieHes OxProcesso® fav | [| | | | | ||» | 
8 2 

NOTES: 


1. The 5V 33-MHz Intel486 DX processor is available in 168-pin PGA and 196-lead PQFP packages. The 5V 50-MHz 
Intel486 DX processor is available in a 168-pin PGA package only. 

2. With the addition of SL Technology to the Intel486 processor family, the Low Power Intel486 SX and Low Power Intel486 
DX processors have been superseded with the 3.3V Intel486 processors described in this document. 
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2.0 HOW TO USE-THIS DOCUMENT 


2.1 Introduction © 


This data sheet is a compilation of previously pub- 
lished individual data sheets for the Intel486 SX, 
IntelSX2, Intel486 DX, IntelDX2 and IntelDX4 proc- 
essors. With the addition of the Write-Back En- 
hanced IntelDX2 and information previously pub- 
lished for the introduction of the SL Enhanced 
Intel486 processors, this data sheet encompasses 
the entire current Intel486 processor family. 


This data sheet describes the Intel486 processor ar- 
chitecture, features and technical details. Unless 
otherwise stated, any description for the Intel486 
processor listed in this data sheet applies to all 
Intel486 processors. Where architectural or other 
differences do occur (for example, the IntelDX4 
processor has a 16-Kbyte on-chip cache, all other 
Intel486 processors have an 8-Kbyte on-chip 
_ cache), these differences are described in separate 

sections. Section 2.2 provides a brief section de- 
scription, highlighting the specific sections that con- 
tain processor-unique information.. 


This data sheet does not detail the Intel486SL proc- 
essor, the Low Power Intel486 SX or Low Power 
Intel486 DX directly. The Low Power Intel486 proc- 
essors have been superseded by current versions of 
the Intel486 processors. : 


It is important to note that all Intel486 DX, IntelDX2, 
and IntelDX4 processors have an on-chip floating 
point unit. The Intel486 SX and IntelSX2 processors 
do not have on-chip floating point and do not pro- 
vide FERR# and IGNNE#, floating point error re- 
porting signals. 


The 5V 50-MHz Intel486 DX processor does not im- 
plement SL Technology and does not contain the 
following pins: SMIACT#, SRESET, SMI#, 
STPCLK#, and UP#. 


Boundary Scan (JTAG) testability features, capability 


and associated test signals (TCK, TMS, TDI, and © 


TDO) are standard on all Intel486 processors except 
the Intel486 SX processors in 168-pin PGA package. 
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2.2 Section Contents and Processor 
Specific Information 


The following is a brief description of the contents of 
each section: 


Section 1: “Introduction.” This section is an 
overview of the current Intel486 
processor family, product features 
and highlights. This section also lists 
product frequency, voltage and 


package offerings. 


“How to Use This Document.” This 
section presents information to aid in 
the use of this data sheet. 


“Pin Description.” This section con- 
tains all of the pin configurations for 
the various package options (168- 
Pin PGA, 208-Lead SQF, and 196- 
Lead PQFP), package diagrams, pin 
assignment tables and pin assign- 
ment differences for the various 
processors within a package class. 


The 168-Pin PGA and 208-Lead 
SQFP package diagrams shown are 
for the IntelDX2, Write-Back En- 
hanced IntelDX2 and IntelDX4 proc- | 
essors, with differences for other — 
members of the Intel486 processor 
family listed in separate tables. The 
196-Lead PQFP package diagram is 
for the Intel486 DX processor. Differ- 
ences for the Intel486 SX processor 
in the 196-Lead package are also 
listed in a separate table. 


This section also provides a quick — 
pin reference table that lists pin sig- 
nals for the Intel486 processor fami- 
ly. The table, whenever necessary, 
has sections applicable to each cur- 
rent Intel486 processor family mem- 
ber. 


“Architectural Overview.” This sec- 
tion describes the Intel486 proces- 
sor architecture, including the regis- 
ter and instruction sets, memory or-— 
ganization, data types and formats, 
and interrupts for all Intel486 proces- — 
sors. 


Section 2: 


Section 3: 


Section 4: 


Section 5: 


Section 6: 


Section 7: 


Section 8: 


The architectural overview describes 
the 32-bit RISC integer core of the 
Intel486 processor. The on-chip 


floating point unit for the Intel486 DX — 


IntelDX2 and IntelDX4 processors is 
included in this section. Operational 
differences for the Intel486 SX and 
IntelSX2 processors (i.e. processors 
that do not containing on-chip float- 
ing point units) are also described in 
detail. 


“Real Mode Architecture.” This sec- 
tion describes the Intel486 proces- 
sor real-mode architecture, including 
memory addressing, reserved loca- 
tions, interrupts, and Shutdown and 
HALT. This section applies to all 
Intel486 processors. 


“Protected Mode Architecture.” This 
section describes the Intel486 pro- 
tected-mode architecture, including 
addressing mechanism, segmenta- 
tion, protection, paging and virtual 
8086 environment. This section ap- 
plies to all Intel486 processors. 


“On-Chip Cache.” This section de- 
scribes the on-chip cache of the 


Intel486 processors. Specific infor- . 


mation on size, features, modes, and 
configurations is described. The dif- 
ferences between the IntelDX4 proc- 
essor on-chip cache (16-KByte) and 
other members of the Intel486 proc- 
essor family on chip cache (8-KByte) 
are detailed. 


This section also documents fea- 
tures, modes and operational issues 
specific to the Write-Back Enhanced 
IntelDX2 processor. The specifics for 
the Write-Back Enhanced IntelDX2 
are interleaved with sections on the 
Standard mode _  (write-through) 
cache of other Intel486 processors 
as appropriate. 


“System Management Mode (SMM) 
Architectures.” This section de- 
scribes the System Management 
Mode architecture of the Intel486 
processors, including system man- 
agement mode interrupt processing 
and programming mode. Specific in- 
formation to the Write-Back En- 
hanced IntelDX2 processor only are 
listed in appropriate sections. 


Section 9: 


Section 10: 


Section 11: 
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This section applies to Intel486 proc- 
essors except the 50 MHz Intel486 
DX processor, which does not imple- 
ment SL Technology. 


“Hardware Interface.” This section 
describes the hardware interface of 
the current Intel486 processor fami- 
ly, including signal descriptions, in- 
terrupt interfaces, write buffers, reset 
and initialization, and clock control. 


The IntelDX4 processor speed multi- 
plying options are detailed in this 
section. The Write-Back Enhanced 
IntelDX2 processor signals (both 
new pins and those which have dif- 
ferent operational functions) are de- 
tailed in this section. Reset and ini- 
tialization, as it applies to all of the 
Intel486 processor family, is also 
documented here. 


Use and operation of the Stop Clock, 
Auto HALT Power Down and other 
power-saving SL Technology fea- 
tures are described. Information spe- 
cific to the Write-Back Enhanced 
IntelDX2 processor is also docu- 
mented whenever appropriate. 


“Bus Operation.” This section de- 
scribes the Intel486 processor bus 
operation, including the data transfer 
mechanism and bus functional de- 
scription. When in Standard Bus 
mode, the Write-Back Enhanced 
IntelDX2 processor bus operation is 
the same as other members of the 
Intel486 processor family. Specific 
information to the Write-Back 
IntelDX2 processor in Enhanced Bus 
mode is detailed in a separate sec- 
tion for ease of use. 


“Testability.” This section describes 
the testability of the Intel486 proces- 
sors, including the built-in self test 
(BIST), on-chip cache testing, trans- 
lation lookaside buffer (TLB) testing, 
tri-state output test mode, and 
boundary scan (JTAG). 


Both the Write-Back Enhanced 
IntelDX2 and the IntelDX4 proces- 
sors have unique cache structures 
that alter testing in comparison to 
other members of the current 
Intel486 processor family These 
processor-specific differences are 
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Section 12: 


Section 13: 


Section 14: 


Section 15: 


Section 16: 
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documented in this section. A com-: 


plete listing of Boundary Scan ID 
Codes and Boundary Scan Register 
Bits orders are also included. 


“Debugging Support.” This section 
describes the Intel486 processor de- 
bugging support, including the break- 
point instruction, single-step trap and 
debug registers. This section applies 
to all Intel486 processors. 


“Instruction Set Summary.” This 
section provides clock count and in- 
struction encoding summaries for all 
the Intel486 processors. 


“Differences between Intel486 Proc- 
essors and Intel886™ Processors.” 
This section lists the differences be- 
tween the Intel486 processor family 


and the Intel386 processor family. 


Also described and documented are 
differences between the _ Intel386 
with an Intel3887™ math coproces- 
sors and the Intel486 processors 
with on-chip floating point units. This 
section applies to all Intel486 proc- 
essors. 

“Differences between the PGA, 


SQFP and PQFP Versions of the 
Intel486 SX and Intel486 DX Proces- 


_ sors.” The Low Power Intel486 SX 


and Intel486 DX processors have 
been superseded by the current 
Intel486 processors. This section 
lists the differences between the cur- 
rent Intel486 SX and Intel486 DX 
products offered in PGA, SQFP and 
PQFP packages. The current 
Intel486 SX and Intel486 DX proces- 
sors in the PQFP package can oper- 
ate in 2X clock mode, which is de- 
scribed in detail here. Electrical 
specifications for the Intel486 SX 
and Intel486 DX processors in 2X 
Clock mode are listed in this section. 


‘OverDriveTM Processor Socket.” 
This section describes the OverDrive 
processor socket requirements for 
end-user upgradability of the 


Intel486 processor family. This sec- 


tion applies to all Intel486 proces- 
sors. 


| Section 17: 


Section 18: 


Appendix A: 


Appendix B: 


Appendix C: 


Appendix D: 


Appendix E: 


a 
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“Electrical Data.” This section lists 
the AC and DC specifications for all 
Intel486 processors. Processor spe- 
cific information is listed in both com- 


mon and separate tables and sec- 
tions as appropriate. 


‘Mechanical Data.” This section lists 
the mechanical and thermal data, in- 


cluding the package specifications 


(PGA, SQFP and PQFP) for all 
Intel486 processors. Processor spe- 
cific information is listed in both com- 
mon and separate tables and sec- 
tions as appropriate. . 


‘Advanced Features.” This section 
documents the advanced features of 
the Intel486 processor family not 
covered in other sections of this data 
sheet. 


“Features Determination.” This sec- 
tion documents the CPUID function 
to determine the Intel486 processor 
family identification and processor 
specific information. This section ap- 
plies to all Intel486 processors. 


‘IBIS Models.” This section provides 
a detailed sample listing of the types 
of I/O buffer modeling information 
available for the Intel486 processor 
family. This section applies to all 
Intel486 processors. 

“BSDL Listing.” This section pro- 
vides a sample listing of a BSDL file 
for the Intel486 processor family. 
This section applies to all Intel486 
processors. 


“System Design Notes.” This sec- 
tion provides design notes applica- 
ble to the use of System Manage- 
ment Mode and SMM routines with 
the Intel486 processor. This section 
applies to all Intel486 processors, 


except the 50-MHz Intel486 DX — 


processor. 
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2.3 Documents Replaced by This Data 
Sheet 


This Data Sheet contains all of the latest information 
for the Intel486 processor family and replaces the 
following documentation: 


SL Enhanced Intel486™ Microprocessor Data 
Sheet Addendum, Order No. 241696 


Intel486™ SX Microprocessor Data Book, Order No. 
240950 


IntelSX2™ Microprocessor Data Sheet, Order No. 
241966 


Intel486™ DX Microprocessor Data Book, Order 
No. 240440 


Intel486™ DX2 Microprocessor Data Book, Order 
No. 241245 


IntelDX4™ Microprocessor Data Book, Order No. 
241944 


Intel486™ Family of Microprocessors Low Power 
Version Data Sheet, Order No. 241199 
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3.0 PIN DESCRIPTION 


3.1 Pin Assignments 


The following figures show the pin assignments of 
each package type for the Intel486 processor prod- 
uct family. Tables are provided showing the pin dif- 
ferences between the existing Intel486 processor 
products and the Intel486 processor products. 


- 168-Pin PGA—Pin Grid Array 


e Package Diagram 
e Pin Assignment Difference Table 
° Pin Cross Reference by Pin Name 


208-Lead SQFP—Quad Flat Pack 

e Package Diagram 

e Pin Assignment Difference Table 

e Pin Assignment Table in numerical order 


196-Lead PQFP—Plastic Quad Flat Pack 

e Package Diagram 

e Pin Assignment Difference Table 

e Pin Assignment Table in numerical order 
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Figure 3-1. Package Diagram for 168-Pin PGA Package of the IntelDX2™ Processor 
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Figure 3-2. Package Diagram for 168-Pin PGA Package of the 
Write-Back Enhanced IntelDX2™ Processor 
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Figure 3-3. 168-Pin PGA Pinout Diagram (Pin Side) for the IntelDX4™ Processor 
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Table 3-1. Pinout Differences for 168-Pin PGA Package 


Previous 
intel486T™ 
SX 

Processor(?) 


NC(1) NC TCK(5) 
TCK(4) 
. Tepe e Teter 


Previous 
IintelDX2TM 


Previous 
Intel486 DX 
Processor(7) 


intel486 SX|IntelSX2T™ 
Processor | Processor 


Intel486 DX 
Processor 


IntelDX2 |IntelDX4™ 
PeOcoee On, Processor 


INC INC INC INC INC INC 
CACHE # (6)| 
B13 INC INC INC INC INC 
WB/WT # (6) 
NC TMS(S) 
a 


TDO <= = a | 
TDOMM) 


a coe ee a 
vec! | vec | Veo [Veo Veo [Veo | Veo | Vous 
fr] no [Nc [nc [ No | No” | No | No | CLKwut 
[sep no | no | no [no | no [No [| No | vouner™ 
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pac ag 
. NC. Do Not Connect. These pins should always remain unconnected. Connection of NC pins to Vcc or Vss or to any 
other signal can result in component malfunction or incompatibility with future steppings of the Intel486 processors. 

_ 2. INC. Internal No Connect. These pins are not connected to any internal pad in Intel486 processors and OverDrive™ 
processors. However, new signals are defined for the location of the INC pins in the Intel486 processor proliferations. All 

INC pins defined by: Intel have a specific use for jumperless single socket compatibility with current and future processors. 
A system design could connect any signal to an INC pin without affecting the operation of the processor. However, the 
purpose of a specific INC pin should be understood before it is used. If not, the system design will sacrifice the ability to 
implement a jumperless (single socket) flexible motherboard. 

3. This pin location is for the Vocs pin on the IntelDX4 processor. For compatibility with 3.3V processors that have 5V safe 
input buffers (i.e., IntelDX4 processors), this pin should be connected to a Vcc trace, not to the Vcc plane. See section 
3.2, ‘Quick Pin Reference,” for a description of the Vocs pin on the IntelDX4 processor. 

4. These pins were only available on previous 50-MHz Intel486 DX processors. These pins are now on all speeds of the 
Intel486 DX processor. 

5. These pins were No Connects on previous Intel486 DX and IntelDX2 processors. For compatibility with old designs, they 
can still be left unconnected. 

6. These pins are used on the Write-Back Enhanced IntelDX2 processor only. 

7. Previous versions of the Intel486 processor family do not implement SL Technology and are not described in this data 
sheet. 
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Table 3-2. Pin Cross Reference for 168-Pin PGA Package of the IntelDX2™ Processor | 


ae ee ee 


A20M# ..D15 
ADS# ...S17 . 
AHOLD ..A17 B11 A11 
BEO#....K15 C4 B3 
BE1#....J16 C5 B4 
BE2# ....J15 E2 B5 
BE3#....F17 E16 EY 
BLAST# .R16 G2 F17 
BOFF# .. 013 G16 G1 
S4 H16 G17 
J4 H17 
K2 H1 
K16 K1 
sedan L16 K17 
M2 Lu 
eee M16 L17 
ee ee P16 M1 
sae ee R3 M17 
Ce ea eon R6 P17 
R8 Q2 
R9 R4 
R10 S6 
R11 S8 
R14 S9 
S10 
S11 
$12 
$14 


eee ee 


eee eve 


eer eee 


W/R# ...N17 
STPCLK#G15 
SRESET .C10 
TCK ....A3(3) 
TDI ....A14(3) 
TDO ...B16(S) 
TMS ...B14(3) 
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NOTES: . 

1. INC. Internal No Connect. These pins are not connected to any internal pad in Intel486™ processors and OverDrive™ 
processors. However, new signals are defined for the location of the INC pins in the Intel486 processor proliferation. All 
INC pins defined by Intel have a specific use for jumperless single socket compatibility with current and future processors. 
A system design could connect any signal to an INC pin without affecting the operation of the processor. However, the 
purpose of a specific INC pin should be understood before it is used. If not, the system design will sacrifice the ability to 
implement a jumperless (single socket) flexible motherboard. 

2. NC. Do Not Connect. These pins should always remain unconnected. Connection of NC pins to Vcc or Vsg on to any 
other signal can result in component malfunction or incompatibility with future steppings of the Intel486 processors. 

3. Boundary Scan pins are not included on the 168-pin PGA package version of the Intel486 SX processor. 
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* Pin 3. See Note 1 for Table 3-3. 


Figure 3-4. Package Diagram for 208-Lead SQFP of the IntelDX2™ Processor 


2-23 


Intel486™ PROCESSOR FAMILY in 


PLOCK# 
A10 
vcc 
VSS 
vcc 
VSS 
A12 
A13 
A14 
vcc 
vss 


= A15 
J 


vss 
“1 LOCK# 
“723 AG 
“3 AT 
——) VCC 
2 A8& 
3 VSS 
3 VEC 
1 A9 
A16 
A17 
3 VSS 
<3 VCC 
<= TO 


PeAPSESND | 


“——"7 BLAST# 


} 7 ———3 ADS# 
TT A4 
AS _ 
2c UPA - 


193 
192 
191 
190 
189 ft ~ _ 
188 
187 
186 |-— 


175 
174 

171 
170 ¥- 
169 

168 


167 F-— TMS 
166 A18 
165 A19 
164 £ A20 
161 fF A21 
A22 
A23 


184} 
183 -—— 
181 

180 £ 
178 ¢— 
177 

176 {-— 


185 |-—— 


208 Lead SQFP 
Write-Back Enhanced 
IntelDX2™ Processor 


(Top View) 


53 
54 
55 
58 
—4 59 
===} 60 
——4 78 
479 


vss 

vec 
VSS 

NC (7.2.9 67 


TOO = 


024 


vSS c--— 157 
SRESET co 
SMIACT# 

vcc 

VSS 
WB/WT# C— 

SMM C—---) 65 

FERRS# C227729 66 
CACHE# 
IGNNE# ©" 

031 

D30 

vss 

029 

028 

vcc 

vss 

vcc 

D27 

D26 

D25 

vcc 


242202-5 


Figure 3-5. Package Diagram for 208-Lead SQFP of the Write-Back Enhanced IntelDX2™ Processor 
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Figure 3-6. Package Diagram for 208-Lead SQFP of the IntelDX4™ Processor 
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Table 3-3. Pinout Differences for 208-Lead SQFP Package 


| Write-Back 
Intel486™ SX Intel486 DX | = IntelDX2™ Enhanced IntelDX4™ 
Processor Processor Processor intelDX2 Processor 


Processor 


INC IGNNE # IGNNE# IGNNE# | IGNNE# 


NOTES: 

1. This pin location is for the Vccs pin on the IntelDX4 processor. For compatibility with 3.3V processors that have 5V safe 
input buffers (i.e., IntelIDX4 processors), this pin should be connected to a Vcc trace, not to the Vcc plane. See section 
3.2, “Quick Pin Reference,” for a description of the Vocs pin on the IntelDX4 processor. 

2. INC. Internal No Connect. These pins are not connected to any internal pad in Intel486 processors and OverDrive™ 
processors. However, new signals are defined for the location of the INC pins in the Intel486 processor proliferations. All 
INC pins defined by Intel have a specific use for jumperless single socket compatibility with current and future processors. 
A system design could connect any signal to an INC pin without affecting the operation of the processor. However, the 
purpose of a specific INC pin should be understood before it is used. If not, the system design will sacrifice the ability to 
implement a jumperless (single socket) flexible motherboard. 

3. NC. Do Not Connect. These pins should always remain unconnected. Connection of NC pins to Vcc or Vss or to any 
other signal can result in component malfunction or incompatibility with future steppings of the Intel486 processors. 
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Table 3-4. Pin Assignment for 208-Lead SQFP Package of the IntelDX2™ Processor 
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Table 3-4. Pin Assignment for 208-Lead SQFP Package of the IntelIDX2™ Processor nate 


Cas [-neser [00 [ow [se | nar [ame | are 
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NOTES: 

1. This pin location is for the Vocs pin on the IntelDX4™ processor. For compatibility with 3.3V processors that have 5V 
safe input buffers (i.e., IntelDX4 processors), this pin should be connected to a Vcc trace, not to the Voc plane. See 
section 3.2, “Quick Pin Reference,” for a description of the Vocs pin on the IntelDX4 processor. 

2. INC. Internal No Connect. These pins are not connected to any internal pad in Intel486 processors and OverDrive proces- 
sors. However, new signals are defined for the location of the INC pins in the Intel486 processor proliferations. All INC 
pins defined by Intel have a specific use for jumperless single socket compatibility with current and future processors. A 
system design could connect any signal to an INC pin without affecting the operation of the processor. However, the 
purpose of a specific INC pin should be understood before it is used. If not, the system design will sacrifice the ability to 
implement a jumperless (single socket) flexible motherboard. 

3. NC. Do Not Connect. These pins should always remain unconnected. Connection of NC pins to Vcc or Vsg or to any 
other signal can result in component malfunction or incompatibility with future steppings of the Intel486 processors. 
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196 Lead SQFP 


(Top View) 


242202-7 


Figure 3-7. Package Diagram for 196-Lead PQFP Package of the Intel486™ DX Processor 
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Table 3-5. Pinout Differences for 196-Lead PQFP Package 


Low Power Previous Low Power 
Intel486™ SX Intel486 SX Iintel486 DX Iintel486 DX 


Processor(3) | Processor Processor‘(3) Processor 


Previous 


Intel486 DX 
Processor 


Intel486 SX 
Processor 


Re 
. INC. Internal No Connect. These pins are not connected to any internal pad in Intel486™ processors and OverDrive™ 

processors. However, new signals are defined for the location of the INC pins in the Intel486 processor proliferations. All 
INC pins defined by Intel have a specific use for jumperless single socket compatibility with current and future processors. 
A system design could connect any signal to an INC pin without affecting the operation of the processor. However, the 
purpose of a specific INC pin should be understood before it is used. If not, the system design will sacrifice the ability to 
implement a jumperless (single socket) flexible motherboard. 

2. NC. Do Not Connect. These pins should always remain unconnected. Connection of NC pins to Vcc or Vss or to any 
other signal can result in component malfunction or incompatibility with future steppings of the Intel486 processors. 

3. Previous versions of the Intel486 processor family do not implement SL Technology and are not described in this data 
sheet. 
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Table 3-6. Pin Assignments for Intel486™ DX Processor 196-Lead PQFP Package 
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Table 3-6. Pin Assignments for intel4a6r™ DX Processor 196-Lead PQFP Package (Continued) 
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3.2 Quick Pin Reference 


The following is a brief pin description. For detailed signal descriptions refer to section 9.2, “Signal Descrip- 


tion.” 


Table 3-7. intel486™ Processor Pin Descriptions 


mf 


ADDRESS BUS 


A31-A4 Ne 
A2-A3 O 
BE0-3# ¥ 


DATA BUS 
\/O 


DATA PARITY 


DPO-DP3 


| 


CLocK provides the fundamental timing and the internal operating frequency for the 
Intel486 processor. All external timing parameters are specified with respect to the 
rising edge of CLK. | 


The Address Lines. A31-A2, together with the byte enables signals. BEO # — 
BE3#, define the physical area of memory or input/output space accessed. 
Address lines A31-A4 are used to drive addresses into the processor to perform 
cache line invalidations. Input signals must meet setup and hold times too and to3. 
A31-A2 are not driven during bus or address hold. 


The Byte Enable signals indicate active bytes during read and write cycles. During 
the first cycle of a cache fill, the external system should assume that all byte enables 
are active. BE3# applies to D24-D31, BE2# applies to D16—D23, BE1# applies to 
D8-—D15 and BEO# applies to DO-D7. BEO#-BE3# are active LOW and are not 
driven during bus hold. 


The Data Lines, DO-D7, define the least significant byte of the data bus while lines 
D24-D31 define the most significant byte of the data bus. These signals must meet 
setup and hold times tao and to3 for proper operation on reads. These pins are 
driven during the second and subsequent clocks of write cycles. 


There is one Data Parity pin for each byte of the data bus. Data parity is generated 
on all write data cycles with the same timing as the data driven by the Intel486 
processor. Even parity information must be driven back into the processor on the 
data parity pins with the same timing as read information to insure that the correct 
parity check status is indicated by the Intel486 processor. The signals read on these 
pins do not affect program execution. 


Input signals must meet setup and hold times too and tog. DPO-DP3 should be 
connected to Vcc through a pull-up resistor in systems that do not use parity. DPO- 
DP3 are active HIGH and are driven during the second and subsequent clocks of 
write cycles. 


Parity Status is driven on the PCHK # pin the clock after ready for read operations. 
The parity status is for data sampled at the end of the previous clock. A parity error 
is indicated by PCHK # being LOW. Parity status is only checked for enabled bytes 
as indicated by the byte enable and bus size signals. PCHK # is valid only in the 
clock immediately after read data is returned to the processor. At all other times 
PCHK # is inactive (HIGH). PCHK# is never floated. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


BUS CYCLE DEFINITION a 


The memory/input-output, data/control and write/read lines are the primary bus 
definition signals. These signals are driven valid as the ADS# signal is asserted. 


Bus Cycle Initiated — 

Interrupt Acknowledge 
Halt/Special Cycle 
I/O Read 

1 ie 1/O Write 

0 Code Read 

0 Reserved. 

1 Memory Read 

1 1 Memory Write 


The bus definition signals are not driven during bus hold and follow the timing of the 
address bus. Refer to section 10.2.11, “Special Bus Cycles,” for a description of the 
special bus cycles. 


The Bus Lock pin indicates that the current bus cycle is locked. The Intel486 
processor will not allow a bus hold when LOCK # is asserted (but address holds are 
allowed). LOCK # goes active in the first clock of the first locked bus cycle and goes 
inactive after the last clock of the last locked bus cycle. The last locked cycle ends 
when ready is returned. LOCK # is active LOW and is not driven during bus hold. 
Locked read cycles will not be transformed into cache fill cycles if KEN # is returned 
active. 


The Pseudo-Lock pin indicates that the current bus transaction requires more than | 
one bus cycle to complete. For the Intel486 processor, examples of such operations 
are segment table descriptor reads (64 bits), in addition to cache line fills (128 bits). 


For Intel486 processors with on-chip FPU, floating point long reads and write 
(64 bits) also require more than one bus cycle to complete. 


The Intel486 processor will drive PLOCK # active until the addresses for the last bus 
cycle of the transaction have been driven regardless of whether RDY # or BRDY # 
have been returned: Normally PLOCK# and BLAST # are inverse of each other. 
However during the first bus cycle of a 64-bit floating point write (for Intel486 
processors with on-chip FPU), both PLOCK# and BLAST # will be asserted. 


PLOCK # is a function of the BS8¥, BS16# and KEN# inputs. PLOCK # should be 
sampled only in the clock ready is returned. PLOCK # is active LOW and is not 
driven during bus hold. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


BUS CONTROL 


ADS# The Address Status output indicates that a valid bus cycle definition and address 
are available on the cycle definition lines and address bus. ADS # is driven active in 
the same clock as the addresses are driven. ADS # is active LOW and is not driven 


during bus hold. 


RDY # The Non-burst Ready input indicates that the current bus cycle is complete. RDY # 
indicates that the external system has presented valid data on the data pins in 
response to a read or that the external system has accepted data from the Intel486 
processor in response to a write. RDY # is ignored when the bus is idle and at the 
end of the first clock of the bus cycle. 

RDY # is active during address hold. wale can be returned to the processor while 
AHOLD is active. 

RDY # is active LOW, and is not provided with an internal pull-up resistor. RDY # 
must satisfy setup and hold times t;¢ and t;7 for proper chip operation. 


BURST CONTROL 


BRDY # The Burst Ready input performs the same function during a burst cycle that RDY # 
performs during a non-burst cycle. BRDY # indicates that the external system has 
presented valid data in response to a read or that the external system has accepted 
data in response to a write. BRDY # is ignored when the bus is idle and at the end of 
the first clock in a bus cycle. 

BRDY # is sampled in the second and subsequent clocks of a burst cycle. The data 
presented on the data bus will be strobed into the processor when BRDY # is 
sampled active. If RDY # is returned simultaneously with BRDY #, BRDY # is 
ignored and the burst cycle is prematurely aborted. 

BRDY # is active LOW and is provided with a small pull-up resistor. BRDY # must 


satisfy the setup and hold times ty, and t;7. 


BLAST # The Burst Last signal indicates that the next time BRDY # is returned the burst bus 
cycle is complete. BLAST # is active for both burst and non-burst bus cycles. 
BLAST # is active LOW and is not driven during bus hold. 


INTERRUPTS 


RESET The Reset input forces the Intel486 processor to begin execution at a known state. 
The processor cannot begin execution of instructions until at least 1 ms after Voc 
and CLK have reached their proper DC and AC specifications. The RESET pin 
should remain active during this time to insure proper processor operation. RESET is 
active HIGH. RESET is asynchronous but must meet setup and hold times tap and 
to for recognition in any specific clock. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


|_Symbol | Type . | Name and Function oo 


INTERRUPTS (Continued) 


The Maskable Interrupt indicates that an external interrupt has been generated. If. 
the internal interrupt flag is set in EFLAGS, active interrupt processing will be 
initiated. The Intel486 processor will generate two locked interrupt acknowledge bus 
cycles in response to the INTR pin going active. INTR must remain active until the 
interrupt acknowledges have been performed to; assure that the interrupt is 
recognized. 


INTR is active HIGH and is not provided with an internal pull-down resistor. INTR is 
asynchronous, but must meet setup and hold times tg9 and to1 for recognition in any 
specific clock. 


The Non-Maskable Interrupt request signal indicates that an external non- 
maskable interrupt has been generated. NMI is rising edge sensitive. NMI must be — 
held LOW for at least four CLK periods before this rising edge. NMI is not provided 

with an internal pull-down resistor. NMI is asynchronous, but must meet setup and | 
hold times top and to1 for recognition in any specific clock. 


The Soft Reset pin duplicates all the functionality of the RESET pin with the . 
following two exceptions: 

1. The SMBASE register will retain its previous value. 
2. If UP # (I) is asserted, SRESET will not have an effect on the host processor. 

For soft resets, SRESET should remain active for at least 15 CLK periods. SRESET 
is active HIGH. SRESET is asynchronous but must meet setup and hold times too 
and to; for recognition in any specific clock. 


SRESET 


SMI # The System Management Interrupt input is used to invoke the System 
Management Mode (SMM). SMI # is a falling edge triggered signal which forces the 
processor into SMM at the completion of the current instruction. SMI # is recognized 
on an instruction boundary and at each iteration for repeat string instructions. SMI # 
does not break LOCKed bus cycles and cannot interrupt a currently executing SMM. 
The processor will latch the falling edge of one pending SMI # signal while the 
processor is executing an existing SMI #. The nested SMI # will not be recognized 
until after the execution of a Resume (RSM) instruction. - 


SMIACT # The System Management Interrupt ACTive is an active low output, indicating that 
the processor is operating in SMM. It is asserted when the processor begins to 
execute the SMI# state save sequence and will remain active LOW until the 


processor executes the last state restore cycle out of SMRAM. 


STPCLK # The SToP CLockK request input signal indicates a request has been made to turn 

| off the CLK input. When the processor recognizes a STPCLK #, the processor will 
stop execution on the next instruction boundary, unless superseded by a higher 
priority interrupt, empty all internal pipelines and the write buffers and generate a 
Stop Grant acknowledge bus cycle. STPCLK # is active LOW and is provided with 
an internal pull-up resistor. STPCLK # is an asynchronous signal, but must 
remain active until the processor issues the Stop Grant bus cycle. STPCLK # 
may be de-asserted at any time after the processor has issued the Stop Grant 


bus cycle. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


Symbol Name and Function | 


BUS ARBITRATION 


a i! 


; 7 


The Bus Request signal indicates that the Intel486 processor has internally 
generated a bus request. BREQ is generated whether or not the Intel486 processor 
is driving the bus. BREQ is active HIGH and is never floated. 


The Bus Hold request allows another bus master complete control of the processor 
bus. In response to HOLD going active the Intel486 processor will float most of its 
output and input/output pins. HLDA will be asserted after completing the current bus 
cycle, burst cycle or sequence of locked cycles. The Intel486 processor will remain 
in this state until HOLD is de-asserted. HOLD is active high and is not provided with 
an internal pull-down resistor. HOLD must satisfy setup and hold times tyg and ty9 © 
for proper operation. 


Hold Acknowledge goes active in response to a hold request presented on the 
HOLD pin. HLDA indicates that the Intel486 processor has given the bus to another 
local bus master. HLDA is driven active in the same clock that the Intel486 
processor floats its bus. HLDA is driven inactive when leaving bus hold. HLDA is 
active HIGH and remains driven during bus hold. 


The Backoff input forces the Intel486 processor to float its bus in the next clock. 
The processor will float all pins normally floated during bus hold but HLDA will not be 
asserted in response to BOFF #. BOFF # has higher priority than RDY # or BRDY #; 
if both are returned in the same clock, BOFF # takes effect. The processor remains 
in bus hold until BOFF # is negated. If a bus cycle was in progress when BOFF # 


7 ¥ 
BOFF # 
was asserted the cycle will be restarted. BOFF # is active LOW and must meet 
setup and hold times t;g and t1g for proper operation. 


CACHE INVALIDATION 
The Address Hold request allows another bus master access to the processor’s 
address bus for a cache invalidation cycle. The Intel486 processor will stop driving 
its address bus in the clock following AHOLD going active. Only the address bus will 
be floated during address hold, the remainder of the bus will remain active. AHOLD 
is active HIGH and is provided with a small internal pull-down resistor. For proper 


AHOLD 
operation AHOLD must meet setup and hold times t1g and tig. 


EADS # This signal indicates that a va/id External Address has been driven onto the 
Intel486 processor address pins. This address will be used to perform an internal 
cache invalidation cycle. EADS # is active LOW and is provided with an internal pull- 
up resistor. EADS # must satisfy setup and hold times ty. and t;3 for proper 


operation. 


CACHE CONTROL 
The Cache Enable pin is used to determine whether the current cycle is cacheable. 
When the Intel486 processor generates a cycle that can be cached and KEN # is — 
active one clock before RDY # or BRDY # during the first transfer of the cycle, the 
cycle will become a cache line fill cycle. Returning KEN # active one clock before 
RDY # during the last read in the cache line fill will cause the line to be placed in the 
on-chip cache. KEN # is active LOW and is provided with a small internal pull-up 


KEN # 
resistor. KEN # must satisfy setup and hold times t;4 and t;s5 for proper operation. 


FLUSH # The Cache Flush input forces the Intel486 processor to flush its entire internal 
cache. FLUSH # is active low and need only be asserted for one clock. FLUSH # is 
asynchronous but setup and hold times tag and to; must be met for recognition in 


any specific clock. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


PAGE CACHEABILITY 


The Page Write-Through and Page Cache Disable pins reflect the state of the 
page attribute bits, PWT and PCD, in the page table entry, page directory entry or 
control register 3 (CR3) when paging is enabled. If paging is disabled, the processor 
ignores the PCD and PWT bits and assumes they are zero for the purpose of 
caching and driving PCD and PWT pins. PWT and PCD have the same timing as the 
cycle definition pins (M/lIO#, D/C #, and W/R#). PWT and PCD are active HIGH 
and are not driven during bus hold. PCD is masked by the cache disable bit (CD) in 
Control Register 0. | 


BUS SIZE CONTROL 


The Bus Size 16 and Bus Size 8 pins (bus sizing pins) cause the Intel486 processor 
to run multiple bus cycles to complete a request from devices that cannot-provide or 
accept 32 bits of data in a single cycle. The bus sizing pins are sampled every clock. 


The state of these pins in the clock before ready is used by the Intel486 processor to 
determine the bus size. These signals are active LOW and are provided with internal 
pull-up resistors. These inputs must satisfy setup and hold times t;4 and ty5 for 
proper operation. 


ADDRESS MASK 
A20M # 


When the Address Bit 20 Mask pin is asserted, the Intel486 processor masks 
physical address bit 20 (A20) before performing a lookup to the internal cache or 
driving a memory cycle on the bus. A20M # emulates the address wraparound at 
one Mbyte, which occurs on the 8086 processor. A20M # is active LOW and should 
be asserted only when the processor is in real mode. This pin is asynchronous but 
should meet setup and hold times tog and to, for recognition in any specific clock. 
For proper operation, A20M# should be sampled high at the falling edge of RESET. 


TEST ACCESS PORT 


Test ClockK is an input to the Intel486 processor and provides the clocking function 
required by the JTAG Boundary scan feature. TCK is used to clock state information 
and data into component on the rising edge of TCK on TMS and TDI, respectively. 

Data is clocked out of the part on the falling edge of TCK and TDO. TCK is provided 
with an internal pull-up resistor. 


Test Data Input is the serial input used to shift JTAG instructions and data into 
component. TDI is sampled on the rising edge of TCK, during the SHIFT-IR and 
SHIFT-DR TAP controller states. During all other tap controller states, TDI is a 

“don’t care.” TDI is provided with an internal pull-up resistor. 


Test Data Output is the serial output used to shift JTAG instructions and data out of 

the component. TDO is driven on the falling edge of TCK during the SHIFT-IR and 
SHIFT-DR TAP controller states. At all other times TDO is driven to the high 

impedance state. 


Test Mode Select is decoded by the JTAG TAP (Tap Access Port) to select the. 
operation of the test logic. TMS is sampled on the rising edge of TCK. To guarantee 
deterministic behavior of the TAP controller TMS is provided with an internal puil-up 
resistor. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


_ Name and Function 


PERFORMANCE UPGRADE SUPPORT 


UP # 


The Upgrade Present input detects the presence of the upgrade processor, then 
powers down the core, and tri-states all outputs of the original processor, so that the 
original processor consumes very low current. UP # is active LOW and sampled at 

all times, including after power-up and during reset. 


NUMERIC ERROR REPORTING FOR INTEL486 DX, INTELDX2™, AND INTELDX4™ PROCESSORS 


7 a 


IGNNE # i 


The Floating point ERRor pin is driven active when a floating point error occurs. 
FERR # is similar to the ERROR # pin on the Intel887™ Math CoProcessor. 

FERR # is included for compatibility with systems using DOS type floating point error 
reporting. FERR# will not go active if FP errors are masked in FPU register. FERR # 
is active LOW, and is not floated during bus hold. 


When the IGNore Numeric Error pin is asserted the processor will ignore a numeric 


error and continue executing non-control floating point instructions, but FERR # will 
still be activated by the processor. When IGNNE # is de-asserted the processor will 
freeze on a non-control floating point instruction, if a previous floating point 
instruction caused an error. IGNNE # has no effect when the NE bit in control 
register 0 is set. IGNNE # is active LOW and is provided with a small internal pull-up 
resistor. IGNNE# is asynchronous but setup and hold times tap and to; must be met 
to insure recognition on any specific clock. 


WRITE-BACK ENHANCED INTELDX2 PROCESSOR SIGNAL PINS 


CACHE # ¥ 
FLUSH # 7 


The CACHE # output indicates internal cacheability on read cycles and burst write- 
back on write cycles. CACHE # is asserted for cacheable reads, cacheable code 
fetches and write-backs. It is driven inactive for non-cacheable reads, I/O cycles, 
special cycles, and write-through cycles. 


Cache FLUSH # is an existing pin that operates differently if the processor is 
configured as Enhanced Bus mode (write-back). FLUSH # will cause the processor 
to write back all modified lines and flush (invalidate) the cache. FLUSH # is 
asynchronous, but must meet setup and hold times tap and to, for recognition in any 
specific clock. 


The Hit/Miss to a Modified Line pin is a cache coherency protocol pin that is 
driven only in Enhanced Bus mode. When a Snoop cycie is run, HITM # indicates 
that the processor contains the snooped line and that the line has been modified. 
Assertion of HITM# implies that the line will be written back in its entirety, unless the 
processor is already in the process of doing a replacement write-back of the same 
line. 


The Invalidation Request pin is a cache coherency protocol pin that is used only in 
the Enhanced Bus mode. It is sampled by the processor on EADS #-driven snoop 
cycles. It is necessary to assert this pin to get the effect of the processor invalidate 
cycle on write-through-only lines. INV also invalidates the write-back lines. However, 
if the snooped line is modified, the line will be written back and then invalidated. INV 
must satisfy setup and hold times t;2 and t;3 for proper operation. 
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Table 3-7. Intel486™ Processor Pin Descriptions (Continued) 


Symbol i; Name and Function 


WRITE-BACK ENHANCED INTELDX2 PROCESSOR SIGNAL PINS (Continued) 


PLOCK # / 


In the Enhanced bus mode, Pseudo-Lock Output is always driven inactive. In this 
mode, a 64-bit data read (caused by an FP operand access or a segment descriptor 
read) is treated as a multiple cycle read request, which may be a burst or a non-burst 
access based on whether BRDY # or RDY # is returned by the system. Because 

only write-back cycles (caused by Snoop write-back or replacement write-back) are 
write burstable, a 64-bit write will be driven out as two non-burst bus cycles. 

BLAST # is asserted during both writes. Refer to the Bus Functional Description 
section 10.3 for details on Pseudo-Locked bus cycles. 


For the Write-Back Enhanced IntelDX2 processor, Soft RESET operates similar to 
other Intel486 processors. On SRESET, the internal SMRAM base register retains 
its previous value, does not flush, write-back or disable the internal cache. Because 
SRESET is treated as an interrupt, it is possible to have a bus cycle while SRESET is 
asserted. SRESET is serviced only on an instruction boundary. SRESET is 
asynchronous but must meet setup and hold times ta9 and ta; for recognition in any 
specific clock. 


The Write-Back/Write-Through pin enables Enhanced Bus mode (write-back 
cache). It also defines a cached line as write-through or write-back. For cache 
configuration, WB/WT # must be valid during RESET and be active for at least two 
clocks before and two clocks after RESET is de-asserted. To define write-back or 


SRESET 7 
WB/WT # 
write-through configuration of a line, WB/WT # is sampled in the same clock as the 
; first RDY # or BRDY # is returned during a line fill (allocation) cycle. 


INTELDX4 PROCESSOR CLKMUL, VCC5, AND VOLDET 


CLKMUL — a The CLocK MULtiplier input, defined during device RESET, defines the ratio of 


internal core frequency to external bus frequency. If sampled low, the core 
frequency operates at twice the external bus frequency (speed doubled mode). If 
wii 
VOLDET ¥ 


driven high or left floating, speed triple mode is selected. CLKMUL has an internal 
pull-up speed to Vcc and may be left floating in designs that select speed tripled 
clock mode. 


The 5V reference voltage input is the reference voltage for the 5V-tolerant I/O 
buffers. This signal should be connected to +5V +5% for use with 5V logic. If all 
inputs are from 3V logic, this pin should be connected to 3.3V. 


A VOLtage DETect signal allows external system logic to distinguish between a 5V 
Intel486 processor and the 3.3V IntelDX4 processor. This signal is active low for a 
3.3V IntelDX4 processor. This pin is available only on the PGA version of the 
IntelDX4 processor. 
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Table 3-8. Output Pins 


[Name [Active Level| When Floated _ 
ferco | WH | 
fawn | 


W/R#, HIGH/LOW | Bus Hold 
M/lIO#, D/C# 


Pouce [tow 
Ferret) [tow [ 
fsmacte@ [iow | 
fvouwer? [tow f 


NOTES: 

1. Present on the Intel486™ Dx, IntelDX2™, and 
IntelDX4™ processors only. 

2. Not present in the 50-MHz intel486 DX processor. 

3. Present on the Write-Back Enhanced IntelDX2 proces- 
sor only. 

4. Present on the IntelDX4 processor only. 
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Table 3-9. Input/Output Pins() 


[Name | Active Level | When Foated 


NOTE: 
1. All input/output signals are floated when UP # is assert- 
ed. . 


Table 3-10. Test Pins 


NOTE: 
1. The test pins are not present on the Intel486 SX™ proc- | 
essor in the PGA package. 
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Table 3-11. Input Pins 


Level Asynchronous Pull-Down 
ae 
CLK2(1) | 
reset | HIGH «| ——Aaynchronous | 
[sreser [ic | Aemmctvonous | PulkDown 


BS16#, Synchronous Pull-Up | 
BS8 # 


a 
wi 

a 
ee 
ee 
i = 


CLKMUL # (§) 


NOTES: 

1. CLK2 is present on 2X clock mode Intel486™ SX and Intel486 DX processors. 
2. Present on the Intel486 DX, IntelDX2™, and IntelDX4™ processors only. 

3. Not present in the 50-MHz Intel486 DX processor. 

4. The test pins are not present on the Intel486 SX processor in the PGA package. 
5. Present on the Write-Back Enhanced IntelDX2 processor only. 

6. Present on the IntelDX4 processor only. 
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4.0 ARCHITECTURAL OVERVIEW 


4.1 Introduction 


The Intel486 processor family is a 32-bit architecture 
with on-chip memory management, floating point, 
and cache memory units. Figure 4-1 is a block dia- 
gram of the Intel486 processor family. The Intel486 
processor contains all the features of the Intel886™ 
processor with enhancements to increase perform- 
ance. 


The Intel486 processor instruction set includes the 
complete Intel386 processor instruction set along 
with extensions to serve new applications and in- 
crease performance. The on-chip memory manage- 
ment unit (MMU) is completely compatible with the 
Intel386 processor MMU. Software written for previ- 


ous members of the Intel architecture family will run — 


on the Intel486 processor without any modifications. 


On-chip cache memory allows frequently used data 
and code to be stored on-chip reducing accesses to 
the external bus. RISC design techniques reduce in- 
struction cycle times. A burst bus feature enables 
fast cache fills. 


The memory management unit (MMU) consists of a 
segmentation unit and a paging unit. Segmentation 
allows management of the logical address space by 
providing easy data and code relocatibility and effi- 
cient sharing of global resources. The paging mech- 
anism operates beneath segmentation and is trans- 
parent to the segmentation process. Paging is op- 
tional and can be disabled by system software. Each 
segment can be divided into one or more 4-Kbyte 
segments. To implement a virtual memory system, 
full restartability for all page and segment faults is 
supported. 


Memory is organized into one or more variable 
length segments, each up to four Gbytes (232 bytes) 
in size. A segment can have attributes associated 
with it which include its location, size, type (i.e., 
stack, code or data), and protection characteristics. 
Each task on an Intel486 processor can have a max- 
imum of 16,381 segments and each are up to four 
Gbytes in size. Thus, each task has a maximum of 
64 terabytes (trillion bytes) of virtual memory. 


Intel486™ PROCESSOR FAMILY 


The segmentation unit provides four levels of pro- 
tection for isolating and protecting applications and 
the operating system from each other. The hardware 
enforced protection allows the design of systems 
with a high degree of software integrity. 


The Intel486 processor has two modes of operation: 
Real Address Mode (Real Mode) and Protected 
Mode Virtual Address Mode (Protected Mode). In 
Real Mode the Intel486 processor operates as a 
very fast 8086. Real Mode is required primarily to set 
up the Intel486 processor for Protected Mode oper- 
ation. Protected Mode provides access to the so- | 
phisticated memory management paging and privi- 

lege capabilities of the processor. 


Within Protected Mode, software can perform a task 
switch to enter into tasks designated as Virtual 8086 
Mode tasks. Each Virtual 8086 task behaves with 
8086 semantics, allowing 8086 processor software 
(an application program or an entire operating sys- 
tem) to execute. : 


System Management Mode (SMM) provides the sys- 
tem designer with a means of adding new software 
controlled features to their computer products that 
always operate transparently to the Operating Sys- 
tem (OS) and software applications. SMM is intend- 
ed for use only by system firmware, not by applica- 
tions software or general purpose systems software. 


The on-chip cache is 16 Kbytes in size for the 
IntelDX4 processor and 8 Kbytes in size for all other 
members of the Intel486 processor family. It is 4- 
way set associative and follows a write-through poli- 
cy. The on-chip cache includes features to provide 
flexibility in external memory system design. Individ- 
ual pages can be designated as cacheable or non- 
cacheable by software or hardware. The cache can 
also be enabled and disabled by software or hard- 
ware. The Write-Back Enhanced IntelDX2 processor 
can be set to use an on-chip write-back cache polli- 
cy. : 


The Intel486 processor also has features that facili- 
tate high-performance hardware designs. The 1X 
bus clock input eases high-frequency board-level 
designs. The clock multiplier on IntelSX2, IntelDX2, 
and IntelDX4 processors improves execution per- 
formance without increasing board design complexi- 
ty. The clock multiplier enhances all operations op- 
erating out of the cache and/or not blocked by ex- 
ternal bus accesses. The burst bus feature enables 
fast cache fills. 
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3 64-Bit Interunit Transfer Bus 32-Bit Data Bus 


iy 32-Bit Data Bus 3 


tation 
Index 


Bus . 
Register File [-—7—5 rapes 


Translation 
Lookaside 
Buffer 


Displacement Bus 


Micro-instruction 


jecoecsoreveccosn, 
Hy 
e 


Floating Point}; 
Unit ; 
a ae 
* | Floating Point} ; | Control ROM | instruction 
Register File |: Path 


Srecennscccossnscescorenseoesssseresecosed 


Protect i a 


ROIES: 
* Available on intel486™ DX, IntelOX2™ and IntelDX4™ processors. 


Data Bus 
Transceivers 

ADS# WiR# D/C# M/O# PCD 
PWT ROY# LOCK# PLOCK# 
BOFF# AZ20M# BREQ HOLD 
HLDA RESET SRESET INTR 
NM! SMIi# SMIACT# FERR#* 
IGNNE#* STPCLK# 


BRDY# BLAST# 


BS16# BS&# 


“| KEN# FLUSH# AHOLD 
EADS# 


CACHE# HITM# 
WBMT TT 


** Available on all intel486 processors except 158-pin PGA Intel486 SX processors. 


t Available only on the inteiDX4 processor. 
tt Available only on the Write-Back Enhanced InteliDX2 processor. 
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Figure 4-1. Intel486™ Processor Block Diagram 


4.1.1 INTEL486 DX, INTELDX2™, AND 
INTELDX4™ PROCESSOR ON-CHIP 
FLOATING POINT UNIT 


The Intel486 DX, IntelDX2, and IntelDX4 processors 
incorporate the basic Intel486 processor 32-bit ar- 
chitecture with on-chip memory management and 
cache memory units. They also have an on-chip 
floating point unit (FPU) that operates in parallel with 
the arithmetic and logic unit. The FPU provides arith- 
metic instructions for a variety of numeric data types 
and executes numerous built-in transcendental func- 
tions (e.g., tangent, sine, cosine, and log functions). 


The floating point unit fully conforms to the ANSI/ — 


IEEE standard 754-1985 for floating point arithmetic. 


All software: written for the Intel386 processor, 
Intel887 math coprocessor and previous members 
of the 86/87 architectural family will run on these 
processors without any modifications. 
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4.1.2 UPGRADE POWER DOWN MODE 


Upgrade Power Down Mode on the Intel486 proces- 
sor is initiated by the Intel OverDrive processor using 
the UP# (upgrade present) pin. Upon sensing the 
presence of the Intel OverDrive Processor, the 
Intel486 processor tri-states its outputs and enters — 
the “Upgrade Power Down Mode,” lowering its pow- 
er consumption. The UP# pin of the Intel486 proc- 
essor is driven active (low) by the UP# pin of the 
Intel OverDrive processor. 


4.2 Register Set 
The Intel486 processor register set can be eee into 
the following categories: 
e Base Architecture Registers . 

— General Purpose Registers 

_— Instruction Pointer - 
— Flags Register 
— Segment Registers 


intel. 


e Systems Level Registers 

— Control Registers 

— System Address Registers 
e Debug and Test Registers 


The base architecture and floating point registers 
(see below) are accessible by the applications pro- 
gram. The system level registers can only be ac- 
cessed at privilege level 0 and used by system level 
programs. The debug and test registers also can 
only be accessed at privilege level 0. 


4.2.1 FLOATING POINT REGISTERS 


In addition to the registers listed above, the Intel486 
DX, IntelDX2, and IntelDX4 processors also have 
the following: 


e Floating Point Registers 
— Data Registers 
— Tag Word 
— Status Word 
— Instruction and Data Pointers 
— Control Word 


4.2.2 BASE ARCHITECTURE REGISTERS 


Figure 4-2 shows the Intel486 processor base archi- 
tecture registers. The contents of these registers are 
task-specific and are automatically loaded with a 
new context upon a task switch operation. 


The base architecture includes six directly accessi- 
ble descriptors, each specifying a segment up to 4 
Gbytes in size. The descriptors are indicated by the 
selector values placed in the Intel486 processor 
segment registers. Various selector values can be 
loaded as a program executes. 


The selectors are also task-specific, so the segment 
registers are automatically loaded with new context 
upon a task switch operation. 


NOTE: 
In register descriptions, “‘set’” means “‘set to 
1,” and “reset’’ means “reset to 0.” 
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General Purpose Registers 
24,23 1615 


Segment Registers 


Code Segment 
Stack Segment 


Data Segments 


15 0 
cS 
Ss 
DS 
ES 
FS 
GS 


Instruction Pointer 
31 16 15 .@) 


EP 
Flags Register 


FLAGS EFLAGS 
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Figure 4-2. Base Architecture Registers 


4.2.2.1 General Purpose Registers © 


The eight 32-bit general purpose registers are 
shown in Figure 4-2. These registers hold data or 
address quantities. The general purpose registers 
can support data operands of 1, 8, 16 and 32 bits, 
and bit fields of 1 to 32 bits. Address operands of 16 
and 32 bits are supported. The 32-bit registers are 
named EAX, EBX, ECX, EDX, ESI, EDI, EBP and 
ESP. 


The least significant 16 bits of the general purpose 
registers can be accessed separately by using the 
16-bit names of the registers AX, BX, CX, DX, Sl, DI, 
BP and SP. The upper 16 bits of the register are not 
changed when the lower 16 bits are accessed sepa- 
rately. 
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Finally, 8-bit operations can individually access the 
lower byte (bits O-—7) and the highest byte (bits 8- 
15) of the general purpose registers AX, BX, CX and 
DX. The lowest bytes are named AL, BL, CL and DL 
respectively. The higher bytes are named AH, BH, 
CH and DH respectively. The individual byte acces- 
sibility offers additional flexibility for data operations, 
but is not used for effective address calculation. 


4.2.2.2 Instruction Pointer 


The instruction pointer shown in Figure 4-2 is a 32- 
bit register named EIP. EIP holds the offset of the 
next instruction to be executed. The offset is always 
relative to the base of the code segment (CS). The 
lower 16 bits (bits 0-15) of the EIP contain the 16-bit 
instruction pointer named IP, which is used for 16-bit 
addressing. | 


intel. 


The flags register is a 32-bit register named 
EFLAGS. The defined bits and bit fields within 
EFLAGS control certain operations and indicate 
status of the Intel486 processor. The lower 16 bits 
(bit O-15) of EFLAGS contain the 16-bit register 
named FLAGS, which is most useful when executing 
8086 and 80286 processor code. EFLAGS is shown 
in Figure 4-3. 


4.2.2.3 Flags Register 


EFLAGS bits 1, 3, 5, 15 and 22-31 are defined as 


“Intel Reserved.” When these bits are stored during 


interrupt processing or with a PUSHF instruction 
(push flags onto stack), a one is stored in bit 1 and 
zeros in bits 3, 5, 15 and 22-31. 


FLAGS 


332222222222i1%1%1%141%1131+%118~«'1 . 
10987654321098765432109876543210 7 


EFLAGS 


Identification Flag 
Virtual Interrupt Pending 
Virtual Interrupt Flag 
Alignment Check 
Virtual Mode 
Resume Flag 
Nested Task Flag 
i/O Privilege Level 
Overflow 

Direction Flag 
Interrupt Enable 


Q] indicates Intel Reserved; Do not define. - 


NOTE: 
See section 4.2.7 “Compatibility.” 


VIVJAIVIR[ GIN] oP fojot t{TIs|Z} JAE. Pr. 
ple(CIMPEL IT] & TFVFIFIFIFVFLoTF| clr] 


=z 


Cc 
vF 


Carry Flag 
Parity Flag 
Auxiliary lag 
Zero Flag 
Sign Flag 

~ Trap Flag 
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Figure 4-3. Flag Registers 
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ID _— (Identification Flag, bit 21) 


The ability of a program to set and clear the ID 
flag indicates that the processor supports the 
CPUID instruction. (Refer to section 13, ‘“In- 
struction Set Summary,” and Appendix B, 
‘Feature Determination: CPUID Instruction.’’) 


VIP = (Virtual Interrupt Pending Flag, bit 20) 


The VIP flag together with the VIF enable 
each applications program in a multitasking 
environment to have virtualized versions of 
the system’s IF flag. For more on the use of 
this flag in virtual-8086 mode and in protected 
mode. (Refer to Appendix A, ‘“Advanced Fea- 
tures.’’) 


VIF (Virtual Interrupt Flag, bit 19) 


The VIF is a virtual image of IF (the interrupt 
flag) used with VIP. For more on the use of 
this flag in virtual-8086 mode and in protected 
mode. (Refer to Appendix A, ‘Advanced Fea- 
tures.”’) 


AC (Alignment Check, bit 18) 


The AC bit is defined in the upper 16 bits of 
the register. It enables the generation of faults 
if a memory reference is to a misaligned ad- 
dress. Alignment faults are enabled when AC 
is set to 1. A misaligned address is a word 
access an odd address, a dword access to an 
address that is not on a dword boundary, or 
an 8-byte reference to an address that is not 
on a 64-bit word boundary. (See section 
10.1.5, “Operand Alignment.”) 


Alignment faults are only generated by pro- 


‘Table 4-1. Data Type Alignment Requirements 


Memory Access Alignment 
J (Byte Boundary) 
2 


wee 


Single Precision 
Real 


Double Precision 
Real 


Extended Precision 
Real 


Selector 


48-Bit Segmented 
Pointer 


32-Bit Segmented 2 | 
Pointer 
Des sipter" 


Descriptor” 


FSTENV/FLDENV 4/2 (On Operand Size) 
Save Area 
FSAVE/FRSTOR 4/2 (On Operand Size) 


Save Area 


[sisting [ed 


grams running at privilege level 3. The AC bit 
setting is ignored at privilege levels 0, 1 and 2. 
Note that references to the descriptor tables 
(for selector loads), or the task state segment 
(TSS), are implicitly level 0 references even if 
the instructions causing the references are 
executed at level 3. Alignment faults are re- 
ported through interrupt 17, with an error code 
of 0. Table 4-1 gives the alignment required 
for the Intel486 processor data types. 


IMPLEMENTATION NOTE: 

Several instructions on the Intel486 proces- 
sor generate misaligned references, even if 
their memory address is aligned. For exam- 
ple, on the Intel486 processor, the SGDT/ 
SIDT (store global/interrupt descriptor table) 
instruction reads/writes two bytes, and then 
reads/writes four bytes from a “pseudo-de- 
scriptor’ at the given address. The Intel486 
processor will generate misaligned refer- 
ences unless the address is on a 2 mod 4 
boundary. The FSAVE and FRSTOR instruc- 
tions (floating point save and restore state) 
will generate misaligned references for one- 
half of the register save/restore cycles. The 
Intel486 processor will not cause any AC 
faults if the effective address given in the in- 
struction has the proper alignment. 
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VM 


RF 


NT 
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(Virtual 8086 Mode, bit 17) 


The VM bit provides Virtual 8086 Mode with- 
in Protected Mode. If set while the Intel486 
processor is in Protected Mode, the Intel486 
processor will switch to Virtual 8086 opera- 


tion, handling segment loads as the 8086 . 
processor does, but generating exception 


13 faults on privileged opcodes. The VM bit 
can be set only in Protected Mode, by the 
IRET instruction (if current privilege level = 
0) and by task switches at any privilege lev- 
el. The VM bit is unaffected by POPF. 
PUSHF always pushes a 0 in this bit, even if 
executing in Virtual 8086 Mode. The 
EFLAGS image pushed during interrupt pro- 
cessing or saved during task switches will 
contain a 1 in this bit if the interrupted code 
was executing as a Virtual 8086 Task. 


(Resume Flag, bit 16) 


The RF flag is used in conjunction with the 
debug register breakpoints. It is checked at 
instruction boundaries before breakpoint 
processing. When FF is set, it causes any 
debug fault to be ignored on the next in- 
struction. RF is then automatically reset at 


the successful completion of every instruc- 


tion (no faults are signaled) except the IRET 
instruction, the POPF instruction, (and JMP, 


- CALL, and INT instructions causing a task 


switch). These instructions set RF to the val- 
ue specified by the memory image. For ex- 
ample, at the end of the breakpoint service 
routine, the IRET instruction can pop an 
EFLAG image having the RF bit set and re- 
sume the program’s execution at the break- 
point address without generating another 
breakpoint fault on the same location. 


(Nested Task, bit 14) 


The flag applies to Protected Mode. NT is 
set to indicate that the execution of this task 
is within another task. If set, it indicates that 
the current nested task’s Task State Seg- 
ment (TSS) has a valid back link to the previ- 
ous task’s TSS. This bit is set or reset by 
control transfers to other tasks. The value of 
NT in EFLAGS is tested by the IRET instruc- 
tion to determine whether to do an inter-task 
return or an intra-task return. A POPF or an 
IRET instruction will affect the setting of this 
bit according to the image popped, at any 
privilege level. 


lOPL 


OF 


DF 


TF 
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(Input/Output Privilege Level, bits 12-13) 


This two-bit field applies to Protected Mode. 
IOPL indicates the numerically maximum 
CPL (current privilege level) value permitted 
to execute I/O instructions without generat- 
ing an exception 13 fault or consulting the 
I/O Permission Bitmap. It also indicates the 
maximum CPL value allowing alteration of 
the IF (INTR Enable Flag) bit when new val- 
ues are popped into the EFLAG register. 
POPF and IRET instruction can alter the 
IOPL field when executed at CPL = 0. Task 
switches can always alter the IOPL field, 
when the new flag image is loaded from the 
incoming task’s TSS. 


(Overflow Flag, bit 11) 


is set if the operation resulted in a signed 
overflow. Signed overflow occurs when the 
operation resulted in carry/borrow into the 
sign bit (high-order bit) of the result but did 
not result in a carry/borrow out of the high- 
order bit, or vice-versa. For 8-, 16-, 32-bit 
operations, OF is set according to overflow 
at bit 7, 15, 31, respectively. 


(Direction Flag, bit 10) 


DF defines whether ESI and/or EDI regis- 
ters post decrement or post increment dur- 
ing the string instructions. Post increment 
occurs if DF is reset. Post decrement occurs 
if DF is set. 


(INTR Enable Flag, bit 9) 


IF flag, when set, allows recognition of ex- 
ternal interrupts signaled on the INTR pin. 
When IF is reset, external interrupts signaled 
on the INTR are not recognized. IOPL indi- 
cates the maximum CPL value allowing al- 
teration of the IF bit when new values are 
popped into EFLAGS or FLAGS. : 


(Trap Enable Flag, bit 8) 


TF controls the generation of exception 1 
trap when single-stepping through code. 
When TF is set, the Intel486 processor gen- 
erates an exception 1 trap after the next in- 
struction is executed. When TF is reset, ex- 
ception 1 traps occur only as a function of 
the breakpoint addresses loaded into debug 
registers DRO-—DR3. 
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SF (Sign Flag, bit 7) 4.2.2.4 Segment Registers 
SF is set if the high-order bit of the result is ; 
set, it is reset otherwise. For 8-, 16-, 32-bit Six 16-bit segment registers hold segment selector 
operations, SF reflects the state of bit 7, 15, values identifying the currently addressable memory 
31 respectively. segments. In protected mode, each segment may 
; range in size from one byte up to the entire linear 
ZF (Zero Flag, bit 6) | and physical address space of the machine, 4 
ZF is set if all bits of the result are 0. Other- Gbytes (232 bytes). In real address mode, the maxi- 
wise, it is reset. ; mum segment size is fixed at 64 Kbytes (216 bytes). 
a ene mae Flag, ei a The six addressable segments are defined by the 
The Auxiliary Flag is used to simplify the ad- segment registers CS, SS, DS, ES, FS and GS. The 
dition and subtraction of packed BCD quan- —_— selector in CS indicates the current code segment: 
tities. AF is set if the operation resulted ina —_the selector in SS indicates the current stack seg- 
carry out of bit 3 (addition) or a borrow into ment; the selectors in DS, ES, FS and GS indicate 
bit 3 (subtraction). Otherwise, AF is reset. the current data segments. 
AF is affected by carry out of, or borrow into 
bit 3 only, regardless of overall operand 
length: 8, 16 or 32 bits. 4.2.2.5 Segment Descriptor Cache Registers 
PF (Parity Flags, bit 2) The segment descriptor cache registers are not pro- 
PF is set if the low-order eight bits of the grammer visible, yet it is very useful to understand 
operation contains an even number of “1’s” their content. A programmer invisible descriptor 
(even parity). PF is reset if the low-order —— cache register is associated with each programmer- 
eight bits have odd parity. PF is afunction of visible segment register, as shown by Figure 4-4. 
only the low-order eight bits, regardless of Each descriptor cache register holds a 32-bit base 
operand size. address, a 32-bit segment limit, and the other neces- 
CF (Carry Flag, bit 0) sary segment attributes. 


CF is set if the operation resulted in a carry 
out of (addition), or a borrow into (subtrac- 
tion) the high-order bit. Otherwise, CF is re- 
set. For 8-, 16- or 32-bit operations, CF is 
set according to carry/borrow at bit 7, 15 or 
31, respectively. 


Segment Registers Descriptor Registers (Loaded Automatically) 
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Figure 4-4. intel486™ Processor Segment Registers and Associated Descriptor Cache Registers 
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When a selector value is loaded into a segment reg- 
ister, the associated descriptor cache register is au- 
tomatically updated with the correct information. In 


_ Real Mode, only the base address is updated direct- - 


ly (by shifting the selector value four bits to the left), 
because the segment maximum limit and attributes 


are fixed in Real Mode. In Protected Mode, the base - 


address, the limit, and the attributes are all updated 
per the contents of the segment descriptor indexed 
by the selector. . 


Whenever a memory reference occurs, the segment 
descriptor cache register associated with the seg- 
ment being used is automatically involved with the 
memory reference. The 32-bit segment base ad- 
dress becomes a component of the linear address 
calculation, the 32-bit limit is used for the limit-check 
operation, and the attributes are checked against 
the type of memory reference requested. 
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Figure 4-5 illustrates the system level registers, 
which are the control operation of the on-chip 
cache, the on-chip floating point unit (on the Intel486 
DX, IntelDX2, and IntelDX4 processors) and the seg- 
mentation and paging mechanisms. These registers 
are only accessible to programs running at privilege 
level 0, the highest privilege level. 


4.2.3 SYSTEM LEVEL REGISTERS 


The system level registers include three control reg- 
isters and four segmentation base registers. The 
three control registers are CRO, CR2 and CR3. CR1 
is reserved for future Intel processors. The four seg- 
mentation base registers are the Global Descriptor 
Table Register (GDTR), the Interrupt Descriptor Ta- 
ble Register (IDTR), the Local Descriptor Table Reg- 
ister (LDTR) and the Task State Segment Register 
(TR). 


Page Fault Linear Address Register 


Page Directory Base Register 


47 32-Bit Linear Base Address 1615 Limit 


0 


poccistes alan Segment Registers 


32- PF ee ee ee Linear Base Address 


Descriptor Se (Loaded Automatically) 


20-Bit Segment Limit Attributes 
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Figure 4-5. System Level Registers 


2-50 


intel : Intel486™ PROCESSOR FAMILY 


1 indicates Intel Reserved; Do not define. 
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NOTE: 
See Section 4.2.7 “Compatibility”. 


Figure 4-6. Control Register 0 


4.2.3.1 Control Registers e On-Chip Cache Control Modes: CD, NW 
(Table 4-3) 
Control Register 0 (CRO) © On-Chip Floating Point Unit: NE, TS, EM, TS (Ta- 


bles 4-4, 4-5, and 4-6). (Also applies for Intel486 


CRO, shown in Figure 4-6, contains 10 bits for con- SX and IntelSX2 processors.) 


trol and status purposes. The function of the bits in 
CRO can be categorized as follows: e Alignment Check Control: AM 


e Intel486 Processor Operating Modes: PG, PE e Supervisor Write Protect: WP 
(Table 4-2) 


Table 4-2. Intel486™ Processor Operating Modes 


oe on REAL Mode. Exact 8086 processor semantics, with 32-bit extensions available with prefixes. 
4 


Protected Mode. Exact 80286 processor semantics, plus 32-bit extensions through both prefixes 


and “‘default’’ prefix setting associated with code segment descriptors. Also, a sub-mode is 
defined to support a virtual 8086 processor within the context of the extended 80286 processor 
protection model. 


UNDEFINED. Loading CRO with this combination of PG and PE bits will raise a GP fault with error 
code 0. 

Paged Protected Mode. All the facilities of Protected mode, with paging enabled underneath 
segmentation. 


Table 4-3. On-Chip Cache Control Modes 


co[nw[S~S~*~<“~*~“‘~*~*«S rating Mode 
71 | 1+ | Cache ie abled witertrough andinvldetesdeabled 
710 | Cache fils disabled, witestrough andinvadetes enabled. 
To [1 | NVALI. CRO%s loaded with this conigraton of bits, a GP ful wih oor codes raued: 
To | 0 [Cache fits enabled witestrough andinvaldetes enabled 
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The low-order 16 bits of CRO are also known as the 
Machine Status Word (MSW), for compatibility with 


the 80286 processor protected mode. LMSW and | 


~ SMSW (load and store MSW) instructions are taken 
as special aliases of the load and store CRO opera- 
tions, where only the low-order 16 bits of CRO are 


involved. The LMSW and SMSW instructions in the. 


Intel486 processor work in an identical fashion to 
the LMSW and SMSW instructions in the 80286 
processor (i.e., they only operate on the low-order 
16 bits of CRO and ignores the new bits). New 
Intel486 processor operating systems should use 
the MOV CRO, Reg instruction. 
| NOTE: 
All Intel3886 and Intel486 processor CRO bits, 
except for ET and NE, are upwardly compati- 
ble with the 80286 processor, because they 
are in register bits not defined in the 80286 
processor. For strict compatibility with the 
80286 processor, the load machine status 
word (LMSW) instruction is defined to not 
change the ET or NE bits. 


The defined CRO bits are described below. 
PG (Paging Enable, bit 31) 


PG bit is used to indicate whether paging is 
enabled (PG= 1) or disabled (PG=0). (See 
Table 4-2.) 


CD (Cache Disable, bit 30) 


The CD bit is used to enable the on-chip 
cache. When. CD= 1, the cache will not be 
filled on cache misses. When CD=0, cache 
fills may be performed on misses. (See Ta- 
ble 4-3.) 


The state of the CD bit, the cache enable 
input pin (KEN#), and the relevant page 
cache disable (PCD) bit determine if a line 
read in response to a cache miss will be 
installed in the cache. A line is installed in 
the cache only if CD=0O and KEN# and 
PCD are both zero. The relevant PCD bit 
comes from either the page table entry, 
page directory entry or control register 3. 
(Refer to section 7.6, “Page Cacheability.’’) 


CD is set to one after RESET. 
NW (Not Write-Through, bit 29) 


The NW bit enables on-chip cache write- 
throughs and _ write-invalidate cycles 
(NW = 0). 
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AM 


_ WP 


| 

intel. 
When NW=0, all writes, including cache 
hits, are sent out to the pins. Invalidate cy- 
cles are enabled when NW=0. During an 
invalidate cycle a line will be removed from 


the cache if the invalidate address hits in 
the cache. (See Table 4-3.) 


When NW =1, write-throughs and write-in- 
validate cycles are disabled. A write will not 
be sent to the pins if the write hits in the 
cache. With NW=1 the only write cycles 
that reach the external bus are cache miss- 
es. Write hits with NW=1 will never update 
main memory. Invalidate cycles are ignored 
when NW =1. 


(Alignment Mask, bit 18) 


The AM bit controls whether the alignment 
check (AC) bit in the flag register (EFLAGS) 
can allow an alignment fault. AM=O dis- 
ables the AC bit. AM=1 enables the AC bit. 
AM = O is the Intel386 processor compatible 
mode. 


Intel386 processor software may load incor- 
rect data into the AC bit in the EFLAGS reg- 
ister. Setting AM=O will prevent AC faults 
from occurring before the Intel486 proces- 
sor has created the AC interrupt service 
routine. 


(Write Protect, bit 16). 


WP protects read-only pages from supervi- 
sor write access. The Intel386 processor al- 
lows a read-only page to be written from 
privilege levels 0-2. The Intel486 processor 
are compatible with the Intel3886 processor 
when WP=0. WP=1 forces a fault on a © 
write to a read-only page from any privilege 
level. Operating systems with Copy-on- 
Write features can be supported with the 
WP bit. (Refer to section 6.4.3 “Page Level 
Protection (R/W, U/S Bits).’’) 


NOTE: 


Refer to Tables 4-4, 4-5, and 4-6 for values 


and interpolation of NE, EM, TS, and MP 
bits, in addition to the sections below. 


NE 


(Numerics Exception, bit 5) 

Intel486 SX and IntelSX2 Processor NE 
Bit | 

For Intel486 SX and IntelSX2 processors, 
interrupt 7 will be generated upon encoun- 


_ tering any floating point instruction regard- 


less of the value of the NE bit. It is recom- 
mended that NE = 1 for normal operation of 
the Intel486 processor. 


tel. 


intel486 DX, IntelDX2 and IntelDX4 
Processor NE Bit 


For Intel486 DX, IntelDX2, and IntelDX4 
processors, the NE bit controls whether un- 
masked floating point exceptions (UFPE) 
are handled through interrupt vector 16 
(NE=1) or through an external interrupt 
(NE=0). NE=0O (default at reset) supports 
the DOS operating system error reporting 
scheme from the 8087, Intel287 and 
Intel387 math coprocessors. In DOS sys- 
tems, math coprocessor errors are reported 
via external interrupt vector 13. DOS uses 
interrupt vector 16 for an operating system 
call. (Refer to sections 9.2.15, “Numeric Er- 
ror Reporting (FERR#, IGNNE#),” and 
10.2.14 ‘Floating Point Error Handling.’’) 


For any UFPE, the floating point error out- 
put pin (FERR#) will be driven active. 


For NE=0, the Intel486 DX, IntelDX2 and 
IntelDX4 processors work in conjunction 
with the ignore numeric error input 
(IGNNE#) and the FERR# output pins. 
When a UFPE occurs and the IGNNE # in- 
put is inactive, the Intel486 DX, IntelDX2, 
and IntelDX4 processors freeze immediate- 
ly before executing the next floating point 
instruction. An external interrupt controller 
will supply an interrupt vector when FERR # 
is driven active. The UFPE is ignored if 
IGNNE # is active and floating point execu- 
tion continues. 


NOTE: 


The freeze does not take place if the next 
instruction is one of the control instructions 


FNCLEX, FNINIT,. FNSAVE, 


FNSTENV, 


FNSTCW, FNSTSW, FNSTSW AX, FNENI, 
FNDISI and FNSETPM. The freeze does oc- 
cur if the next instruction is WAIT. 


For NE=1, any UFPE will result in a soft- 
ware interrupt 16, immediately before exe- 
cuting the next non-control floating point or 
WAIT instruction. The ignore numeric error 
input (IGNNE #) signal will be ignored. 


TS 


(Task Switch, bit 3) 


Intel486 SX and IntelSX2 
Processor TS Bit 


For Intel486 SX and IntelSX2 processors, 
the TS bit is set whenever a task switch op- 
eration is performed. Execution of floating 
point instructions with TS=1 will cause a 


EM 


MP 


Intel486T™ PROCESSOR FAMILY 


Device Not Available (DNA) fault (trap vec- 
tor 7). With MP=0O, the value of TS bitis a 
don’t care for the WAIT instructions, i.e., 
these instructions will not generate trap 7. 


intel486 DX, IntelDX2, and IntelDX4 
Processor TS Bit 


For Intel486 DX, IntelDX2, and IntelDX4 
processors, the TS bit is set whenever a 
task switch operation is performed. Execu- 
tion of floating point instructions with TS = 1 
will cause a Device Not Available (DNA) 
fault (trap vector 7). If TS=1 and MP=1 
(monitor coprocessor in CRO), a WAIT in- 
struction will cause a DNA fault. . 


(Emulate Coprocessor, bit 2) 


intel486 SX and IntelSX2 
Processor EM Bit 


For Intel486 SX and IntelSX2 processors, 
the EM bit should be set to one. This will 
cause the Intel486 SX and IntelSX2 proces- 
sors to trap via interrupt vector 7 (Device 
Not Available) to a software exception han- 
dler whenever it encounters a floating point 
instruction. If EM bit is 0 for the Intel486 SX 
and IntelSX2 processors, the system will 
hang. (See Tables 4-4 and 4-5.) 

Intel486 DX, IntelDX2, and IntelDX4 
Processor EM Bit 

For the Intel486 DX, IntelDX2, and IntelDX4 
processors, the EM bit determines whether 
floating point instructions are trapped 
(EM=1) or executed. If EM=1, all floating 
point instructions will cause fault 7. 

lf EM=0O, the on-chip floating point will be 
used. 


NOTE: 


WAIT instructions are not affected by the 
state of EM. (See Tables 4-4 and 4-6.) 


(Monitor Coprocessor, bit 1) 


Intel486 SX and IntelSX2 
Processor MP Bit 


For Intel486 SX and IntelSX2 processors, 
the MP bit must be set to zero (MP =0). The 
MP bit is used in conjunction with the TS bit 
to determine if WAIT instructions should 
trap. For MP=0, the value of TS is a don’t 
care for these type of instructions. (See Ta- 
bles 4-4 and 4-5.) 
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Intel486 DX, IntelDX2, and IntelDX4 PE (Protection-Enable, bit 0) 

Processor MP Bit The PE bit enables the segment based pro- 
For the Intel486 DX, IntelDX2, and IntelDX4 tection mechanism if PE=1 protection is 
processors, the MP is used in conjunction enabled. When PE=0 the Intel486 proces- 
with the TS bit to determine if WAIT instruc- sor operates in REAL mode, with segment — 
tions cause fault 7. (See Table 4-6.) The TS based protection disabled, and addresses 
bit is set to 1 on task switches by the formed as in an 8086 processor. (Refer to 
Intel486 DX, IntelDX2, and IntelDX4 proces- . Table 4-2.) 


sors. Floating point instructions are not af- 
fected by the state of the MP bit. It is rec- 
ommended that the MP bit be set to one for 
normal processor operation. 


Table 4-4, Recommended Values of NE, EM, TS, and MP Bits in CRO 
Register for Intel486™ SX and IntelSX2™ Processors 


ion a CRO Bit | Instruction Type © 


Table 4-5. Recommended Values of the Floating Point Related Bits for All Intel486™ Processors 


CRO Bit intel486 SX and IntelSX2T™ | Intel486 DX, IntelDX2T™, and IntelIDX4™ 
Processors ; Processors 


Oe et 
a ae a eee RReamrene: See eel 


0, for DOS Systems 
1, for User-Defined Exception Handler | 


Table 4-6. Interpretation of Different Combinations of the 
EM, TS and MP Bits for All Intel486™ Processors 


NOTE: 
For Intel486 DX, IntelDX2™ and IntelDX4™ processors, if MP=1 and TS=1, the processor will generate a trap 7 so that 
the system software can save the floating point status of the old task. 
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Page Fault Linear Address Register 


Page Directory Base Register 


indicates Intel Reserved; Do not define. 


NOTE: 
See section 4.2.7, “Compatibility.” 


Control Register 1 (CR1) 
CR1 is reserved for use in future Intel processors. 
Control Register 2 (CR2) 


CR2, shown in Figure 4-7, holds the 32-bit linear ad- 
dress that caused the last page fault detected. The 
error code pushed onto the page fault handler’s 
stack when it is invoked provides additional status 
information on this page fault. 


Control Register 3 (CR3) 


CR3, shown in Figure 4-7, contains the physical 
base address of the page directory table. The page 
directory is always page aligned (4 Kbyte-aligned). 
This alignment is enforced by only storing bits 
12-31 in CRS. 


in the Intel486 processor, CR3 contains two bits, 
page write-through (PWT) (bit 3) and page cache 
disable (PCD) (bit 4). The page table entry (PTE) and 
page directory entry (PDE) also contain PWT and 
PCD bits. PWT and PCD control page cacheability. 
When a page is accessed in external memory, the 


Figure 4-7. Control Registers 2, 3 and 4 
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state of PWT and PCD are driven out on the PWT 
and PCD pins. The source of PWT and PCD can be 
CR3, the PTE or the PDE. PWT and PCD are 
sourced from CR3 when the PDE is being updated. 
When paging is disabled (PG = 0 in CRO), PCD and 
PWT are assumed to be 0, regardless of their state 
in CR3. 


A task switch through a task state ségment (TSS) 
which changes the values in CR3, or an explicit load 
into CR3 with any value, will invalidate all cached 
page table entries in the translation lookaside buffer 
(TLB). 


The page directory base address in CR is a physi- 
cal address. Tne page directory can be paged out 
while its associated task is suspended, but the oper- 
ating system must ensure that the page directory is 
resident in physical memory before the task is dis- 
patched. The entry in the TSS for CR3 has a physi- 
cal address, with no provision for a present bit. This 
means that the page directory for a task must be 
resident in physical memory. The CR3 image in a 
TSS must point to this area, before the task can be 
dispatched through its TSS. | 
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Control Register 4 (CR4) 


CR4, shown in Figure 4-7, contains bits that enable 
virtual mode exielisionis and protected mode virtual 
interrupts. 


VME (Virtual-8086 Mode Extensions, bit 0 of CR4) 


Setting this bit to 1 enables support for a virtual in- 
terrupt flag in virtual-8086 mode. This feature can 
improve the performance of virtual-8086 applica- 
tions by eliminating the overhead of faulting to a vir- 
tual-8086 monitor for emulation of certain opera- 
tions. (Refer to Appendix A, ‘“Advanced Features.’’) 


PVI (Protected-Mode Virtual Interrupts, bit 1 of CR4) 


‘Setting this bit to 1 enables support for a virtual in- 
terrupt flag in protected mode. This feature can en- 
able some programs designed for execution at privi- 
— lege level 0 to execute at privilege level 3. (Refer to 
Appendix A, “Advanced Features.”’) 


PSE (Page Size Extensions, bit 4 of CR4) 


Setting this bit to 1 enables 4-Mbyte pages. (Refer to 
Appendix A, “Advanced Features.’’) 


NOTE: 

Features described in CR4 (VME, PVI, and 
PSE) in the CPUID Feature Flag should be 
qualified with the CPUID instruction. The 
-CPUID instruction and CPUID Feature Flag 
are specific to particular models in the 
Intel486 processor family. (Refer to Appendix 
B, “Feature Determination.’’) 


4.2.3.2 System Address Registers 


Four special registers are defined to reference the 
tables or segments supported by the 80286, 
Intel3886, and Intel486 processors’ protection model. 
These tables or segments are: GDT (Global Descrip- 
tor Table), IDT (Interrupt Descriptor Table), LDT (Lo- 
cal Descriptor Table), TSS (Task State Segment). 


The addresses of these tables and segments are 
stored in special registers, the System Address and 
System Segment Registers, illustrated in Figure 4-5. 
These registers are named GDTR, IDTR, LDTR and 
TR respectively. Section 6, “Protected Mode Archi- 
tecture,”’ describes how to use these registers. 
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System Address Registers: GDTR and IDTR 


The GDTR and IDTR hold the 32-bit linear-base ad- 
dress and 16-bit limit of the GDT and IDT, respec- 
tively. | 


Because the GDT and IDT segments are global to 
all tasks in the system, the GDT and IDT are defined 
by 32-bit linear addresses (subject to page transla- 
tion if paging is enabled) and 16-bit limit values. 


System Segment Registers: LDTR and TR 


The LDTR and TR hold the 16-bit selector for the 
LDT descriptor and the TSS descriptor, respectively. 


Because the LDT and TSS segments are task spe- 
cific segments, the LDT and TSS are defined by se- — 
lector values stored in the system segment regis- 

ters. ; 


NOTE: 
A programmer-invisible segment descriptor 
register is associated with each system seg- 
ment register. 


4.2.4 FLOATING POINT REGISTERS 


Figure 4-8 shows the floating point register set. The 
on-chip FPU contains eight data registers, a tag 
word, a control register, a status register, an instruc- 
tion pointer and a data pointer. 


The operation of the Intel486 DX, IntelDX2, and 
IntelDX4 processor on-chip floating point unit is ex- 
actly the same as the Intel887 math coprocessor. 
Software written for the Intel387 math coprocessor 
will run on the on-chip floating point unit (FPU) with- 
out any modifications. : 


4.2.4.1 Floating Point Data Registers 


Floating point computations use the Intel486 DX, 
intelDX2, and IntelDX4 processor FPU data regis- 
ters. These eight 80-bit registers provide the equiva- 
lent capacity of twenty 32-bit registers. Each of the 
eight data registers is divided into “‘fields’’ corre- 
sponding to the FPU’s extended-precision data type. 
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TOP by one. Like other Intel486 DX, IntelDX2, and 
Tag Field IntelDX4 processor stacks in memory, the FPU reg- 
1 0 ister stack grows ‘down’ toward lower-addressed 
registers. 


Instructions may address the data registers either 
implicitly or explicitly. Many instructions operate on 
the register at the TOP of the stack. These instruc- 
tions implicitly address the register at which TOP 
points. Other instructions allow the programmer to 
explicitly specify which register to use. This explicit 
register addressing is also relative to TOP. 


4.2.4.2 Floating Point Tag Word 


The tag word marks the content of each numeric 
data register, as shown in Figure 4-9. Each two-bit 
tag represents one of the eight data registers. The 
principal function of the tag word is to optimize the 
FPU’s performance and stack handling by making it 
possible to distinguish between empty and non- 
empty register locations. It also enables exception 
handlers to check the contents of a stack location 
Figure 4-8. Floating Point Registers without the need to perform complex decoding of 
3 the actual data. 
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The FPU’s register set can be accessed either as a 

stack, with instructions operating on the top one or 4.2.4.3 Floating Point Status Word 

two stack elements, or as a fixed register set, with 

instructions operating on explicitly designated regis- | The 16-bit status word reflects the overall state of 
ters. The TOP field in the status word identifies the the FPU. The status word is shown in Figure 4-10 
current top-of-stack register. A ‘‘push” operation and is located in the status register. 

decrements TOP by one and loads a value into the 

new top register. A “pop” operation stores the value 

from the current top register and then increments 


15 | 0 
| , 242202-16 
NOTE: 


The index i of tag(i) is not top-relative. A program typically uses the “top” field of Status Word to determine which tagq(i) 
field refers to logical top of stack. 


TAG VALUES: 
00 = Valid 
01 = Zero 
10 = QNaN, SNaN, Infinity, Denormal and Unsupported Formats 
11 = Empty 


Figure 4-9. Floating Point Tag Word 
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BUSY 
TOP OF STACK POINTER © 
CONDITION CODE 


Cc c Ss 
TOP 
2 0 F 


ERROR SUMMARY STATUS 
STACK FLAG 


EXCEPTION FLAGS: 
PRECISION 
UNDERFLOW 
OVERFLOW 
ZERO DIVIDE 
DENORMALIZED OPERAND 


INVALID OPERATION 
242202-17 


ES is set if any unmasked exception bit is set; cleared otherwise. 
See Table 4-7 for interpretation of condition code. 
TOP values: 
| 000 = Register 0 is Top of Stack 
001 = Register 1 is Top of Stack 


¥ 


111 = Register 7 is Top of Stack 
For definitions of exceptions, refer to the section entitled “Exception Handling”. 


NOTES: 
The B bit (Busy, bit 15) is included for 8087 compatibility. The B bit reflects the contents of the ES bit (bit 7 of the status 
word). 

Bits 13-11 (TOP) point to the FPU register that is the current top-of-stack. 

The four numeric condition code bits, CO-C3, are similar to the flags in EFLAGS. Instructions that perform arithmetic 
operations update CO-C3 to reflect the outcome. The effects of these instructions on the condition codes are summa- 
rized in Table 4-7 through Table 4-10. 


Figure 4-10. Floating Point Status Word 
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Table 4-7. Floating Point Condition Code Interpretation 


[instruction [cosy | ca@ | ct cy 


FPREM, FPREM1 Three least significant bits of quotient (See Table 4-8.) Reduction 
0 = complete 
1 = incomplete 


Q1 or O/U# 
Zero or O/U# 


FCOM, FCOMP, 
FCOMPP, FTST, FUCOM, 
FUCOMP, FUCOMPP, 
FICOM, FICOMP 


FXAM Operand class (see Sign or O/U# Operand class 
Table 4-10) | | 


FCHS, FABS, FXCH, UNDEFINED Zero or O/U# UNDEFINED 
-FINCTOP, FDECTOP, | 

Constant loads, FXTRACT, 

FLD, FILD, FBLD, FSTP 

(ext real) 


FIST, FBSTP, FRNDINT, UNDEFINED Roundup or O/U# 
FST, FSTP, FADD, FMUL, 
FDIV, FDIVR, FSUB, 
FSUBR, FSCALE, FSQRT, 
FPATAN, F2XM1, FYL2X, 
FYL2XP 1 


FPTAN, FSIN, FCOS, UNDEFINED 
FSINCOS 


Result of comparison (see 
Table 4-9) 


Operand is not 
comparable 


UNDEFINED 


Roundup or O/U#, 
ifC2 = 1 


FLDENV, FRSTOR Each bit loaded from memory 
FINIT Clears these bits 


Reduction 
= complete 
1 = incomplete 


FLDCW, FSTENV, UNDEFINED 
FSTCW, FSTSW, FCLEX, 
‘| FSAVE 

NOTES: 

O/U# When both IE and SF bits of status word are set, indicating a stack exception, this bit distinguishes between 
stack overflow (C1 = 1) and underflow (C1 = 0). 

Reduction lf FPREM or FPREM1 produces a remainder that is less than the modulus, reduction is complete. When 
reduction is incomplete the value at the top of the stack is a partial remainder, which can be used as input to 
further reduction. For FPTAN, FSIN, FCOS, and FSINCOS, the reduction bit is set if the operand at the top of 
the stack is too large. In this case the original operand remains at the top of the stack. 

Roundup When the PE bit of the status word is set, this bit indicates whether the last rounding in the instruction was 


upward. 
UNDEFINED Do not rely on finding any specific value in these bits. (See Section 4.2.7, “Compatibility.”’) 
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Table 4-8. Condition Code Interpretation after FPREM and FPREM1 Instructions 


eondiienivede ar Interpretation after FPREM and FPREM1 
| X X | . Incomplete Reduction: | 
further interaction required for complete reduction 


ba 


Complete Reduction: 
CO, C3, and C1 contain the three 
least-significant bits of the quotient 


ee 
s-ascocelh 


Table 4-9. Condition Code Resulting from Comparison 


[ewer —Ci‘~dS=“‘sHOCdOUCO 
[Fors Opeend —PCt 
[708 < operand iP CdS 
“To = Operands pd 


+ Normal 
+ Infinity 


— Normal 
— Infinity 
+0 
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Bit 7 is the error summary (ES) status bit. The ES bit 
is set if any unmasked exception bit (bits O—5 in the 
status word) is set; ES is clear otherwise. The 
FERR # (floating point error) signal is asserted when 
ES is set. : 


Bit 6 is the stack flag (SF). This bit is used to distin- 
guish invalid operations due to stack overflow or un- 
derflow. When SF is set, bit 9 (C1) distinguishes be- 
tween stack overflow (C1=1) and underflow 
(C1=0). 


Table 4-11 shows the six exception flags in bits 0-5 
of the status word. Bits O-5 are set to indicate that 
the FPU has detected an exception while executing 
an instruction. 


The six exception flags in the status word can be 
individually masked by mask bits in the FPU control 
word. Table 4-11 lists the exception conditions, and 
- their causes in order of precedence. Table 4-11 also 
shows the action taken by the FPU if the corre- 
sponding exception flag is masked. 


An exception that is not masked by the control word 
will cause three things to happen: the corresponding 
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exception flag in the status word will be set, the ES 
bit in the status word will be set and the FERR# 
output signal will be asserted. When the Intel486 
DX, IntelDX2, or IntelDX4 processor attempts to ex- 
ecute another floating point or WAIT instruction, ex- 
ception 16 occurs or an external interrupt happens if 
the NE = 1 in control register 0. The exception condi- 
tion must be resolved via an interrupt service rou- 
tine. The FPU saves the address of the floating point 
instruction that caused the exception and the ad- 
dress of any memory operand required by that in- 
struction in the instruction and data pointers. (See 
section 4.2.4.4, “Instruction and Data Pointers.”’) 


Note that when a new value is loaded into the status 
word by the FLDENV (load environment) or 
FRSTOR (restore state) instruction, the value of ES 
(bit 7) and its reflection in the B bit (bit 15) are not 
derived from the values loaded from memory. The 
values of ES and B are dependent upon the values 
of the exception flags in the status word and their 
corresponding masks in the control word. If ES is set 
in such a case, the FERR# output of the Intel486 
DX, IntelDX2, or IntelDX4 processor is activated im- 
mediately. 


Table 4-11. FPU Exceptions 


Invalid 
Operation 


overflow/underflow (SF is also set). 


Zero Divisor 
number. 


Underflow 


Inexact Result 


(Precision) 
rounding mode. 


Default Action 
(if exception is masked) 


Operation on a signaling NaN, unsupported format, 
indeterminate form (0* 2, 0/0, (+ ©) + (— ©), etc.), or stack 


Denormalized | At least one of the operands is denormalized, i.e., it has the Normal processing 
Operand smallest exponent but a non-zero significand. continues 
The divisor is zero while the dividend is a non-infinite, non-zero 
The result is too large in magnitude to fit in the specified 
format. 


The true result is non-zero but too small to be represented in 
the specified format, and, if underflow exception is masked, 
denormalization causes loss of accuracy. 


The true result is not exactly representable in the specified 
format (e.g., 1/3); the result is rounded according to the 


Result is a quiet NaN, 
integer indefinite, or BCD 
indefinite 


Result is largest finite 
value or °° | 


Result is denormalized or 
zero 


Normal processing 
continues 
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4.2.4.4 Instruction and Data Pointers 


Because the FPU operates in parallel with the ALU 
(in the Intel486 DX, IntelDX2 and IntelDX4 proces- 


sors the arithmetic and logic unit (ALU) consists of 


the base architecture registers), any errors detected 
by the FPU may be reported after the ALU has exe- 
cuted the floating point instruction that caused it. To 
allow identification of the failing numeric instruction, 
the Intel486 DX, IntelDX2, and IntelDX4 processors 
contain two pointer registers that supply the address 
of the failing numeric instruction and the address of 
its numeric memory operand (if appropriate). 


The instruction and data pointers are provided for 
user-written error handlers. These registers are ac- 
cessed by the FLDENV (load environment), 
FSTENV (store environment), FSAVE (save state) 
and FRSTOR (restore state) instructions. Whenever 
the Intel486 DX, IntelDX2, and IntelDX4 processors 
decode a new floating point instruction, it saves the 
instruction (including any prefixes that may be 
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present), the address of the operand (if present) and 
the opcode. 


The instruction and data pointers appear in one of 
four formats depending on the operating mode of 
the Intel486 DX, IntelDX2, and IntelDX4 processors 
(protected mode or real-address mode) and depend- 
ing on the operand-size attribute in effect (32-bit op- 
erand or 16-bit operand). When the Intel486 DX, 
IntelDX2, or IntelDX4 processor is in the virtual-86 
mode, the real address mode formats are used. The 
four formats are shown in Figure 4-11 through Figure 
4-14. The floating point instructions FLDENV, 
FSTENV, FSAVE and FRSTOR are used to transfer 
these values to and from memory. Note that the val- 
ue of the data pointer is undefined if the prior float- 
ing point instruction did not have a memory operand. 


NOTE: 
The operand size attribute is the D bit in a 
segment descriptor. 
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32-Bit Protected Mode Format 
15 


Intel Reserved | Control Word 
intel Reserved Status Word 
intel Reserved Tag Word 


IP Offset 


Data Operand Offset 


Intel Reserved Operand Selector 
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Figure 4-11. Protected Mode FPU Instructions and Data Pointer Image in Memory, 32-Bit Format 


32-Bit Real Address Mode Format 


intel Reserved Control Word 
intel Reserved Status Word 
Intel Reserved Tag Word 


ST 
a 
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Figure 4-12. Real Mode FPU Instruction and Data Pointer Image in Memory, 32-Bit Format 
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16-Bit Protected Mode Format 


16-Bit Real Address Mode and 
Virtual-8086 Mode Format 


15 7 


0 
Control Word 


0 15 7 0 
| Status Word 2 Control Word 
Tag Word 4 
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CS Selector 8 - Instruction Pointer 15..0 
Operand Offset A 1P19.16 [0 OPCODE 10..0 
Operand Selector C Operand Pointer 15.0 
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Figure 4-13. Protected Mode FPU Instruction and : 
Data Pointer Image in Memory, 16-Bit Format Figure 4-14. Real Mode FPU Instruction and Data 


Pointer Image in Memory, 16-Bit Format 


RESERVED 


: RESERVED® 
ROUNDING CONTROL 
- PRECISION CONTROL 


7 ) 


RESERVED 
* "QO" AFTER RESET OR FINIT; 
CHANGEABLE UPON LOADING THE 
EXCEPTION MASKS: CONTROL WORD (CW). PROGRAMS 
PRECISION MUST IGNORE THIS BIT. 


UNDERFLOW 
OVERFLOW 
ZERO DIVIDE 
DENORMALIZED OPERAND 
INVALID OPERATION 
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Precision Control Rounding Control 

00-24 bits (single precision) 00-Round to nearest or even 
01-(reserved) 01-Round down (toward -°%) 
10-53 bits (double precision) 10-Round up (toward + ©) 
11-64 bits (extended precision) 11-Chop (truncate toward zero) 


NOTE: 
See section 4.2.7 “Compatibility,” for RESERVED bits. 


Figure 4-15. FPU Control Word 
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4.2.4.5 FPU Control Word 


The FPU provides several processing options that 
are selected by loading a control word from memory 
into the control register. Figure 4-15 shows the for- 
mat and encoding of fields in the control word. 


The low-order byte of the FPU control word config- 
ures the FPU error and exception masking. Bits 0-5 
of the control word contain individual masks for each 
of the six exceptions that the FPU recognizes. 


The high-order byte of the control word configures 
the FPU operating mode, including precision and 
rounding. 


RC (Rounding Control, bits 10-11) 


RC bits provide for directed rounding and true 
chop, as well as the unbiased round to nearest 
even mode specified in the iEEE standard. 
Rounding control affects only those instructions 
that perform rounding at the end of the opera- 
tion (and thus can generate a precision excep- 
tion); namely, FST, FSTP, FIST, all arithmetic 
instructions (except FPREM, FPREM1, 
FXTRACT, FABS and FCHS), and all transcen- 
dental instructions. 


PC (Precision Control, bits 8-9) 


PC bits can be used to set the FPU internal op- 
erating precision of the significand at less than 
the default of 64 bits (extended precision). This 
can be useful in providing compatibility with ear- 
ly generation arithmetic processors of smaller 
precision. PC affects only the instructions ADD, 
SUB, DIV, MUL, and SQRT. For all other instruc- 
tions, either the precision is determined by the 
opcode or extended precision is used. 


4.2.5 DEBUG AND TEST REGISTERS 


4.2.5.1 Debug Registers 


The six programmer accessible debug registers (Fig- 
ure 4-16) provide on-chip support for debugging. De- 
bug registers DRO-3 specify the four linear break- 
points. The Debug control register DR7, is used to 
set the breakpoints and the Debug Status Register, 
DR6, displays the current state of the breakpoints. 
The use of the Debug registers is described in sec- 
tion 12, “Debugging Support.”’ 
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Debug Registers 


Breakpoint Control 


Test Registers 
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TLB = Translation Lookaside Buffer 
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Figure 4-16. Debug and Test Registers 


4.2.5.2 Test Registers 


The Intel486 processor contains five test registers. 
The test registers are shown in Figure 4-16. TR6 and 
TR7 are used to control the testing of the translation 
look- aside buffer. TR3, TR4 and TR5 are used for 
testing the on-chip cache. The use of the test regis- 
ters is discussed in section 11, ‘Testability.’ 


4.2.6 REGISTER ACCESSIBILITY 


There are a few differences regarding the accessibil- 
ity of the registers in Real and Protected Mode. Ta- 
ble 4-12 summarizes these differences. (See section 
6, “Protected Mode Architecture.”’) 


4.2.6.1 FPU Register Usage 


In addition to the differences listed in Table 4-12, 
Table 4-13 summarizes the differences for the on- 
chip FPU. 
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Table 4-12. Register Usage 


Use in VT _  Usein _ Usein 
i Real Mode _ Protected Mode Virtual 8086 Mode 
| | General Registers 


| Store 
Yes Yes Yes Yes 

es 

| No 

| No 

Yes 

Yes 


GDTR Yes PL = Yes Yes 
IDTR Yes 


[No 
i [No [Pt=o | Yes 


NOTES: 
1. IOPL: The PUSHF and POPF instructions are made I/O Privilege Level sensitive in Virtual 8086 Mode. 
2. PL = 0: The registers can be accessed only when the current privilege level is zero. 


~< 
O 
” 
u 
~ 
I 
< 
© 
” 
©) 


Table 4-13. FPU Register Usage Differences 


Use in 
Virtual 8086 Mode 


‘a 
4) 
n 


Yes 


< 
£0) 
” 


i 
@ 
” 


Yes Yes 
4.2.7 COMPATIBILITY 2. Do not depend on the states of any un- 
| defined bits when storing them to mem- 
VERY IMPORTANT NOTE: ory or another register. 
COMPATIBILITY WITH FUTURE PROCESSORS 3. Do not depend on the ability to retain 
In the preceding register descriptions, information written into any undefined 
note certain Intel486 processor register _ bits. 
bits are Intel reserved. When reserved 4. When loading registers, always load the 
bits are called out, treat them as fully un- undefined bits as zeros. 
defined. This is essential for your soft- 5. However, registers that have been pre- 
ware compatibility with future proces- viously stored may be reloaded without 
sors! Follow the guidelines below: | masking. . 


1. Do not depend on the states of any un- 
defined bits when testing the values of 
defined register bits. Mask them out 
when testing. 
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Depending upon the values of undefined regis- 


ter bits will make your software dependent upon 
the unspecified Intel486 processor handling of 
these bits. Depending on undefined values risks 
making your software incompatible with future 
processors that define usages for the Intel486 
processor-undefined bits. AVOID ANY SOFT- 
WARE DEPENDENCE UPON THE STATE OF UN- 
DEFINED INTEL486 PROCESSOR REGISTER 
BITS. 


4.3 Instruction Set 

The Intel486 processor instruction set can be divid- 
ed into the following categories of operations: 
e Data Transfer 

e Arithmetic 

e Shift/Rotate 

¢ String Manipulation 

e Bit Manipulation 

e Control Transfer 

e High Level Language Support 

e Operating System Support 

e Processor Control 


The Intel486 processor instructions are listed in sec- 
tion 13, “Instruction Set Summary.” 


All Intel486 processor instructions operate on either 
0, 1, 2 or 3 operands; where an operand resides in a 
register, in the instruction itself or in memory. Most 
zero operand instructions (e.g., CLI, STI) take only 
one byte. One operand instructions generally are 
two bytes long. The average instruction is 3.2-bytes 
long. Because the Intel486 processor has a 32-byte 
instruction queue, an average of 10 instructions will 
be prefetched. The use of two operands permits the 
following types of common instructions: 


e Register to Register 
¢ Memory to Register 
¢ Memory to Memory 
e Immediate to Register 
e Register to Memory 
e immediate to Memory 
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The operands can be either 8-, 16-, or 32-bits long. 
As a general rule, when executing 32-bit code, oper- 
ands are 8 or 32 bits; when executing existing 80286 
or 8086 processor code (16-bit code), operands are 
8 or 16 bits. Prefixes can be added to all instructions 
that override the default length of the operands (i.e., 
use 32-bit operands for 16-bit code, or 16-bit oper- 
ands for 32-bit code). 


4.3.1 FLOATING POINT INSTRUCTIONS 


In addition to the instructions listed above, the 
Intel486, IntelDX2, and IntelDX4 processors have 
the following floating point instructions. Note that all 
floating point unit instruction mnemonics begin with 
an F. . 


e Floating Point 
e Floating Point Control 


4.4 Memory Organization 


Memory on the Intel486 processor is divided up into 
8-bit quantities (bytes), 16-bit quantities (words), and 
32-bit quantities (dwords). Words are stored in two 
consecutive bytes in memory with the low-order byte 
at the lowest address, the high order byte at the high 
address. Dwords are stored in four consecutive 
bytes in memory with the low-order byte at the low- 
est address, the high-order byte at the highest ad- 
dress. The address of a word or dword is the byte 
address of the low-order byte. 


In addition to these basic data types, the Intel486 
processor supports two larger units of memory: 
pages and segments. Memory can be divided up 
into one or more variable-length segments, which 
can be swapped to disk or shared between pro- 
grams. Memory can also be organized into one or 
more 4-Kbyte pages. Both segmentation and paging 
can be combined, gaining the advantages of both 
systems. The Intel486 processor supports both 
pages and segments in order to provide maximum 
flexibility to the system designer. Segmentation and 
paging are complementary. Segmentation is useful 
for organizing memory in logical modules, and as 
such is a tool for the application programmer, while 
pages are useful for the system programmer for 
managing the physical memory of a system. 
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4.4.1 ADDRESS SPACES 


The Intel486 processor has three distinct address 
spaces: logical, linear, and physical. A logical ad- 
dress (also known as a virtual address) consists of 
_ a selector and an offset. A selector is the contents 
of a segment register. An offset is formed by sum- 
ming all of the addressing components (BASE, 
INDEX, DISPLACEMENT) discussed in section 4.6.3 
‘32-Bit Memory Addressing Modes,” into an effec- 
tive address. Because each task on the Intel486 
processor has a maximum of 16K (214 -1) selectors, 
and offsets can be 4 Gbytes (292 bits), this gives a 
total of 246 bits or 64 terabytes of logical address 
space per task. The programmer sees this virtual 
address space. 


The segmentation unit translates the logical ad- 
dress space into a 32-bit linear address space. If the 
paging unit is not enabled then the 32-bit linear ad- 
dress corresponds to the physical address. The 


Effective Address Calculation 


Effective 
|32, Address 


15 3 2 0 Logical or 


Descriptor 
Index 


Segmentation 
Virtual Address Unit 
R43 | 
Selector P 
L 


Segment Register 
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Figure 4-17. Address Translation 
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paging unit translates the linear address space into 
the physical address space. The physical address 
is what appears on the address pins. | 


The primary difference between Real Mode and Pro- 
tected Mode is how the segmentation unit performs 
the translation of the logical address into the linear 
address. In Real Mode, the segmentation unit shifts 
the selector left four bits and adds the result to the 
offset to form the linear address. While in Protected 
Mode every selector has a linear base address as- 
sociated with it. The linear base address is stored in 
one of two operating system tables (i.e., the Local 
Descriptor Table or Global Descriptor Table). The 
selector’s linear base address is added to the offset 
to form the final linear address. 


Figure 4-17 shows the relationship between the vari- - 
Ous address spaces. 


BE3#—BE0# 
A31~—A2 


32 Paging Unit 32 
Linear (Optional Use) f{ Physical 
Address Address 


242202-24 


intel. 


4.4.2 SEGMENT REGISTER USAGE 


The main data structure used to organize memory is 
the segment. On the Intel486 processor, segments 
are variable sized blocks of linear addresses which 
have certain attributes associated with them. There 
are two main types of segments: code and data. The 
segments are of variable size and can be as small 
as 1 byte or as large as 4 Gbytes (292 bytes). 


In order to provide compact instruction encoding, 
and increase Intel486 processor performance, in- 
structions do not need to explicitly specify which 
segment register is used. A default segment register 
is automatically chosen according to the rules of Ta- 
ble 4-14. In general, data references use the selec- 
tor contained in the DS register; Stack references 
use the SS register and Instruction fetches use the 
CS register. The contents of the Instruction Pointer 
provide the offset. Special segment override prefix- 
es allow the explicit use of a given segment register, 
and override the implicit rules listed in Table 4-14. 
The override prefixes also allow the use of the ES, 
FS and GS segment registers. 


There are no restrictions regarding the overlapping 
of the base addresses of any segments. Thus, all 6 
segments could have the base address set to zero 
and create a system with a 4-Gbyte linear address 
space. This creates a system where the virtual ad- 
dress space is the same as the linear address 
space. Further details of segmentation are dis- 
cussed in section 6.0, ‘Protected Mode Architec- 
ture.” 


4.5 1/O Space 


The Intel486 processor has two distinct physical ad- 
dress spaces: Memory and I/O. Generally, peripher- 
als are placed in I/O space although the Intel486 
processor also supports memory-mapped peripher- 
als. The |/O space consists of 64 Kbytes, it can be 
divided into 64K 8-bit ports, 32K 16-bit ports, or 16K 
32-bit ports, or any combination of ports which add 
up to less than 64 Kbytes. The 64K I/O address 
space refers to physical memory rather than linear 
address, because I/O instructions do not go through 
the segmentation or paging hardware. The M/IlO# 
pin acts as an additional address line thus allowing 
the system designer to easily determine which ad- 
dress space the processor is accessing. 
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Table 4-14. Segment Register Selection Rules 


Implied 
(Default) 
Segment 


Segment 
Override 
Prefixes 
Possible 


Type of Memory 
Reference 


Code Fetch 


Destination of PUSH, 
PUSHF, INT, CALL, 
PUSHA Instructions 


Source of POP, POPA, 
POPF, IRET, RET 
instructions 


Destination of STOS, 
MOVS, REP STOS, REP 
MOVS Instructions (DI is 
Base Register) 


Other Data References, 
with Effective Address 
using Base Register of: 

[EAX] 

[EBX] 

[ECX] 

[EDX] 

[ESI] 

[EDI] 

[EBP] 
[ESP] 


The |/O ports are accessed via the IN and OUT I/O 
instructions, with the port address supplied as an 
immediate 8-bit constant in the instruction or in the 
DX register. All 8- and 16-bit port addresses are zero 
extended on the upper address lines. The !/O in- 
structions cause the M/lO# pin to be driven low. 


1/O port addresses 00F8H through OOFFH are re- 
served for use by Intel. 


1/O instruction code is cacheable. 
1/O data is not cacheable. 


|/O transfers (data or code) can be bursted. 
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4.6 Addressing Modes 


4.6.1 ADDRESSING MODES OVERVIEW 


The Intel486 processor provides a total of 11 ad- 
dressing modes for instructions to specify operands. 
The addressing modes are optimized to allow the 
efficient execution of high-level languages such as C 
and FORTRAN, and they cover the vast majority of 
data references needed by high-level languages. 


4.6.2 REGISTER AND IMMEDIATE MODES 


The following two addressing modes provide for in- - 


structions that operate on register or immediate op- 
erands: 


e Register Operand Mode: The operand is locat- 
ed in one of the 8-, 16- or 32-bit general registers. 


e immediate Operand Mode: The operand is in- 
cluded in the instruction as part of the opcode. 


4.6.3 32-BIT MEMORY ADDRESSING MODES 


The- remaining modes provide a mechanism for 
specifying the effective address of an operand. The 
linear address consists of two components: the seg- 
ment base address and an effective address. The 
effective address is calculated by using combina- 
tions of the following four address elements: 


| © DISPLACEMENT: An 8-, or 32-bit immediate val- 
ue, following the instruction. 


e BASE: The contents of any general purpose reg- 
ister. The base registers are generally used by 
compilers to ponte to the start of the local variable 
area. 


e INDEX: The contents of any general purpose reg- 
ister except for ESP. The index registers are used 
to access the elements of an array, or a string of 
characters. 


@ SCALE: The index register’s value can be multi- 
plied by a scale factor, either 1, 2, 4 or 8. Scaled 
index mode is especially useful for accessing ar- 
rays or structures. 


’ Combinations of these 4 components make up the 9 


additional addressing modes. There is no perform- 
ance penalty for using any of these addressing 
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combinations, because the effective address calcu- 
lation is pipelined with the execution of other instruc- 
tions. The one exception is the simultaneous use of 
Base and Index components, which requires one ad- 
ditional clock. 


As shown in Figure 4-18, the effective address (EA) 
of an operand is calculated according to the follow- 
ing formula: 


EA = Base Reg + (Index Reg * Scaling) + 
Displacement 


Direct Mode: The operand’s offset is contained as 
part of the instruction as an 8-, 16- or 32-bit dis- 
placement. 

Example: INC Word PTR [500] 


Register Indirect Mode: A BASE register contains 
the address of the operand. 


Example: MOV [ECX], EDX 


Based Mode: A BASE register’s contents is added 
to a DISPLACEMENT to form the operand’s offset. 


Example: MOV ECX, [EAX + 24] 


index Mode: An INDEX register’s contents is added 
to a DISPLACEMENT to form the operand’s offset. 


Example: ADD EAX, TABLE[ESI] 

Scaled Index Mode: An INDEX register’s contents 
is multiplied by a scaling factor which is added to a 
DISPLACEMENT to form the operand’s offset. 
Example: IMUL EBX, TABLE[ESI*4],7 

Based Index Mode: The contents of a BASE 
register is added to the contents of an INDEX regis- 
ter to form the effective address of an operand. 
Example: MOV EAX, [ESI] [EBX] 

Based Scaled Index Mode: The contents of an 


INDEX register is multiplied by a SCALING factor 
and the result is added to the contents of a BASE 


register to obtain the operand’s offset. 


Exampie: MOV ECx, [EDX*8] [EAX] 
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Figure 4-18. Addressing Mode Calculations 


Based Index Mode with Displacement: The con- 
tents of an INDEX Register and a BASE register’s 
contents and a DISPLACEMENT are all summed to- 
gether to form the operand offset. 


Example: ADD EDX, [ESI] [EBP + OOFFFFFOH] 


Based Scaled Index Mode with Displacement: 
The contents of an INDEX register are multiplied by 
a SCALING factor, the result is added to the con- 
tents of a BASE register and a DISPLACEMENT to 
form the operand’s offset. 

MOV EAX, 


Example: LOCALTABLE[EDI*4] 


[EBP+ 80] 


4.6.4 DIFFERENCES BETWEEN 16- AND 32-BIT 
ADDRESSES 


In order to provide software compatibility with 80286 
and 8086 processors, the Intel486 processor can 
execute 16-bit instructions in Real and Protected 
Modes. The processor determines the size of the 
instructions it is executing by examining the D bit in 
the CS segment Descriptor. If the D bit is 0 then all 
operand lengths and effective addresses are as- 
sumed to be 16 bits long. If the D bit is 1 then the 
default length for operands and addresses is 32 bits. 
In Real Mode the default size for operands and ad- 
dresses is 16-bits. 


Regardless of the default precision of the operands 


or addresses, the Intel486 processor is able to exe- 
cute either 16- or 32-bit instructions. This is 
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specified via the use of override prefixes. Two prefix- 
es, the Operand Size Prefix and the Address 

Length Prefix, override the value of the D bit on an 
individual instruction basis. These prefixes are auto- 
matically added by Intel assemblers. 


Example: The Intel486 processor is executing in 
Real Mode and the programmer needs to access 
the EAX registers. The assembler code for this 
might be MOV EAX, 32-bit MEMORYOP, ASM486 
' Macro Assembler automatically determines that an 
Operand Size Prefix is needed and generates it. 


Example: The D bit is 0, and the programmer wishes 
to use Scaled Index addressing mode to access an 
array. The Address Length Prefix allows the use of 
MOV DX, TABLE[ESI*2]. The assembler uses an 
Address Length Prefix because, with D=0, the de- 
fault addressing mode is 16-bits. 


Example: The D bit is 1, and the program wants to 
store a 16-bit quantity. The Operand Length Prefix is 
used to specify only a 16-bit value; MOV MEM16, 
DX. 


The OPERAND LENGTH and Address Length Pre- 
fixes can be applied separately or in combination to 
any instruction. The Address Length Prefix does not 
allow addresses over 64 Kbytes to be accessed in 
Real Mode. A memory address which exceeds 
FFFFH will result in a General Protection Fault. An 
Address Length Prefix only allows the use of the ad- 
ditional Intel486 processor addressing modes. 


When executing 32-bit code, the Intel486 processor 
uses either 8-, or 32-bit displacements, and any reg- 
ister can be used as base or index registers. When 
executing 16-bit code, the displacements are either 
8, or 16 bits, and the base and index register con- 
form to the 80286 processor model. Table 4-15 illus- 
trates the differences. 
‘Table 4-15. BASE and INDEX Registers for 
16- and 32-Bit Addresses 


16-Bit 32-Bit 
Addressing| Addressing 
BASE REGISTER BX,BP Any 32-bit GP 
Register 
INDEX REGISTER SI,DI ‘Any 32-bit GP 
| Register Except 
ESP 


SCALEFACTOR | none | 1,248 | 
DISPLACEMENT | 0,8, 16bits | 0, 8, 32 bits 
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4.7 Data Formats 


4.7.1 DATA TYPES 


The Intel486 processor can support a wide-variety of 
data types. In the following descriptions, the proces- 
sor consists of the base architecture registers. 


4.7.1.1 Unsigned Data Types 
Byte: Unsigned 8-bit quantity 
Word: Unsigned 16-bit quantity 
Dword: Unsigned 32-bit quantity 


The least significant bit (LSB) in a byte is bit 0, and 
the most significant bit is 7. 


4.7.1.2 Signed Data Types 


All signed data types assume 2’s complement nota- 
tion. The signed data types contain two fields, a sign 
bit and a magnitude. The sign bit is the most signifi- 
cant bit (MSB). The number is negative if the sign bit 


is 1. If the sign bit is 0, the number is positive. The 


magnitude field consists of the remaining bits in the 
number. (Refer to Figure 4-19.) 


8-bit Integer: Signed 8-bit quantity 

16-bit Integer: Signed 16-bit quantity 
32-bit Integer: Signed 32-bit quantity 
64-bit Integer: Signed 64-bit quantity 


The integer core of the Intel486 processors only 


_ gupport 8-, 16- and 32-bit integers. (See section 


4.7.1.4, “Floating Point Data Types.’’) 


4.7.1.3 BCD Data Types 


The Intel486 processor supports packed and un- 
packed binary coded decimal (BCD) data types. A 
packed BCD data type contains two digits per byte, 
the lower digit is in bits O-3 and the upper digit in 
bits 4-7. An unpacked BCD data type contains 1 
digit per byte stored in bits 0-3. 


The Intel486 processor supports 8-bit packed and 
unpacked BCD data types. (Refer to Figure 4-19.) 


; | 
intel. — 
4.7.1.4 Floating Point Data Types 


In addition to the base registers, the Intel486 DX, 
IntelDX2, and IntelDX4 processors’ on-chip floating 
point unit consists of the floating point registers. The 
floating point unit data type contain three fields: sign, 
significand and exponent. The sign field is one bit 
and is the MSB of the floating point number. The 
number is negative if the sign bit is 1. If the sign bit is 
0, the number is positive. The significand gives the 
significant bits of the number. The exponent field 
contains the power of 2 needed to scale the signifi- 
cand. (Refer to Figure 4-19.) 


Only the FPU supports floating point data types. 


Single Precision Real: 23-bit significand and 
8-bit exponent. 32 bits 
total. 

52-bit significand and 
11-bit exponent. 64 bits 
total. 

64-bit significand and 


15-bit exponent. 80 bits 
total. 


Double Precision Real: 


Extended Precision Real: 


Floating Point Unsigned Data Types 


‘The on-chip FPU does not support unsigned data 
types. (Refer to Figure 4-19.) 


Floating Point Signed Data Types 


The on-chip FPU only supports 16-, 32- and 64-bit 
integers. 
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Floating Point BCD Data Types 

The on-chip FPU only supports 80-bit packed BCD 
data types. 

4.7.1.5 String Data Types 

A string data type is a contiguous sequence of bits, 
bytes, words or dwords. A string may contain be- 
tween 1 byte and 4 Gbytes. (Refer to Figure 4-20.) 


String data types are only supported by the CPU 
section of the Intel486 processor. 


Byte String: | Contiguous sequence of bytes. 
Word String: | Contiguous sequence of words. 
Dword String: Contiguous sequence of dwords. 
Bit String: A set of contiguous bits. In the In- 


tel486 processor bit strings can be 
up to 4-gigabits long. 


4.7.1.6 ASCil Data Types 


The Intel486 processor supports ASCII (American 
Standard Code for Information Interchange) strings 
and can perform arithmetic operations (Such as ad- 
dition and division) on ASCII data. The Intel486 proc- 
essor can only operate on ASCII data. (Refer to Fig- 
ure 4-20.) 
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Supported by Supported by 
Base Registers VS FPU Least Significant Byte | 
[_DataFormat |_| _| Range |Precision|7 0 
| | | 
Byte | 0-255 | 8 bits 


H 


. | | 
Word ' O-84K | 16 bits | 
| | 


Dword / O4G | 32 bits 


8-Bit Integer | ' 8 bits 
| | 
16-Bit Integer | ' 16 bits 
32-Bit Integer ! i 32 bits 
| 64-Bit Integer 
8-Bit Unpacked BCD 
8-Bit Packed BCD | 0-9 


I 


| 80-Bit Packed BCD Xj210218 | 18 Digits 


Single Precision Real X | +10*8 = 24 bits 


Double Precision Real x | +104908 


| Extended Precision Real | |X/#10 64 bits 


{ 


\ 
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Figure 4-19. Intel486™ Processor Data Types 


2-74 


intel A _ Intel486™ PROCESSOR FAMILY 


String Data Types 


Address 


A+4N+43 A+4N42 A+4Ne1 A+4N 
Dword 
String 


A + 268,435,455 


A+3 A+2 
Bit 
String |7 o |7 0 7 0 |7 0 17 


+2,147,483,647 — 2,147,483,648 


a 
Character {7 _—_._—so 
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Figure 4-20. String and ASCII Data Types 


Least Significant Byte 


47 31 0 


31 


0 
32-Bit Pointer 


Figure 4-21. Pointer Data Types 
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4.7.1.7 Pointer Data Types support the following two types of pointers (see Fig- 
ure 4-21): 
A pointer data type contains a value that gives the ¢ 48-bit Pointer: 16-bit selector and 32-bit offset 


address of a piece of data. Intel486 processors 
P P © 32-bit Pointer: 32-bit offset 
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4.7.2 LITTLE ENDIAN vs. BIG ENDIAN DATA 
FORMATS 


The Intel486 processors, as well as all other mem- 
bers of the Intel architecture, use the “little-endian” 
method for storing data types that are larger than 
one byte. Words are stored in two consecutive bytes 
in memory with the low-order byte at the lowest ad- 
dress and the high order byte at the high address. 
Dwords are stored in four consecutive bytes in mem- 
ory with the low-order byte at the lowest address 
and the high order byte at the highest address. The 
address of a word or dword data item is the byte 
address of the low-order byte. 


Figure 4-22 illustrates the differences between the 
big-endian and little-endian formats for dwords. The 


32 bits of data are shown with the low order bit num- | 


bered bit 0 and the high order bit numbered 32. Big- 
endian data is stored with the high-order bits at the 
lowest addressed byte. Little-endian data is stored 
with the high-order bits in the highest addressed 
byte. “ 


The Intel486 processor has the following two in- 
structions that can convert 16- or 32-bit data be- 
tween the two byte orderings: 


e BSWAP (byte swap) handles 4-byte values 
e XCHG (exchange) handles 2-byte values 


m+3 m+2 m+ 
31 24 23 16 15 


Dword in Little-Endian Memory Format | 


m m+1 m+2 m+3 


24 23 16 15 


Dword in Big-Endian Memory Format 
242202-29 


Figure 4-22. Big vs. Little Endian Memory Format 
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4.8.1 INTERRUPTS AND EXCEPTIONS 


4.8 Interrupts 


Interrupts and exceptions alter the normal program | 
flow, in order to handle external events, to report 
errors or exceptional conditions. The difference be- 
tween interrupts and exceptions is that interrupts are — 
used to handle asynchronous external events while 
exceptions handle instruction faults. Although a pro- 
gram can generate a software interrupt via an INT N 
instruction, the Intel486 processors treat software 
interrupts as exceptions. 


_Hardware interrupts occur as the result of an exter- 


nal event and are classified into two types: maskable 
or non-maskable. Interrupts are serviced after the 
execution of the current instruction. After the inter- 
rupt handler is finished servicing the interrupt, exe- 


_ cution proceeds with the instruction immediately af- 


ter the interrupted instruction. Sections 4.8.3, 
“Maskable Interrupt,’ and 4.8.4, ‘““Non-Maskable In- 
terrupt,”’ discuss the differences between Maskable 
and Non-Maskable interrupts. 


Exceptions are classified as faults, traps, or aborts, 
depending on the way they are reported, and wheth- 
er or not restart of the instruction causing the excep- 
tion is supported. Faults are exceptions that are de- 


‘tected and serviced before the execution of the 


faulting instruction. A fault would occur in a virtual 
memory system when the processor referenced a 
page or a segment that was not present. The operat- 
ing system would fetch the page or segment from 
disk, and then the Intel486 processor would restart 
the instruction. Traps are exceptions that are report- 
ed immediately after the execution of the instruction | 
that caused the problem. User defined interrupts are 
examples of traps. Aborts are exceptions that do 
not permit the precise location of the instruction 
causing the exception to be determined. Aborts are 
used to report severe errors, such as a hardware 
error or illegal values in system tables. 


Thus, when an interrupt service routine has been 
completed, execution proceeds from the instruction 
immediately following the interrupted instruction. On 
the other hand, the return address from an excep- 
tion fault routine will always point at the instruction 
causing the exception and include any leading in- 
struction prefixes. Tables 4-16 and 4-17 summarize 
the possible interrupts for Intel486 processors and 
shows where the return address points. 


intel. 


Intel486 processors can handle up to 256 different 
interrupts and/or exceptions. In order to service the 
~ interrupts, a table with up to 256 interrupt vectors 
must be defined. The interrupt vectors are simply 
pointers to the appropriate interrupt service routine. 
In Real Mode (see section 5.0, “Real Mode Archi- 
tecture”), the vectors are 4-byte quantities, a Code 
Segment plus a 16-bit offset; in Protected Mode, the 
interrupt vectors are 8-byte quantities, which are put 
in an Interrupt Descriptor Table. (See section 
6.2.3.4, “Interrupt Descriptor Table.”) Of the 256 
possible interrupts, 32 are reserved for use by intel, 
the remaining 224 are free to be used by the system 
designer. 


4.8.2 INTERRUPT PROCESSING 


When an interrupt occurs, the following actions hap- 
pen. First, the current program address and the 
Flags are saved on the stack to allow resumption of 
the interrupted program. Next, an 8-bit vector is sup- 
plied to the Intel486 processor which identifies the 


appropriate entry in the interrupt table. The table - 


contains the starting address of the interrupt service 
routine. Then, the user supplied interrupt service 
routine is executed. Finally, when an IRET instruc- 
tion is executed the old Intel486 processor state is 
restored and program execution resumes at the ap- 
propriate instruction. 


The 8-bit interrupt vector is supplied to the Intel486 
processor in several different ways: exceptions sup- 
ply the interrupt vector internally; software INT 
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instructions contain or imply the vector; maskable 
hardware interrupts supply the 8-bit vector via the 
interrupt acknowledge bus sequence. Non-Maska- 
ble hardware interrupts are assigned to interrupt 


_ vector 2. 


4.8.3 MASKABLE INTERRUPT 


Maskable interrupts are the most common way used 
by the Intel486 processor to respond to asynchro- 
nous external hardware events. A hardware interrupt 
occurs when the INTR is pulled high and the Inter- 
rupt Flag bit (IF) is enabled. The Intel486 processor 
only responds to interrupts between instructions, 
(REPeat String instructions, have an “interrupt win- 
dow,” between memory moves, which allows inter- 
rupts during long string moves). When an interrupt 
occurs, the Intel486 processor reads an 8-bit vector 
supplied by the hardware which identifies the source 
of the interrupt, (one of 224 user defined interrupts). 
The exact nature of the interrupt sequence is dis- 
cussed in section 10.2.10, “Interrupt Acknowledge.” 


The IF bit in the EFLAG registers is reset when an 
interrupt is being serviced. This effectively disables 
servicing additional interrupts during an interrupt 
service routine. However, the IF may be set explicitly 
by the interrupt handler, to allow the nesting of inter- 
rupts. When an IRET instruction is executed, the 
Original state of the IF is restored. 
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Table 4-16. Interrupt Vector Assignments 


Interrupt | Instruction that Can Cause neu foo 
E : Points to Faulting 
xception : 
| | _ Instruction _ 
Debug Exception Any instruction 
NMI Interrupt | INT 2 or NMI 
One Byte Interrupt INT 
Interrupt on Overflow 
Array Bounds Check 5 BOUND 
Invalid OP-Code | 
Device Not Available ESC, WAIT 
Double Fault | Any instruction that can 
generate an exception 
Intel Reserved | 
Invalid TSS | 


Page Fault 


Tw Byte nterupt | 0-255 


*Some debug exceptions may report both traps on the previous instruction, and faults on the next instruction. 


Table 4-17. FPU Interrupt Vector Assignments 


Instruction Which Can Return Address 


interrupt 


Function Number 


- Cause Exception Points to Faulting 
Instruction — 


Floating Point Error Floating Point, WAIT 
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4.8.4 NON-MASKABLE INTERRUPT 


Non-maskable interrupts provide a method of servic- 
ing very high priority interrupts. A common example 
of the use of a non-maskable interrupt (NMI) would 
be to activate a power failure routine or SMI# to 
activate a power saving mode. When the NMI input 
is pulled high, it causes an interrupt with an internally 
supplied vector value of 2. Unlike a normal hardware 
interrupt, no interrupt acknowledgment sequence is 
performed for an NMI. 


While executing the NMI servicing procedure, the 
Intel486 processor will not service further NMI re- 
quests until an interrupt return (IRET) instruction is 
executed or the processor is reset (RSM in the case 
of SMI#). If NMI occurs while currently servicing an 
NMI, its presence will be saved for servicing after 
executing the first IRET instruction. The IF bit is 
cleared at the beginning of an NMI interrupt to inhibit 
further INTR interrupts. 


4.8.5 SOFTWARE INTERRUPTS 


A third type of interrupt/exception for the Intel486 
processor is the software interrupt. An INT n instruc- 
tion causes the processor to execute the interrupt 
service routine pointed to by the nth vector in the 
interrupt table. 


A special case of the two byte software interrupt INT 
n is the one byte INT 3, or breakpoint interrupt. By 
inserting this one byte instruction in a program, the 
user can set breakpoints in his program as a debug- 
ging tool. 


A final type of software interrupt is the single step 
interrupt. It is discussed in section 12.2, “Single- 
Step Trap.” 


4.8.6 INTERRUPT AND EXCEPTION PRIORITIES 


Interrupts are externally-generated events. Maska- 
ble Interrupts (on the INTR input) and Non-Maskable 
Interrupts (on the NMI input or SMI # input) are rec- 
ognized at instruction boundaries. When more than 
one interrupt or external event are both recognized 
at the same instruction boundary, the Intel486 proc- 
essor invokes the highest priority routine first. (See 
list below.) If, after the NMI service routine has been 
invoked, maskable interrupts are still enabled, then 
the Intel486 processor will invoke the appropriate 
interrupt service routine. 
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Priority for Servicing External Events for All 
intel486 Processors Except the Write-Back En- 
hanced IntelDX2 Processor 


1. RESET/SRESET 
2. FLUSH # 

3. SMI# 

4. NMI 

5. INTR 

6. STPCLK# 


NOTE: th * 
STPCLK#¥ will be recognized while in an in- 
terrupt service routine or an SMM handler. 


For the Write-Back Enhanced IntelDX2 processor, 
the priority of servicing external events is modified 
from the standard Intel486 processor. The list below 
shows the priority for write-back enhanced mode. 


Priority for Servicing External Events for the 
Write-Back Enhanced IntelIDX2 Processor 


1. RESET 
2. FLUSH # 
3. SRESET 
4. SMI # 
5. NMI 
6. INTR 

7. STPCLK# 


Exceptions are internally-generated events. Excep- 
tions are detected by the Intel486 processor if, in the 
course of executing an instruction, the Intel486 proc- 
essor detects a problematic condition. The IntelDX4 
processor then immediately invokes the appropriate 
exception service routine. The state of the Intel486 
processor is such that the instruction causing the 
exception can be restarted. If the exception service 
routine has taken care of the problematic condition, 
the instruction will execute without causing the same 
exception. _ 


It is possible for a single instruction to generate sev- 
eral exceptions (for example, transferring a single 
operand could generate two page faults if the oper- 
and location spans two “not present’ pages). How- 
ever, only one exception is generated upon each at- 
tempt to execute the instruction. Each exception 
service routine should correct its corresponding ex- 
ception, and restart the instruction. In this manner, 
exceptions are serviced until the instruction exe- 
cutes successfully. 
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As the Intel486 processor executes instructions, it ed. This cycle is repeated as each instruction is exe- 


follows a consistent cycle in checking for excep- cuted, and occurs in parallel with instruction decod- 
tions. Consider the case of the Intel486 processor ing and execution. Checking for EM, TS, or FPU er- 
having just completed an instruction. It then per- ror status only occurs for processors with on-chip 
forms the checks listed in Table 4-18 before reach- floating point units. 


ing the point where the next instruction is complet- 


Table 4-18. Sequence of Exception Checking 


| Description 


Check for Exception 1 Traps from the instruction just completed (single-step via Trap Flag, or 
Data Breakpoints set in the Debug Registers). : 


Check for Exception 1 Faults in the next instruction (Instruction Execution Breakpoint set in the 
Debug Registers for the next instruction). 


Check for external NMI and INTR . 


Check for Segmentation Faults that prevented fetching the entire next instruction (exceptions 
11 or 13). 


Check for Page Faults that prevented fetching the entire next instruction (exception 14). 


Check for Faults decoding the next instruction (exception 6 if illegal opcode; exception 6 if in 
Real Mode or in Virtual 8086 Mode and attempting to execute an instruction for Protected - 
Mode only (see section 6.5.4, ‘‘Protection and I/O Permission Bitmap’”’); or exception 13 if 
instruction is longer than 15 bytes, or privilege violation in Protected Mode (i.e., not at IOPL or 
at CPL=0). 


If WAIT opcode, check if TS= 1 and MP = 1 (exception 7 if both are 1). | 
If opcode for Floating Point Unit, check if EM=1 or TS=1 (exception 7 if either are 1). 


If opcode for Floating Point Unit (FPU), check FPU error status (exception 16 if error status is 
asserted). _ 


Check in the following order for each memory reference required by the instruction: 


a. Check for Segmentation Faults that prevent transferring the entire memory quantity 
(exceptions 11, 12, 13). | 
b. Check for Page Faults that prevent transferring the entire memory quantity (exception 14). 


a 
as 


NOTE: 

The order stated supports the concept of the paging mechanism being “underneath” the segmentation mechanism. There- 
fore, for any given code or data reference in memory, segmentation exceptions are generated before paging exceptions are 
generated. . . 
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4.8.7 INSTRUCTION RESTART 


The Intel486 processor fully supports restarting all 
instructions after faults. If an exception is detected in 


the instruction to be executed (exception categories — 


4 through 10 in Table 4-18), the Intel486 processor 
invokes the appropriate exception service routine. 


The Intel486 processor is in a state that permits re- 
start of the instruction, for all cases except the fol- 
lowing. An instruction causes a task switch to a task 
whose Task State Segment is partially “not pres- 
ent.” (An entirely “not present’ TSS is restartable.) 
Partially present TSSs can be avoided either by 
keeping the TSSs of such tasks present in memory, 
or by aligning TSS segments to reside entirely within 
a single 4K page (for TSS segments of 4 Kbytes or 
less). 


NOTE: 
Such cases are easily avoided by proper de- 
sign of the operating system. 
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4.8.8 DOUBLE FAULT 


A Double Fault (exception 8) results when the 
Intel486 processor attempts to invoke an exception 
service routine for the segment exceptions (10, 11, 
12 or 13), but in the process of doing so, detects an 
exception other than a Page Fault (exception 14). 


A Double Fault (exception 8) will also be generated 
when the Intel486 processor attempts to invoke the 
Page Fault (exception 14) service routine, and de- 
tects an exception other than a second Page Fault. 
In any functional system, the entire Page Fault serv- 
ice routine must remain “present” in memory. 


When a Double Fault occurs, the Intel486 processor 
invokes the exception service routine for exception 
8. 


4.8.9 FLOATING POINT INTERRUPT VECTORS 


Several interrupt vectors of the Intel486 Dx, 
IntelDX2, and IntelDX4 processors are used to re- 
port exceptional conditions while executing numeric 
programs in either real or protected mode. Table 4- 
19 shows these interrupts and their causes. 


Table 4-19. Interrupt Vectors Used by FPU 


ae Number Cause of Interrupt 


A Floating Point instruction was encountered when EM or TS of the Intel486 DX, 
IntelDX2, and IntelDX4 processor control register zero (CRO) was set. EM = 1 
indicates that software emulation of the instruction is required. When TS is set, either a 
Floating Point or WAIT instruction causes interrupt 7. This indicates that the current 
FPU context may not belong to the current task. 


The previous numerics instruction caused an unmasked exception. The address of the 
faulty instruction and the address of its operand are stored in the instruction pointer 
and data pointer registers. Only Floating Point and WAIT instructions can cause this 
interrupt. The Intel486 DX, IntelDX2, and IntelDX4 processors return address pushed 
onto the stack of the exception handler points to a WAIT or Floating Point instruction 
(including prefixes). This instruction can be restarted after clearing the exception 
condition in the FPU. The FNINIT, FNCLEX, FNSTSW, FNSTENV, and FNSAVE 


The first word or doubleword of a numeric operand is not entirely within the limit of its 
segment. The return address pushed onto the stack of the exception handler points at 
the Floating Point instruction that caused the exception, including any prefixes. The 
FPU has not executed this instruction; the instruction pointer and data pointer register 
refer to a previous, correctly executed instruction. . 


instructions can not cause this interrupt. 
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5.0 REAL MODE ARCHITECTURE 


5.1 Introduction 


When the Intel486 processor is reset or powered up, 
it is initialized in Real Mode. Real Mode has the 
same base architecture as the 8086 processor, ex- 
cept that it allows access to the 32-bit register set of 
the Intel486 processor. The Intel486 processor ad- 
dressing mechanism, memory size and interrupt 
handling are identical to those of Real Mode on the 
80286 processor. 


All of the Intel486 processor instructions are avail- 
able in Real Mode (except those instructions listed 
in section 6.5.4, “Protection and I/O Permission Bit- 
map’). The default operand size in Real Mode is 16 
bits, as in the 8086 processor. In order to use the 
32-bit registers and addressing modes, override pre- 
fixes must be used. Also, the segment size on the 
Intel486 processor in Real Mode is 64 Kbytes, forc- 
ing 32-bit effective addresses to have a value less 
than OOOOFFFFH. The primary purpose of Real 
Mode is to enable Protected Mode Operation. 


The LOCK prefix on the Intel486 processor, even in 
Real Mode, is more restrictive than on the 80286 
processor. This is due to the addition of paging on 
the Intel486 processor in Protected Mode and Virtu- 
al 8086 Mode. Paging makes it impossible to guar- 
antee that repeated string instructions can be 
LOCKed. The Intel486 processor can not require 
that all pages holding the string be physically pres- 
ent in memory. Hence, a Page Fault (exception 14) 
might have to be taken during the repeated string 
instruction. Therefore, the LOCK prefix can not be 
supported during repeated string instructions. 


Table 5-1 lists the only instruction forms where the 
LOCK prefix is legal on the Intel486 processor. 


An exception 6 will be generated if a LOCK prefix is 
placed before any instruction form or opcode not 
listed above. The LOCK prefix allows indivisible 
read/modify/write operations on memory operands 
using the instructions above. For example, even the 
ADD Reg, Mem is not LOCKable, because the Mem 
operand is not the destination (and therefore no 
memory read/modify/operation is being performed). 


Because, on the Intel486 processor, repeated string 
instructions are not LOCKable, it is not possible to 
LOCK the bus for a long period of time. Therefore, 
the LOCK prefix is not lOPL-sensitive on the 
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Intel486 processor. The LOCK prefix can be used at 
any privilege level, but only on the instruction toms 
listed above. 
Table 5-1. Instruction Forms Where LOCK 
Prefix Is Legal 


Operands 
(Dest, Source) 


BIT Test and Mem, Reg/ immed 
SET/RESET/COMPLEMENT 


[cde =i Mom, Rog 


ADD, OR, ADC, SBB, AND, | Mem, =e ea 
SUB, XOR 


5.2 Memory Addressing 


In Real Mode the maximum memory size is limited to 
1 megabyte. (See Figure 5-1.) Thus, only address 
lines A2-A19 are active. (Exception, after RESET | 
address lines A20-A31 are high during CS-relative 
memory cycles until an intersegment jump or call is 
executed. See section 9.5, “Reset and Initializa- 
tion’’.) 


OFFSET 
SEGMENT 


MAX LIMIT 
FIXED AT 64K IN 
REAL MODE 


MEMORY OPERAND 


SELECTED 
SEGMENT 


SEGMENT BASE 


Figure 5-1. Real Address Mode Addressing 
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Because paging is not allowed in Real Mode, the 


_ linear addresses are the same as the physical ad- 


dresses. Physical addresses are formed in Real 
Mode by adding the contents of the appropriate seg- 
ment register, which is shifted left by four bits to an 


effective address. This addition results in a physi- 
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cal address from OOOOQ0Q00H to 0010FFEFH. This is 
compatible with 80286 Real Mode. Because seg- 
ment registers are shifted left by 4 bits, Real Mode 
segments always start on 16-byte boundaries. 


All segments in Real Mode are exactly 64-Kbytes 
long, and may be read, written, or executed. The 
Intel486 processor will generate an exception 13 if a 
data operand or instruction fetch occurs past the 
end of a segment (i.e., if an operand has an offset 
greater than FFFFH, for example, a word with a low 
byte at FFFFH and the high byte at 0000H). 


Segments may be overlapped in Real Mode. Thus, if 
a particular segment does not use all 64 Kbytes, an- 
other segment can be overlaid on top of the unused 
portion of the previous segment. This allows the pro- 
grammer to minimize the amount of physical memo- 
ry needed for a program. 


5.3 Reserved Locations 


There are two fixed areas in memory which are re- 
served in Real address mode: system initialization 
area and the interrupt table area. Locations OO000H 
through OOSFFH are reserved for interrupt vectors. 
Each one of the 256 possible interrupts has a 4-byte 
jump vector reserved for it. Locations FFFFFFFOH 
through FFFFFFFFH are reserved for system initiali- 
zation. 


5.4 Interrupts 


Many of the exceptions shown in Table 4-16 and 
discussed in section 4.8.3, ““Maskable Interrupt,” are 
not applicable to Real Mode operation, in particular 
exceptions 10, 11, 14, 17, which do not happen in 
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Real Mode. Other exceptions have slightly different 
meanings in Real Mode; Table 5-2 identifies these 
exceptions. 


5.5 Shutdown and Halt 


The HALT instruction stops program execution and 
prevents the Intel486 processor from using the local 
bus until restarted. Either NMI, INTR with interrupts 
enabled (IF=1), or RESET will force the Intel486 
processor out of halt. If interrupted, the saved CS:IP 
will point to the next instruction after the HLT. 


As in the case of protected mode, the shutdown will 
occur when a severe error is detected that prevents 
further processing. In Real Mode, shutdown can oc- 
cur under two conditions, as follows: 


e An interrupt or an exception occurs (exceptions 8 
or 13) and the interrupt vector is larger than the 
Interrupt Descriptor Table (i.e., there is not an in- 
terrupt handler for the interrupt). 


e A CALL, INT or PUSH instruction attempts to 
wrap around the stack segment when SP is not 
even (i.e., pushing a value on the stack when SP 
= 0001 resulting in a stack segment greater than 
FFFFH). 


An NMI input can bring the processor out of shut- 
down if the Interrupt Descriptor Table limit is large 
enough to contain the NMI interrupt vector (at least 
0017H) and the stack has enough room to contain 
the vector and flag information (i.e., SP is greater 
than 0005H). If these conditions are not met, the 
Intel486 processor is unable to execute the NMI and 
executes another shutdown cycle. In this case, the 
Intel486 processor remains in the shutdown and can 
only exit via the RESET input. 


Table 5-2. Exceptions with Different Meanings in Real Mode (see Table 4-17) 


CS, DS, ES, FS, GS Segment 
overrun exception 


Interrupt Raiaied Instructions Return Address 
Number Location 
Interrupt table limit too small ae INT Vector is not within table limit Before Instruction 


Word memory reference beyond offset = 
FFFFH. 

An attempt to execute past the end of CS 
segment. 


SS Segment overrun exception Stack Reference beyond offset = FFFFH | Before Instruction 


Before Instruction 
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6.0 PROTECTED MODE 
ARCHITECTURE 


The complete capabilities of the Intel486 processor 


are unlocked when the Intel486 processor operates 
in Protected Virtual Address Mode (Protected 
Mode). Protected Mode vastly increases the linear 
address space to four Gbytes (232 bytes) and allows 
the running of virtual memory programs of almost 
unlimited size (64 terabytes or 246 bytes). In addition 
Protected Mode allows the Intel486 processor to run 
all of the existing 8086, 80286 and Intel386 proces- 
sor software, while providing a sophisticated memo- 
ry management and a hardware-assisted protection 
mechanism. Protected Mode allows the use of addi- 
tional instructions especially optimized for support- 
ing multitasking operating systems. The base archi- 


tecture of the Intel486 processor remains the same, 


the registers, instructions, and addressing modes 
' described in the previous sections are retained. The 
main difference between Protected Mode and Real 


Mode from a programmer’s view is the increased 


address space and a different addressing mecha- 
nism. | 


48/32 BIT POINTER 


SELECTOR OFFSET . 


ACCESS RIGHTS 
LIMIT 
BASE ADDRESS 


SEGMENT 
DESCRIPTOR 
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MEMORY OPERAND 


SEGMENT BASE 
ADDRESS 


Figure 6-1. Protected Mode Addressing 
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Like Real Mode, Protected Mode uses two compo- 
nents to form the logical address, a 16-bit selector is 
used to determine the linear base address of a seg- 
ment, the base address is added to a 32-bit effective 
address to form a 32-bit linear address. The linear 
address is then either used as the 32-bit physical 
address, or if paging is enabled the paging mecha- 
nism maps the 32-bit linear address into a 32-bit 
physical address. 


6.1 Addressing Mechanism 


The difference between the two modes lies in calcu- 
lating the base address. In Protected Mode the se- 
lector is used to specify an index into an operating 
system defined table. (See Figure 6-1.) The table 
contains the 32-bit base address of a given seg- 
ment. The physical address is formed by adding the 
base address obtained from the table to the offset. 


Paging provides an additional memory management 
mechanism which operates only in Protected Mode. 
Paging provides a means of managing the very large 
segments of the Intel486 processor. As such, paging 
operates beneath segmentation. The paging mecha- 
nism translates the protected linear address which 
comes from the segmentation unit into a physical 
address. Figure 6-2 shows the complete Intel486 
processor addressing mechanism with paging en- 
abled. 


SEGMENT LIMIT 


SELECTED 
SEGMENT 
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48 Bit Pointer 
Physical Address 


Segment 4K Bytes 


4K Bytes 


Intel486™ 
Processor 


Access Rights | 4K Bytes 
Limit Paging 
Base Address 


Physical 


|__ Address Memory Operand Physical Page: 


4K Bytes 


Linear 


Segment Address 


Descriptor 
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Figure 6-2. Paging and Segmentation 


6.2 Segmentation 


6.2.1 SEGMENTATION INTRODUCTION 


Segmentation is one method of memory manage- 
ment. Segmentation provides the basis for protec- 
tion. Segments are used to encapsulate regions of 
memory which have common attributes. For exam- 
ple, all of the code of a given program could be con- 
tained in a segment, or an operating system table 
may reside in a segment. All information about a 
segment is stored in an 8-byte data structure called 
a descriptor. All of the descriptors in a system are 
contained in tables recognized by hardware. 


6.2.2 TERMINOLOGY 


The following terms are used throughout the discus- 
sion of descriptors, privilege levels and protection: 


PL: Privilege Level—One of the four hierarchical 
privilege levels. Level 0 is the most privileged level 
and level 3 is the least privileged. More privileged 
levels are numerically smaller than less privileged 
levels. 


RPL: Requester Privilege Level—The privilege level 
of the original supplier of the selector. RPL is deter- 
mined by the least two significant bits of a selector. 


DPL: Descriptor Privilege Level—This is the least 
privileged level at which a task may access that de- 
scriptor (and the segment associated with that de- 
scriptor). Descriptor Privilege Level is determined by 
bits 6:5 in the Access Right Byte of a descriptor. 


CPL: Current Privilege Level—The privilege level at 
which a task is currently executing, which equals the 
privilege level of the code segment being executed. 
CPL can also be determined by examining the low- 
est 2 bits of the CS register, except for conforming 
code segments. 


EPL: Effective Privilege Level—The effective privi- 
lege level is the least privileged of the RPL and DPL. 
Because smaller privilege level values indicate 
greater privilege, EPL is the numerical maximum of 
RPL and DPL. 


Task: One instance of the execution of a program. 
Tasks are also referred to as processes. 
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6.2.3 DESCRIPTOR TABLES 


6.2.3.1 Descriptor Tables Introduction 


The descriptor tables define all of the segments 
which are used in an Intel486 processor system. 
(See Figure 6-3.) There are three types of tables on 
the Intel486 processor which hold descriptors: the 
Global Descriptor Table, Local Descriptor Table, and 
the Interrupt Descriptor Table. All of the tables are 


variable length memory arrays. They can range in. 


size between 8 bytes and 64 Kbytes. Each table can 
hold up to 8192 8-byte descriptors. The upper 13 
bits of a selector are used as an index into the de- 
scriptor table. The tables have registers associated 
with them which hold the 32-bit linear base address, 
and the 16-bit limit of each table. 


Each of the tables has a register associated with it, 
the GDTR, LDTR, and the IDTR (see Figure 6-3). 
The LGDT, LLDT, and LIDT instructions, load the 
base and limit of the Global, Local, and Interrupt De- 
scriptor Tables, respectively, into the appropriate 
register. The SGDT, SLDT, and SIDT store the base 
and limit values. These tables are manipulated by 


the operating system. Therefore, the load descriptor 


table instructions are privileged instructions. 


15 0 15 0 


o. or um 

LDT BASE 

LINEAR ADDRESS 
15 

32 
IDT LIMIT 1 PROGRAM INVISIBLE 
| F AUTOMATICALLY LOADED 
IDT BASE | 4 FROM LDT DESCRIPTOR 


LINEAR ADDRESS 
31 0 


GDT LIMIT 


GDT BASE 
LINEAR ADDRESS 


15 


0 
0 


31 
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Figure 6-3. Descriptor Table Registers 
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6.2.3.2 Global Descriptor Table 


The Global Descriptor Table (GDT) contains de- 
scriptors which are possibly available to all of the . 
tasks in a system. The GDT can contain any type of 
segment descriptor except for descriptors which are 
used for servicing interrupts (i.e., interrupt and trap 
descriptors). Every Intel486 processor system con- 
tains a GDT. Generally the GDT contains code and 
data segments used by the operating systems and 


‘task state segments, and descriptors for the LDTs in 


a system. , 


The first slot of the Global Descriptor Table corre- 
sponds to the null selector and is not used. The null 
selector defines a null pointer value. 


6.2.3.3 Local Descriptor Table 


LDTs contain descriptors which are associated with 
a given task. Generally, operating systems are de- 
signed so that each task has a separate LDT. The 
LDT may contain only code, data, stack, task gate, 
and call gate descriptors. LDTs provide a mecha- 
nism for isolating a given task’s code and data seg- 
ments from the rest of the operating system, while 
the GDT contains descriptors for segments which 
are common to ail tasks. A segment cannot be ac- 
cessed by a task if its segment descriptor does not 
exist in either the current LDT or the GDT. This pro- 
vides both isolation and protection for a task’s seg- 
ments, while still allowing global data to be shared 
among tasks. 


Unlike the 6-byte GDT or IDT registers which contain 
a base address and limit, the visible portion of the 
LDT register contains only a 16-bit selector. This se- 
lector refers to a Local Descriptor Table descriptor in 
the GDT. 


6.2.3.4 Interrupt Descriptor Table 


The third table needed for Intel486 processor sys- 
tems is the Interrupt Descriptor Table. (See Figure 6- 
4.) The IDT contains the descriptors which point to 
the location of up to 256 interrupt service routines. 
The IDT may contain only task gates, interrupt. 
gates, and trap gates. The IDT should be at least 
256 bytes in size in order to hold the descriptors for 
the 32 Intel Reserved Interrupts. Every interrupt 
used by a system must have an entry in the IDT. The 
IDT entries are referenced via INT instructions, ex- 
ternal interrupt vectors, and exceptions. (See sec- 
tion 4.8, ‘“Interrupts.’’) 


MEMORY ~ 


GATE FOR 
INTERRUPT #n 
GATEFOR 
INTERRUPT #n-1 


INTERRUPT 
DESCRIPTOR 
TABLE 
a (lOT) 
GATE FOR 
GATE FOR 
INTERRUPT #0 


IDT LIMIT 


\OT BASE i 


MEMORY 
ADDRESSES 


—_—— 
INCREASING 


N 
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Figure 6-4. Interrupt Descriptor Table 
Register Use 


6.2.4 DESCRIPTORS 


6.2.4.1 Descriptor Attribute Bits 


The object to which the segmeni selector points to 
is called a descriptor. Descriptors are eight-byte 
quantities that contain attributes about a given re- 
gion of linear address space (i.e., a segment). These 
attributes include the 32-bit base linear address of 
the segment, the 20-bit length and granularity of the 
segment, the protection level, read, write or execute 
privileges, the default size of the operands (16-bit or 
32-bit), and the type of segment. All of the attribute 
information about a segment is contained in 12 bits 
in the segment descriptor. All segments on the 
Intel486 processor have three attribute fields in 
common: the P bit, the DPL bit, and the S bit. The 
Present P bit is 1 if the segment is loaded in physical 
memory. If P=0, any attempt to access this seg- 
ment will cause a not present exception (exception 
11). The Descriptor Privilege Level DPL is a two-bit 
field that specifies the protection level O-3 associat- 
ed with a segment. 


The Intel486 processor has two main categories of 
segments: system segments and non-system seg- 
ments (for code and data). The segment S bit in the 
segment descriptor determines if a given segment is 
a system segment or a code or data segment. If the 
S bit is 1, the segment is either a code or data seg- 
ment. If it is 0, the segment is a system segment. 
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6.2.4.2 Intel486 Processor Code, Data 
Descriptors (S= 1) 


Figure 6-5 shows the general format of a code and 
data descriptor and Table 6-1 illustrates how the bits 
in the Access Rights Byte are interpreted. The Ac- 
cess Rights Bytes is bits 24-31 associated with the 
segment limit. i 


Code and data segments have several descriptor 
fields in common. The accessed A bit is set whenev- 
er the processor accesses a descriptor. The A bit is 
used by operating systems to keep usage statistics 
on a given segment. The G bit, or granularity bit, 
specifies if a segment length is byte-granular or 
page-granular. Intel486 processor segments can be 
one megabyte long with byte granularity (G=0) or 
four gigabytes with page granularity (G= 1), (i.e., 220 
pages each page is 4 Kbytes in length). The granu- 
larity is totally unrelated to paging. An Intel486 proc- 
essor system can consist of segments with byte 
granularity, and page granularity, whether or not 
paging is enabled. 


The executable E bit tells if a segment is a code or 
data segment. A code segment (E= 1, S=1) may be 
execute-only or execute/read as determined by the 
Read R bit. Code segments are execute only if 
R=0, and execute/read if R=1. Code segments 
may never be written into. 


NOTE: 
Code segments may be modified via aliases. 
Aliases are writeable data segments which 
occupy the same range of linear address 
space as the code segment. 


The D bit indicates the default length for operands 
and effective addresses. If D=1 then 32-bit oper- 
ands and 32-bit addressing modes are assumed. If 
D=0 then 16-bit operands and 16-bit addressing 
modes are assumed. Therefore all existing 80286 
code segments will execute on the Intel486 proces- 
sor assuming the D bit is set 0. 


Another attribute of code segments is determined by 
the conforming C bit. Conforming segments, C= 1, 
can be executed and shared by programs at differ- 
ent privilege levels. (See section 6.3, ‘Protection.’’) 
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= Address 


. Segment Base 15...0 Segment Limi 15...0 0 
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BASE Base Address of the segment 
LIMIT The length of the segment 


P _ Present Bit 1= Present, 0= Not Present 
DPL Descriptor Privilege Level 0-3 
S) Segment Descriptor 0 = System Descriptor, 1= Code or Data Segment Descriptor 
TYPE Type of Segment 
A Accessed Bit 
G Granularity Bit 1 = Segment length is page granular, O= Segment length is byte granular 
D Default Operation Size (recognized in cole segment descriptors only) 
1=32-bit segment, 0= 16-bit segment 
0 Bit must be zero (0) for compatibility with future processors 
AVL Available field for user or OS 
~ NOTE: 


In a maximum- -size segment (i.e., a segment with G =1 and segment limit 19...0= EERO: the lowest 12 bits of the 
segment base should be zero (i.e:, segment base 11. . .000=000h). 


Figure 6-5 Segment Descriptors 


Table 6-1. Access Rights Byte Definition for Code and Data Descriptions 


Bit 
Position 


Present (P) 


Function 


Segment is mapped into physical memory. 

P=0 No mapping to physical memory exits, base and 
limit are not used. | 

Segment privilege attribute used in privilege tests. 


Descriptor Privilege 
Level (DPL) 
Segment 
Descriptor (S) 


Code or Data (includes stacks) segment descriptor. 
System Segment Descriptor or Gate Descriptor. 


If Date Segment (S = 1, E = 0) 


Executable (E) E=0 Descriptor type is data segment: 
Expansion ED = ) Expand up segment, offsets must be < limit. 
Direction (ED) ED = | Expand down segment, offsets must be > limit. 


Writeable (W) Data segment may not be written into. , 


Data segment may be written into. 


If Code Segment (S = 1, E = 1) 

E= Descriptor type is code segment: 

C= Code segment may only be executed when CPL = 
DPL and CPL remains unchanged. 

Code segment may not be read. 

Code segment may be read. 


Executable (E) 
Conforming (C) 


Readable (R) 


Accessed (A) 


Segment has not been accessed. 
Segment selector has been loaded into segment 
register or used by selector test instructions. 
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Segments identified as data segments (E=0, S=1) 
are used for two types of Intel486 processor seg- 
ments: stack and data segments. The expansion di- 
rection (ED) bit specifies if a segment expands 
downward (stack) or upward (data). If a segment is a 
stack segment all offsets must be greater than the 
segment limit. On a data segment all offsets must be 
less than or equal to the limit. In other words, stack 
segments start at the base linear address plus the 
maximum segment limit and grow down to the base 
linear address plus the limit. On the other hand, data 
segments start at the base linear address and ex- 
pand to the base linear address plus limit. 


The write W bit controls the ability to write into a 
segment. Data segments are read-only if W=0. The 
stack segment must have W= 1. 


The B bit controls the size of the stack pointer regis- 
ter. lf B=1, then PUSHes, POPs, and CALLs all use 
the 32-bit ESP register for stack references and as- 
sume an upper limit of FFFFFFFFH. lf B=0, stack 
instructions all use the 16-bit SP register and as- 
sume an upper limit of FFFFH. 


Segment Base 15...0 
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6.2.4.3 System Descriptor Formats 


System segments describe information about oper- — 
ating system tables, tasks, and gates. Figure 6-6 
shows the general format of system segment de- 
scriptors, and the various types of system segments. 
Intel486 processor system descriptors contain a 
32-bit base linear address and a 20-bit segment lim- 
it. 80286 system descriptors have a 24-bit base ad- 
dress and a 16-bit segment limit. 80286 system de- 
scriptors are identified by the upper 16 bits pon all 
zero. 


6.2.4.4 LDT Descriptors (S=0, TYPE = 2) | 


LDT descriptors (S=0, TYPE=2) contain informa- 
tion about Local Descriptor Tables. LDTs contain a 
table of segment descriptors, unique to a particular 
task. Because the instruction to load the LDTR is 
only available at privilege level 0, the DPL field is 
ignored. LDT descriptors are only allowed in the 
Global Descriptor Table (GDT). 


Byte 
Address 


Segment Limit 15...0 0 


Type Defines 
Invalid 
Available 80286 TSS 
LDT 
Busy 80286 TSS 
80286 call gate 
Task Gate (for 80286, Intel486™ processor 
task) 
80286 interrupt gate 
80286 trap gate 


242202-36 
Invalid 
Available Intel486 processor TSS 
Undefined (Intel Reserved) 
Busy Intel486 processor TSS 
Intel486 processor call gate 
Undefined (Intel Reserved) 
Intel486 processor 
Intel486 processor 
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Figure 6-6. System Segment Descriptors 
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6.2.4.5 TSS. Descriptors 
(S=0, TYPE = 1, 3, 9, B) 


A Task State Segment (TSS) descriptor contains in- 
formation about the location, size, and privilege level 
of a Task State Segment (TSS). A TSS in turn is a 
special fixed format segment which contains all the 
state information for a task and a linkage field to 
permit nesting tasks. The TYPE field is used to indi- 
cate whether the task is currently BUSY (i.e.; on a 
chain of active tasks) or the TSS is available. The 
TYPE field also indicates if the segment contains an 
80286 processor TSS or an Intel486 processor TSS. 
The Task Register (TR) contains the selector which 
points to the current Task State Segment. 


6.2.4.6 Gate Descriptors 
(S=0, TYPE = 4-7, C, F) 


Gates are used to control access to entry points 
within the target code segment. The various types of 
gate descriptors are call gates, task gates, inter- 
rupt gates, and trap gates. Gates provide a level 


| selector sd 


| 

intel. 
of indirection between the source and destination of 
the control transfer. This indirection allows the proc- 
essor to automatically perform protection checks. It 
also allows system designers to control entry points 
to the operating system. Call gates are used to 
change privilege levels (see section 6.3, “Protec- 
tion’), task gates are used to perform a task switch, 


and interrupt and trap gates are used to specify in- 
terrupt service routines. 


Figure 6-7 shows the format of the four types of gate 
descriptors. Call gates are primarily used to transfer 
program control to a more privileged level. The call 
gate descriptor consists of three fields: the access 
byte, a long pointer (selector and offset) which 
points to the start of a routine and a word count 
which specifies how many parameters are to be cop- 
ied from the caller’s stack to the stack of the called 
routine. The word count field is only used by call 
gates when there is a change in the privilege level, 
other types of gates ignore the word count field. 


Byte 
Address 


Offset 15...0 0 


Word 
Offset 31.. rat +4 


Gate Descriptor Fields 


Name Value Description 


Type 80286 call gate 
5 
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Task gate (for 80286 or Intel486 processor task) 


80286 interrupt gate 
80286 trap gate 
Intel486™ processor call gate 
Intel486 processor interrupt gate 
Intel486 processor trap gate 

P Descriptor contents are not valid 
Descriptor contents are valid 


DPL—least privileged level at which a task may access the gate. WORD COUNT 0-31—the number of parameters to 
copy from caller’s stack to the called procedure’s stack. The parameters are 32-bit quantities for Intel486 processor 


gates, and 16-bit quantities for 80286 gates. 
DESTINATION 16-bit 
SELECTOR selector or 


Selector to the target code segment 


Selector to the target task state segment for task gate 


DESTINATION offset 
OFFSET 16-bit 80286 
32-bit Intel486 processor 


Entry point within the target code segment 


Figure 6-7. Gate Descriptor Formats 
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Interrupt and trap gates use the destination selector 
and destination offset fields of the gate descriptor as 
a pointer to the start of the interrupt or trap handler 
routines. The difference between interrupt gates and 
trap gates is that the interrupt gate disables inter- 
rupts (resets the IF bit), while the trap gate does not. 


Task gates are used to switch tasks. Task gates 
may only refer to a task state segment. (See section 
6.3.6, “Task Switching.’’) Therefore, only the desti- 
nation selector portion of a task gate descriptor is 
used, and the destination offset is ignored. 


Exception 13 is generated when a destination selec- 
tor does not refer to a correct descriptor type, i.e., a 
code segment for an interrupt, trap or call gate, a 
TSS for a task gate. 


The access byte format is the same for all gate de- 
scriptors. P= 1 indicates that the gate contents are 
valid. P=0 indicates the contents are not valid and 
Causes excepiion 11 if referenced. DPL is the de- 
scriptor privilege level and specifies when this de- 
scriptor may be used by a task. (See section 6.3, 
“Protection.’’) The S field, bit 4 of the access rights 
byte, must be 0 to indicate a system control descrip- 
tor. The type field specifies the descriptor type as 
indicated in Figure 6-7. 


6.2.4.7 Differences Between Intel486 Processor 
and 80286 Descriptors 


In order to provide operating system compatibility 
between 80286 and Intel486 processors, the 


| Segment Base 15.0 Base 15...0 
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Intel486 processor supports all of the 80286 seg- 
ment descriptors. Figure 6-8 shows the general for- 
mat of an 80286 system segment descriptor. The 
only differences between 80286 and Intel486 proc- 
essor descriptor formats are that the values of the 
type fields, and the limit and base address fields 
have been expanded for the Intel486 processor. The 
80286 system segment descriptors contained a 
24-bit base address and 16-bit limit, while the In- 
tel486 processor system segment descriptors have 
a 32-bit base address, a 20-bit limit field, and a gran- 
ularity bit. 


By supporting 80286 system segments the Intel486 
processor is able to execute 80286 application pro- 
grams on an Intel486 processor operating system. 
This is possible because the Intel486 processor au- 
tomatically understands which descriptors are 
80286-style descriptors and which descriptors are 
Intel486 processor-style descriptors. In particular, if 
the upper word of a descriptor is zero, then that de- 
scriptor is an 80286-style descriptor. 


The only other differences between 80286-style de- 
scriptors and Intel486 processor descriptors is the 
interpretation of the word count field of call gates 
and the B bit. The word count field specifies the 
number of 16-bit quantities to copy for 80286 call 
gates and 32-bit quantities for Intel486 processor 
call gates. The B bit controls the size of PUSHes 
when using a call gate; if B=0 PUSHes are 16 bits, 
if B= 1 PUSHes are 32 bits. 


Byte 
Address 


Segment Limit 15...0 0 


Intel Reserved 


Base Address of the segment 

The length of the segment 

Present Bit: 1=Present, 0= Not Present 
Descriptor Privilege Level 0-3 

System Descriptor: 0= System, 1 = User 
Type of Segment 
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Figure 6-8. 80286 Code and Data Segment Descriptors 
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6.2.4.8 Selector Fields 


A selector in Protected Mode has three fields: Local 
or Global Descriptor Table Indicator (Tl), Descriptor 
Entry Index (Index), and Requester (the selector’s) 
Privilege Level (RPL) as shown in Figure 6-9. The TI 
bits select one of two memory-based tables of de- 
scriptors (the Global Descriptor Table or the Local 
Descriptor Table). The Index selects one of 8K de- 
scriptors in the appropriate descriptor table. The 
RPL bits allow high speed testing of the selector’s 
privilege attributes. 


6.2.4.9 Segment Descriptor Cache 


In addition to the selector value, every segment reg- | 


ister has a segment descriptor cache register asso- 
ciated with it. Whenever a segment register’s con- 
tents are changed, the 8-byte descriptor associated 
with that selector is automatically loaded (cached) 
on the chip. Once loaded, all references to that seg- 
ment use the cached descriptor information instead 
of reaccessing the descriptor. The contents of the 


SELECTOR 


43 2 1 0 


SEGMENT 
REGISTER 


TABLE 


intel. 


descriptor cache are not visible to the programmer. 
Because descriptor caches only change when a 
segment register is changed, programs that modify 
the descriptor tables must reload the appropriate 
segment registers after changing a descriptor’s val- 
ue. , 


6.2.4.10 Segment Descriptor Register Settings 


The contents of the segment descriptor cache vary 
depending on the mode the Intel486 processor is 
operating in. When operating in Real Address Mode, 
the segment base, limit, and other attributes within 
the segment cache registers are defined as shown 
in Figure 6-10. For compatibility with the 8086 archi- 


‘tecture, the base is set to sixteen times the current 


selector value, the limit is fixed at OOOOFFFFH, and 
the attributes are fixed so as to indicate the segment 
is present and fully usable. In Real Address Mode, 
the internal “privilege level’ is always fixed to the — 
highest level, level 0, so I/O and other privileged 
opcodes may be executed. 


INDICATOR 


DESCRIPTOR 
TABLE 


DESCRIPTOR 
NUMBER 


GLOBAL 
DESCRIPTOR 
TABLE 
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Figure 6-9. Example Descriptor Selection 
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SEGMENT DESCRIPTOR CACHE REGISTER CONTENTS 


32 - BIT BASE 


(UPDATED DURING SELECTOR 
LOAD INTO SEGMENT REGISTER) 


CONFORMING PRIVILEGE 
STACK SIZE 
EXECUTABLE 
WRITEABLE 

READABLE 

EXPANSION DIRECTION 
GRANULARITY 
ACCESSED 


32 - BIT LIMIT 
(FIXED) 


OTHER ATTRIBUTES 
(FIXED) 


Y 
Y 


16X CURRENT GS SELECTOR OOOOFFFFH Y 


242202-40 
*Except the 32-bit CS base is initialized to FFFFFOOOH after reset until first intersegment control transfer (i.e., interseg- 
ment CALL, or intersegment JMP, or INT). (See Figure 6-12 for an example.) 


Key: 

yes 

no 

privilege level 0 
privilege level 1 
privilege level 2 
privilege level 3 
expand up 


expand down 

byte granularity 

page granularity 

push/pop 16-bit words 

push/pop 32-bit dwords 

does not apply to that segment cache register 


ee | | | 
en | 


Figure 6-10. Segment Descriptor Caches for Real Address Mode. 
(Segment Limit and Attributes Are Fixed) 


When operating in Protected Mode, the segment 
base, limit, and other attributes within the segment 
cache registers are defined as shown in Figure 6-11. 
In Protected Mode, each of these fields are defined 
according to the contents of the segment descriptor 
indexed by the selector value loaded into the seg- 
ment register. 


When operating in a Virtual 8086 Mode within the 
Protected Mode, the segment base, limit, and other 


attributes within the segment cache registers are de- 
fined as shown in Figure 6-12. For compatibility with 
the 8086 architecture, the base is set to sixteen 
times the current selector value, the limit is fixed at 
OOOOFFFFH, and the attributes are fixed so as to 
indicate the segment is present and fully usable. The 
virtual program executes at lowest privilege level, 
level 3, to allow trapping of all IOPL-sensitive in- 
structions and level-0-only instructions. 
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SEGMENT 
32 - BIT BASE 


DESCRIPTOR CACHE REGISTER CONTENTS 
32 - BIT LIMIT 


OTHER ATTRIBUTES 


(UPDATED DURING (UPDATED DURING (UPDATED DURING 
SELECTOR LOAD INTO. SELECTOR LOAD INTO SELECTOR LOAD INTO 
SEGMENT REGISTER) SEGMENT REGISTER) SEGMENT REGISTER) 
CONFORMING PRIVILEGE 


STACK SIZE 
EXECUTABLE 
WRITEABLE 
READABLE 
EXPANSION DIRECTION 
GRANULARITY 
ACCESSED 
PRIVILEGE LEVEL 
PRESENT 


BASE PER SEG DESCR LIMIT PER SEG DESCR 


p 
BASE PER SEG DESCR | LIMIT PER SEG DESCR | p| 
BASE PER SEG DESCR LIMIT PER SEG DESCR —| p | 

p 


BASE PER SEG DESCR | _ LIMIT PER SEG DESCR 


BASE PER SEG DESCR LIMIT PER SEG DESCR 
ee ee Ore Cte ay ght rn ee phe Ti yy a EG es te ee a pyr ge oe 242202-41 


Key: 

Y = fixed yes 
N = fixed no 
d 

p 


per segment descriptor 
per segment descriptor; descriptor must indicate ‘“‘present’”’ to avoid exception 11 j 
(exception 12 in case of SS) 


r = per segment descriptor, but descriptor must indicate “readable” to avoid exception 13 
(special case for SS) 
w = per segment descriptor, but descriptor must indicate “writeable” to avoid exception 13 


(special case for SS) 
— = does not apply to that segment cache register 


Figure 6-11. Segment Descriptor Caches for Protected Mode (Loaded per Descriptor) 
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SEGMENT 
32 - BIT BASE 


(UPDATED DURING SELECTOR 
LOAD INTO SEGMENT REGISTER) 


CONFORMING PRIVILEGE 
STACK SIZE 
EXECUTABLE 
WRITEABLE 
READABLE 
EXPANSION DIRECTION 
GRANULARITY 


32 - BIT LIMIT 


ACCESSED 
PRIVILEGE LEVEL 
PRESENT 
BASE LIMIT J 
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DESCRIPTOR CACHE REGISTER CONTENTS 


OTHER ATTRIBUTES 
(FIXED) 


(FIXED) 
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does not apply to that segment cache register 


Key: 

Y = yes D = expand down 

N =n0 B =byte granularity 

0 = privilege level 0 P = page granularity 

1 = privilege level 1 W = push/pop 16-bit words 
2 = privilege level 2 F = push/pop 32-bit dwords 
3 = privilege level 3 —_— = 

U = expand up 


Figure 6-12. Segment Descriptor Caches for Virtual 8086 Mode within Protected Mode 
(Segment Limit and Attributes are Fixed) 


6.3 Protection 


6.3.1 PROTECTION CONCEPTS 


The Intel486 processor has four levels of protection 
which are optimized to support the needs of a multi- 
tasking operating system to isolate and protect user 
programs from each other and the operating system. 
The privilege levels control the use of privileged in- 
structions, |/O instructions, and access to segments 
and segment descriptors. Unlike traditional proces- 
sor-based systems where this protection is achieved 
only through the use of complex external hardware 
and software the Intel486 processor provides the 


protection as part of its integrated Memory Manage- 
ment Unit. The Intel486 processor offers an addi- 
tional type of protection on a page basis, when pag- 
ing is enabled (See section 6.4.3, ‘‘Page Level Pro- 
tection.”’) 


The four-level hierarchical privilege system is illus- 
trated in Figure 6-13. It is an extension of the user/ 
supervisor privilege mode commonly used by mini- 
computers and, in fact, the user/supervisor mode is 
fully supported by the Intel486 processor paging 
mechanism. The privilege levels (PL) are numbered 
0 through 3. Level 0 is the most privileged or trusted 
level. 
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APPLICATIONS 


CPU 
ENFORCED 
SOFTWARE 
INTERFACES 


OS EXTENSIONS 


HIGH SPEED 
OPERATING 
SYSTEM 
INTERFACE 
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Figure 6-13. Four-Level Hierarchical Protection 


6.3.2 RULES OF PRIVILEGE 


The Intel486 processor controls access to both data 
and procedures between levels of a task, according 
to the following rules. 


e Data stored in a segment with privilege level p 
can be accessed only by code executing at a 
privilege level at least as privileged as p. 


e A code segment/procedure with privilege level p 
can only be called by a task executing at the 
same or a lesser privilege level than p. 


6.3.3 PRIVILEGE LEVELS 


6.3.3.1 Task Privilege 


At any point in time, a task on the Intel486 processor 

always executes at one of the four privilege levels. 
The Current Privilege Level (CPL) specifies the 
task’s privilege level. A task’s CPL may only be 
changed by control transfers through gate descrip- 
tors to a code segment with a different privilege iev- 
el. (See section 6.3.4, ‘Privilege Level Transfers.’’) 
Thus, an application program running at PL = 3 may 
call an operating system routine at PL = 1 (viaa 
gate) which would cause the task’s CPL to be set to 
1 until the operating system routine was finished. 
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The privilege level of a selector is specified by the 
RPL field. The RPL is the two least significant bits of 
the selector. The selector’s RPL is only used to es- 
tablish a less trusted privilege level than the current 
privilege level for the use of a segment. This level is 
called the task’s effective privilege level (EPL). The 
EPL is defined as being the least privileged (i.e., nu- 
merically larger) level of a task’s CPL and a selec- 
tor’s RPL. Thus, if selector’s RPL = 0 then the CPL 
always specifies the privilege level for making an ac- 
cess using the selector. On the other hand if RPL = 
3 then a selector can only access segments at level 
3 regardless of the task’s CPL. The RPL is most 
commonly used to verify that pointers passed to an 
operating system procedure do not access data that 
is of higher privilege than the procedure that origi- 
nated the pointer. Because the originator of a selec- 
tor can specify any RPL value, the Adjust RPL 
(ARPL) instruction is provided to force the RPL bits 
to the originator’s CPL. 


6.3.3.2 Selector Privilege (RPL) 


6.3.3.3 I/O Privilege and I/O Permission Bitmap . 


The 1/O privilege level (IOPL, a 2-bit field in the 
EFLAG register) defines the least privileged level at 
which I/O instructions can be unconditionally per- 
formed. I/O instructions can be unconditionally per- 
formed when CPL = IOPL. (The I/O instructions are 
IN, OUT, INS, OUTS, REP INS, and REP OUTS.) 
When CPL > IOPL, and the current task is associat- 
ed with a 286 TSS, attempted I/O instructions cause 
an exception 13 fault. When CPL > IOPL, and the 
current task is associated with an Intel486 processor 
TSS, the I/O Permission Bitmap (part of an Intel486 
processor TSS) is consulted on whether |/O to the 
port is allowed, or an exception 13 fault is to be gen- 
erated instead. For diagrams of the |/O Permission 
Bitmap, refer to Figure 6-14 and Figure 6-15. For 
further information on how the I/O Permission Bit- 
map is used in Protected Mode or in Virtual 8086 
Mode, refer to section 6.5.4, “Protection and I/O 
Permission Bitmap.” 


The !/O privilege level (IOPL) also affects whether 

several other instructions can be executed or cause 

an exception 13 fault instead. These instructions are 

called “lIOPL-sensitive” instructions and they are 

CLI and STI. (Note that the LOCK prefix is not One 
sensitive on the Intel486 Processelr): 
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Figure 6-14. Intel486™ Processor TSS and TSS Registers 
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I/O Ports Accessible: 2 —> 9, 12, 13, 15, 20 —> 24, 27, 33, 34, 40, 41, 48, 50, 52, 53, 58 —> 60, 62, 63, 96 —> 127 


Figure 6-15. Sample I/O Permission Bit Map 


The IOPL also affects whether the IF (interrupts en- 
able flag) bit can be changed by loading a value into 
the EFLAGS register. When CPL = IOPL, then the 
IF bit can be changed by loading a new value into 
the EFLAGS register. When CPL > IOPL, the IF bit 
cannot be changed by a new value POPed into (or 
otherwise loaded into) the EFLAGS register; the IF 
bit merely remains unchanged and no exception is 
generated. 


_ 6.3.3.4 Privilege Validation 


. The Intel486 processor provides several instructions 
to speed pointer testing and help maintain system 
integrity by verifying that the selector value refers to 
an appropriate segment. Table 6-2 summarizes the 
selector validation procedures available for the In- 
tel486 processor. — 


This pointer verification prevents the common prob- 
lem of an application at PL = 3 calling a operating 
systems routine at PL = 0 and passing the operat- 
ing system routine a “bad” pointer which. corrupts a 
data structure belonging to the operating system. If 
the operating system routine uses the ARPL instruc- 
tion to ensure that the RPL of the selector has no 
greater privilege than that of the caller, then this 
problem can be avoided. 


6.3.3.5 Descriptor Access 


There are basically two types of segment accesses: 
those involving code segments such as control 
transfers, and those involving data accesses. Deter- 
mining the ability of a task to access a segment in- 
volves the type of segment to be accessed, the in- 
struction used, the type of descriptor used and CPL, 
RPL, and DPL as described above. 


Table 6-2. Pointer Test Instructions 


Selector, 
Register 


LAR Register, 
Selector 


2-98 


= a aT a 


| Adjust Requested Privilege Level: adjusts the RPL of the selector to the numeric 
maximum of current selector RPL value and the RPL value in the register. Set 
zero flag if selector RPL was changed. 


VERR Selector VERify for Read: sets the zero flag if the segment referred to By the selector can 
be read. 
VERW Selector VERify for Write: sets the zero flag if the segment referred to by ihe selector can 
be written. 
Register, Load Segment Limit: reads the segment limit into the register if privilege rules 
Selector and descriptor type allow. Set zero flag if successful. 


Load Access Rights: reads the descriptor access rights byte into the register if 
privilege rules allow. Set zero flag if successful. 
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Any time an instruction loads data segment registers 
(DS, ES, FS, GS) the Intel486 processor makes pro- 
tection validation checks. Selectors loaded in the 
DS, ES, FS, GS registers must refer only to data 
segments or readable code segments. The data ac- 
cess rules are specified in section 6.3.2, “Rules of 
Privilege.” The only exception to those rules is read- 
able conforming code segments which can be ac- 
cessed at any privilege level. 


Finally the privilege validation checks are performed. 
The CPL is compared to the EPL and if the EPL is 
more privileged than the CPL an exception 13 (gen- 
eral protection fault) is generated. 


The rules regarding the stack segment are slightly 
different than those involving data segments. In- 
structions that load selectors into SS must refer to 
data segment descriptors for writeable data seg- 
ments. The DPL and RPL must equal the CPL. All 
other descriptor types or a privilege level violation 
will cause exception 13. A stack not present fault 
causes exception 12. Note that an exception 11 is 
used for a not-present code or data segment. 
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6.3.4 PRIVILEGE LEVEL TRANSFERS | 


Inter-segment control transfers occur when a selec- 
tor is loaded in the CS register. For a typical system 
most of these transfers are simply the result of a call 
or a jump to another routine. There are five types of 
control transfers which are summarized in Table 6-3. 
Many of these transfers result in a privilege level 
transfer. Changing privilege levels is done only via 
control transfers, by using gates, task switches, and 
interrupt or trap gates. 


Control transfers can only occur if the operation 
which loaded the selector references the correct de- 
scriptor type. Any violation of these descriptor usage 
rules will cause an exception 13 (e.g., JMP through a 
call gate, or IRET from a normal subroutine call). 


In order to provide further system security, all control 
transfers are also subject to the privilege rules. 


Table 6-3. Descriptor Types Used for Control Transfer 


Intersegment to the same or higher privilege 


Interrupt within task may change CPL 


Intersegment to a lower privilege level (changes 
task CPL) 


NOTES: 
1. NT (Nested Task bit of flag register) 


= 0 
2. NT (Nested Task bit of flag register) = 1 


. Descriptor Descriptor 
Control Transfer Types Operation Types 


Intersegment within the same privilege level JMP, CALL, RET, IRET* | Code Segment | GDT/LDT 


CALL Call Gate 
level 


Interrupt Instruction, 
Exception, External 
Interrupt 


RET, IRET() Code Segment | GDT/LDT 


CALL, JMP Task State 
Segment 


Task Switch CALL, JMP 
IRET(2) | 
Interrupt Instruction, 
Exception, External 
Interrupt 


GDT/LDT 


Trap or Interrupt 
Gate 


_ _ 


GDT 
GDT/LDT 
IDT 
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The privilege rules require that: 


e Privilege level transitions can only occur via 
gates. 


@ JMPs can be made to a non-conforming code 
segment with the same privilege or to a conform- 
ing code segment with greater or equal privilege. 


e CALLs can be made to a non-conforming code 
segment with the same privilege or via a gate toa 
more privileged level. 


e interrupts handled within the task obey the same 
privilege rules as CALLs. 


e Conforming Code segments are accessible by 
privilege levels which are the same or less privi- 
leged than the conforming-code segment’s DPL. 


¢ Both the requested privilege level (RPL) in the 
selector pointing to the gate and the task’s CPL 
must be of equal or greater privilege than the 
gate’s DPL. . 


e The code segment selected in the gate must be 
the same or more privileged than the task’s CPL. 


e Return instructions that do not switch tasks can 
only return control to a code segment with same 
or less privilege. 


e Task switches can be performed by a CALL, 
JMP, or INT which references either a task gate 
or task state segment who’s DPL is less privi- 
leged or the same privilege as the old task’s CPL. 


Any control transfer that changes CPL within a task 
causes a change of stacks as a result of the privi- 
lege level change. The initial values of SS:ESP for 
privilege levels 0, 1, and 2 are retained in the task 
state segment. (See section 6.3.6, “Task Switch- 
ing.”’) During a JMP or CALL control transfer, the 
new stack pointer is loaded into the SS and ESP 
registers and the previous stack pointer is pushed 
onto the new stack. 


When RETurning to the original privilege level, use 
of the lower-privileged stack is restored as part of 
the RET or IRET instruction operation. For subrou- 
tine calls that pass parameters on the stack and 
cross privilege levels, a fixed number of words (as 
specified in the gate’s word count field) are copied 
from the previous stack to the current stack. The 
inter-segment RET instruction with a stack adjust- 
ment value will correctly restore the previous stack 
pointer upon return. 
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Gates provide protected, indirect CALLs. One of the 
major uses of gates is to provide a secure method of 
privilege transfers within a task. Because the operat- 
ing system defines all of the gates in a system, it can 
ensure that all gates only allow entry into a few trust- 
ed procedures (such as those which allocate memo- 
ry, or perform I/O). 


6.3.5 CALL GATES 


Gate descriptors follow the data access rules of priv- - 
ilege; that is, gates can be accessed by a task if the 
EPL, is equal to or more privileged than the gate 
descriptor’s DPL. Gates follow the control transfer 
rules of privilege and therefore may only transfer 
control to a more privileged level. 


Call Gates are accessed via a CALL instruction and 
are syntactically identical to calling a normal subrou- 
tine. When an inter-level Intel486 processor call gate 
is activated, the following actions occur. 


1. Load CS:EIP from gate check for validity 
2. SS is pushed zero-extended to 32 bits 
3. ESP is pushed | 


4. Copy Word Count 32-bit parameters from the old 
stack to the new stack 


5. Push Return address on stack 


The procedure is identical for 80286 Call gates, ex- 


cept that 16-bit parameters are copied and 16-bit 
registers are pushed. | 


Interrupt Gates and Trap gates work in a similar 
fashion as the call gates, except there is no copying © 
of parameters. The only difference between Trap 
and Interrupt gates is that control transfers through 
an Interrupt gate disable further interrupts (i.e., the 
IF bit is set to 0), and Trap gates leave the interrupt 
status unchanged. 


6.3.6 TASK SWITCHING 


A very important attribute of any multitasking/multi- 
user operating system is its ability to rapidly switch 
between tasks or processes. The Intel486 processor 
directly supports this operation by providing a task 
switch instruction in hardware. The Intel486 proces- 
sor task switch operation saves the entire state of 
the machine (all of the registers, address space, and 
a link to the previous task), loads a new execution 
state, performs protection checks, and commences 
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execution in the new task, in about 10 microsec- 
onds. Like transfer of control via gates, the task 
switch operation is invoked by executing an inter- 
segment JMP or CALL instruction which refers to a 
Task State Segment (TSS), or a task gate descriptor 
in the GDT or LDT. An INT n instruction, exception, 
trap, or external interrupt may also invoke the task 
switch operation if there is a task gate descriptor in 
the associated IDT descriptor slot. 


The TSS descriptor points to a segment (see Figure 
6-14) containing the entire Intel486 processor exe- 
cution state while a task gate descriptor contains a 
TSS selector. The Intel486 processor supports both 
80286 and Intel486 processor style TSSs. Figure 6- 
16 shows an 80286 TSS. The limit of an Intel486 
processor TSS must be greater than 0064H (002BH 
for an 80286 TSS), and can be as large as 4 Gbytes. 
In the additional TSS space, the operating system is 
free to store additional information such as the rea- 
son the task is inactive, time the task has spent run- 
ning, and open files belong to the task. 


INITIAL 
STACKS 
FOR CPL 0, 1,2 


CURRENT 


AVAILABLE 
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Figure 6-16. 80286 TSS 
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Each task must have a TSS associated with it. The 
current TSS is identified by a special register in the 
Intel486 processor called the Task State Segment 


Register (TR). This register contains a selector refer- 


ring to the task state segment descriptor that de- 
fines the current TSS. A hidden base and limit regis- 
ter associated with TR are loaded whenever TR is 
loaded with a new selector. Returning from a task is 
accomplished by the IRET instruction. When IRET is 
executed, control is returned to the task which was 
interrupted. The current executing task’s state is 
saved in the TSS and the old task state is restored 
from its TSS. 


Several bits in the flag register and machine status . 
word (CRO) give information about the state of a 
task which are useful to the operating system. The 
Nested Task (NT) (bit 14 in EFLAGS) controls the 
function of the IRET instruction. If NT = 0, the IRET 
instruction performs the regular return; when 
NT = 1, IRET performs a task switch operation 
back to the previous task. The NT bit is set or reset 
in the following fashion: 


When a CALL or INT instruction initiates a task 
switch, the new TSS will be marked busy and the 
back link field of the new TSS set to the old TSS 
selector. The NT bit of the new task is set by CALL 
or INT initiated task switches. An interrupt that does 
not cause a task switch will clear NT. (The NT bit will 
be restored after execution of the interrupt handler) 
NT may also be set or cleared by POPF or IRET 
instructions. 


The Intel486 processor task state segment is 
marked busy by changing the descriptor type field 
from TYPE 9H to TYPE BH. An 80286 TSS is 
marked busy by changing the descriptor type field 
from TYPE 1 to TYPE 3. Use of a selector that refer- 
ences a busy task state segment causes an excep- 
tion 13. 


The Virtual Mode (VM) bit 17 is used to indicate if a 
task, is a virtual 8086 task. If VM = 1, then the tasks 
will use the Real Mode addressing mechanism. The 
virtual 8086 environment is only entered and exited 
via a task switch. (See section 6.5, “Virtual 8086 
Environment.’’) 


The T bit in the Intel486 processor TSS indicates 
that the processor should generate a debug excep- 
tion when switching to a task. If T = 1, upon entry to 
a new task, a debug exception 1 will be generated. 


2-101 


Intel486™ PROCESSOR FAMILY 


6.3.6.1 Floating Point Task Switching 


The FPU’s state is not automatically saved when a 
task switch occurs, because the incoming task may 
not use the FPU. The Task Switched (TS) Bit (bit 3 in 
the CRO) helps deal with the FPU’s state in a multi- 
tasking environment. Whenever the Intel OverDrive 
processors switch tasks, they set the TS bit. The 
Intel OverDrive processors detect the first use of a 
processor extension instruction after a task switch 
and causes the processor extension not available 
exception 7. The exception handler for exception 7 
may then decide whether to save the state of the 
FPU. A processor extension not present exception 
(7) will occur when attempting to execute a Floating 
Point or WAIT instruction if the Task Switched and 
Monitor coprocessor extension bits are both set (i.e., 
TS = 1 and MP = 1). 


6.3.7 INITIALIZATION AND TRANSITION TO 
| PROTECTED MODE 


Because the Intel486 processor begins executing in 
Real Mode immediately after RESET it is necessary 
to initialize the system tables and registers with the 
appropriate values. 


The GDT and IDT registers must refer to a valid GDT 
and IDT. The IDT should be at least 256-bytes long, 
and GDT must contain descriptors for the initial 
code, and data segments. Figure 6-17 shows the 
tables and Figure 6-18 the descriptors needed for a 
simple Protected Mode Intel486 processor system. 
It has a single code and single data/stack segment 
each four-Gbytes long and a single privilege level 
PL = 0. | 


The actual method of enabling Protected Mode is to 
load CRO with the PE bit set, via the MOV CRO, R/M 
instruction. This puts the Intel486 processor in Pro- 
tected Mode. 


After enabling Protected Mode, the next instruction 
should execute an intersegment JMP to load the CS 
register and flush the instruction decode queue. The 
_ final step is to load all of the data segment registers 
with the initial selector values. 
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Figure 6-17. Simple Protected System 


An alternate approach to entering Protected Mode 
which is especially appropriate for multitasking oper- 
ating systems, is to use the built in task-switch to 


load ail of the registers. In this case the GDT would 


contain two TSS descriptors in addition to the code 
and data descriptors needed for the first task. The 
first JMP instruction in Protected Mode would jump 
to the TSS causing a task switch and loading all of 
the registers with the values stored in the TSS. Be- 
cause a task switch saves the state of the current 
task in a task state segment, the Task State Seg- . 
ment Register should be initialized to point to a valid 
TSS descriptor. 


6.4 Paging 


6.4.1 PAGING CONCEPTS 


Paging is another type of memory management use- 
ful for virtual memory multitasking operating 
systems. Unlike segmentation which modularizes 
programs and data into variable length segments, 
paging divides programs into multiple uniform size 
pages. Pages bear no direct relation to the logical 
structure of a program. While segment selectors can 
be considered the logical “name” of a program 
module or data structure, a page most likely corre- 
sponds to only a portion of a module or data struc- 
ture. | 
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Figure 6-18. GDT Descriptors for Simple System 


By taking advantage of the locality of reference dis- 
played by most programs, only a small number of 
pages from each active task need be in memory at 
any one moment. 


6.4.2 PAGING ORGANIZATION 


6.4.2.1 Page Mechanism 


The Intel486 processor uses two levels of tables to 
translate the linear address (from the segmentation 
unit) into a physical address. There are three com- 
ponents to the paging mechanism of the Intel486 
processor: the page directory, the page tables, and 
the page itself (page frame). All memory-resident el- 
ements of the Intel486 processor paging mechanism 
are the same size, namely, 4 Kbytes. A uniform size 
for all of the elements simplifies memory allocation 
and reallocation schemes, because there is no prob- 
lem with memory fragmentation. Figure 6-19 shows 
how the paging mechanism works. 


6.4.2.2 Page Descriptor Base Register 


CR2 is the Page Fault Linear Address register. It 
holds the 32-bit linear address which caused the last 
page fault detected. 


CR3 is the Page Directory Physical Base Address 
Register. It contains the physical starting address of 
the Page Directory. The lower 12 bits of CR3 are 


always zero to ensure that the Page Directory is al- 
ways page aligned. Loading it via a MOV CR3 reg 
instruction causes the Page Table Entry cache to be 
flushed, as will a task switch through a TSS that 
changes the value of CRO. (See section 6.4.5, 
“Translation Lookaside Buffer.’’) 


6.4.2.3 Page Directory 


The Page Directory is 4-Kbytes long and allows up 
to 1024 Page Directory Entries. Each Page Directory 
Entry contains the address of the next jievel of ta- 
bies, the Page Tables and information about the 
page table. The contents of a Page Directory Entry 
are shown in Figure 6-20. The upper 10 bits of the 
linear address (A22—A31) are used as an index to 
select the correct Page Directory Entry. 


6.4.2.4 Page Tables 


Each Page Table is 4 Kbytes and holds up to 1024 
Page Table Entries. Page Table Entries contain the 
starting address of the page frame and statistical 
information about the page. (See Figure 6-21.) Ad- 
dress bits A12-A21 are used as an index to select 
one of the 1024 Page Table Entries. The 20 upper- 
bit page frame address is concatenated with the 
lower 12 bits of the linear address to form the physi- 
cal address. Page tables can be shared between 
tasks and swapped to disks. 
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Figure 6-19. Paging Mechanism 
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6.4.2.5 Page Directory/Table Entries 


The lower 12 bits of the Page Table Entries and 
Page Directory Entries contain statistical information 


about pages and page tables respectively. The P | 


(Present) bit 0 indicates if a Page Directory or Page 
Table entry can be used in address translation. If 
P = 1 the entry can be used for address translation, 
if P = O the entry cannot be used for translation, 
and all of the other bits are available for use by the 
software. For example the remaining 31 bits could 
be used to indicate where on the disk the page is 
stored. 


The A (Accessed) bit 5, is set by the Intel486 proc- 
essor for both types of entries before a read or write 
access occurs to an address covered by the entry. 
The D (Dirty) bit 6 is set to 1 before a write to an 
address covered by that page table entry occurs. 
The D bit is undefined for Page Directory Entries. 
When the P, A and D bits are updated by the In- 
tel486 processor, a Read-Modify-Write cycle is gen- 
erated which locks the bus and prevents conflicts 
with other processors or peripherals. Software which 
modifies these bits should use the LOCK prefix to 
ensure the integrity of the page tables in multimaster 
systems. 


The 3 bits marked OS Reserved in Figure 6-20 and 
Figure 6-21 (bits 9-11) are software definable. OSs 
are free to use these bits for whatever purpose they 
wish. An example use of the OS Reserved bits 
would be to store information about page aging. By 
keeping track of how long a page has been in mem- 
ory since being accessed, an operating system can 
implement a page replacement algorithm such as 
Least Recently Used. 
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The (User/Supervisor) U/S bit 2 and the (Read/ 
Write) R/W bit 1 are used to provide protection attri- 
butes for individual pages. . 


- 6.4.3 PAGE LEVEL PROTECTION 


(R/W, U/S BITS) 


The Intel486 processor provides a set of protection 
attributes for paging systems. The paging mecha- 
nism distinguishes between two levels of protection: 
User which corresponds to level 3 of the segmenta- 
tion based protection, and supervisor which encom- 
passes all of the other protection levels (0, 1, 2). 


The R/W and U/S bits are used in conjunction with 
the WP bit in the flags register (EFLAGS). The In- 


tel886 processor does not contain the WP bit. The Hm 


WP bit has been added to the Intel486 processor to 
protect read-only pages from supervisor write ac- 
cesses. The Intel3886 processor allows a read-only 
page to be written from protection levels 0, 1 or 2. 
WP =0 is the Intel386 processor compatible mode. 
When WP= 0, the supervisor can write to a read- 
only page as defined by the U/S and R/W bits. 
When WP=1, supervisor access to a read-only 
page (R/W=0) will cause a page fault (exception 
14). 


Table 6-4 shows the affect of the WP, U/S and R/W 
bits on accessing memory. When WP = 0, the super- 
visor can write to pages regardless of the state of 
the R/W bit. When WP=1 and R/W=0, the super- 
visor cannot write to a read-only page. A user at- 
tempt to access a supervisor only page (U/S=0) or 
to write to a read-only page will cause a page fault 
(exception 14). 


Table 6-4. Page Level Protection Attributes 


[ws [aw | we | _UserAccess | Supervisor Access 


0 0 ) None Read/Write/Execute 
0 1 0 None Read/Write/Execute 
1 0 0 Read/Execute Read/Write/Execute 
1 1 0 Read/Write/Execute Read/Write/Execute 
0 0 1 None Read/Execute 

0 1 1 None Read/Write/Execute 
1 0 1 Read/Execute Read/Execute 

1 1 1 Read/Write/Execute Read/Write/Execute 
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The R/W and U/S bits provide protection from user 
access on a page by page basis because the bits 
are contained in the Page Table Entry and the Page 
Directory Table. The U/S and R/W bits in the first- 
level Page Directory Table apply to all entries in the 
page table pointed to by that directory entry. The 
U/S and R/W bits in the second-level Page Table 
Entry apply only to the page described by that entry. 
The most restrictive of the.U/S and R/W bits from 
_ the Page Directory Table and the Page Table Entry 
are used to address a page. 


Example: If the U/S and R/W bits for the Page Di- 
rectory entry were 10 (user read/execute) and the 
U/S and R/W bits for the Page Table Entry were 01 
(no user access at all), the access rights for the 
page would be 01, the numerically smaller of the 
two. . 


Note that a given segment can be easily made read- 
only for level 0, 1 or 2 via use of segmented protec- 
tion mechanisms. (Section 6.3, ‘‘Protection’’.) 


6.4.4 PAGE CACHEABILITY 
(PWT AND PCD BITS) 


See section 7.6, “Page Cacheability,” for a detailed 
description of page cacheability and the PWT and 
‘PCD bits. 


6.4.5 TRANSLATION LOOKASIDE BUFFER 


‘The Intel486 processor paging hardware is designed 
to support demand paged virtual memory systems. 
However, performance would degrade substantially 
if the Intel486 processor was required to access two 
levels of tables for every memory reference. To 
solve this problem, the Intel486 processor keeps a 
cache of the most recently accessed pages. This 
cache is called the Translation Lookaside Buffer 
(TLB). The TLB is a four-way set associative 32-en- 
try page table cache. It automatically keeps the most 
commonly used Page Table Entries in the Intel486 
processor. The 32-entry TLB coupled with a 4K 
page size, results in coverage of 128 Kbytes of 
memory addresses. For many common multitasking 
systems, the TLB will have a hit rate of about 98%. 
This means that the Intel486 processor will only 
have to access the two-level page structure on 2% 
_ of all memory references. Figure 6-22 illustrates how 
the TLB complements the Intel486 processor’s pag- 
ing mechanism. 
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Figure 6-22. Translation Lookaside Buffer 


Reading a new entry into the TLB (TLB refresh) is a 
two step process handled by the Intel486 processor 
hardware. The sequence of data cycles to perform a 
TLB refresh are the following: 


1. Read the correct Page Directory Entry, as pointed 
to by the page base register and the upper 10 bits 
of the linear address. The page base register is in 
control register 3. 


a. Optionally perform a locked read/write to set. 
the accessed bit in the directory entry. The di- 
rectory entry will actually get read twice if the 
Intel486 processor needs to set any of the bits 
in the entry. If the page directory entry changes 
between the first and second reads, the data 
returned for the second read will be used. 


2. Read the correct entry in the Page Table and 
place the entry in the TLB. 


a. Optionally perform a locked read/write to set 
the accessed and/or dirty bit in the page table 
entry. Again, note that the page table entry will — 
actually get read twice if the Intel486 processor 
needs to set any of the bits in the entry. Like 
the directory entry, if the data changes be- 
tween the first and second read the data re- 
turned for the second read will be used. 


Note that the directory entry must always be read | 
into the Intel486 processor, because directory en- 
tries are never placed in the paging TLB. Page faults 
can be signaled from either the page directory read 
or the page table read. Page directory and page ta- 
ble entries may be placed in the Intel486 processor 
on-chip cache just like normal data. 


intel. 


6.4.6 PAGING OPERATION 


The paging hardware operates in the following fash- 
ion. The paging unit hardware receives a 32-bit lin- 
ear address from the segmentation unit. The upper 
20 linear address bits are compared with all 32 en- 
tries in the TLB to determine if there is a match. If 
there is a match (i.e., a TLB hit), then the 32-bit 
physical address is calculated and will be placed on 
the address bus. 


However, if the page table entry is not in the TLB, 
the Intel486 processor will read the appropriate 
Page Directory Entry. If P = 1 on the Page Directory 
Entry indicating that the page table is in memory, 
then the Intel486 processor will read the appropriate 
Page Table Entry and set the Access bit. If P = 1 on 
the Page Table Entry indicating that the page is in 
memory, the Intel486 processor will update the Ac- 
cess and Dirty bits as needed and fetch the oper- 
and. The upper 20 bits of the linear address, read 
from the page table, will be stored in the TLB for 
future accesses. However, if P = O for either the 
Page Directory Entry or the Page Table Entry, then 
the Intel486 processor will generate a page fault, an 
Exception 14. 


The Intel486 processor will also generate an excep- 
tion 14 page fault if the memory reference violated 
the page protection attributes (i.e., U/S or R/W) 
(e.g., trying to write to a read-only page). CR2 will 
hold the linear address which caused the page fault. 
If a second page fault occurs, while the Intel486 
processor is attempting to enter the service routine 
for the first, then the Intel486 processor will invoke 
the page fault (exception 14) handler a second time, 
rather than the double fault (exception 8) handler. 
Because Exception 14 is classified as a fault, CS: 
EIP will point to the instruction causing the page 
fault. The 16-bit error code pushed as part of the 
page fault handler will contain status bits which. indi- 
cate the cause of the page fault. 


The 16-bit error code is used by the operating sys- 
tem to determine how to handle the page fault. The 
upper portion of Figure 6-23 shows the format of the 
page-fault error code and the interpretation of the 
bits. 
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NOTES: 
*Descriptor table access will fault with U/S = 
0, even if the program is executing at level 3. 


U: UNDEFINED 


U/S: The U/S bit indicates whether the access 
causing the fault occurred when the Intel486 
processor was executing in User Mode (U/S = 
1) or in Supervisor mode (U/S = 0). 

W/R: The W/R bit indicates whether the ac- 
cess causing the fault was a Read (W/R = 0) 
or a Write (W/R = 1). 

P: The P bit indicates whether a page fault was 
caused by a not-present page (P = 0), or bya 
page level protection violation (P = 1). 


Figure 6-23. Page Fault System Information 


NOTE: 
Even though the bits in the error code (U/S, 
W/R, and P) have similar names as the bits 
in the Page Directory/Table Entries, the in- 
terpretation of the error code bits is different. 
Figure 6-23 indicates what type of access 
caused the page fault. 


6.4.7 OPERATING SYSTEM RESPONSIBILITIES 


The Intel486 processor takes care of the page ad- 
dress translation process, relieving the burden from 


- an operating system in a demand-paged system. 


The operating system is responsible for setting 
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up the initial page tables, and handling any page 
faults. The operating system also is required to inval- 
idate (i.e., flush) the TLB when any changes are 
made to any of the page table entries. The operating 
system must reload CR3 to cause the TLB to be 
flushed. a 


Setting up the tables is simply a matter of loading 
CR3 with the address of the Page Directory, and 
allocating space for the Page Directory and the 
Page Tables. The primary responsibility of the oper- 
ating system is to implement a swapping policy and 
handle all of the page faults. 


A final concern of the operating system is to ensure 
that the TLB cache matches the information in the 
paging tables. In particular, any time the operating 
system sets the P present bit of page table entry to 
zero, the TLB must be flushed. Operating systems 
may want to take advantage of the fact that CR3 is 
stored as part of a TSS, to give every task or group 
of tasks its own set of page tables. 


6.5 Virtual 8086 Environment 


6.5.1 EXECUTING 8086 PROGRAMS 


The Intel486 processor allows the execution of 8086 


application programs in both Real Mode and in the . 


Virtual 8086 Mode (Virtual Mode). Of the two meth- 
ods, Virtual 8086 Mode offers the system designer 
the most flexibility. The Virtual 8086 Mode allows the 
execution of 8086 applications, while still allowing 
the system designer to take full advantage of the 
Intel486 processor protection mechanism. In partic- 
ular, the Intel486 processor allows the simultaneous 
execution of 8086 operating systems and its applica- 
tions, and an Intel486 processor operating system 
and both 80286 and Intel486 processor applications. 
Thus, in a multi-user Intel486 processor computer, 
one person could be running an MS-DOS* spread- 
sheet, another person using MS-DOS", and a third 
person could be running multiple UNIX utilities and 
applications. Each person in this scenario would be- 
lieve that he had the computer completely to him- 
self. Figure 6-24 illustrates this concept. 
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6.5.2 VIRTUAL 8086 MODE ADDRESSING 
MECHANISM 


One of the major differences between Intel486 proc- 
essor Real and Protected modes is ‘how the seg- 
ment selectors are interpreted. When the Intel486 


processor is executing in Virtual 8086 Mode the seg- 


ment registers are used in an identical fashion to 


Real Mode. The contents of the segment register is — 
shifted left 4 bits and added to the offset to form the | 


segment base linear address. 


The Intel486 processor allows the operating system 
to specify which programs use the 8086 style ad- 
dress mechanism, and which programs use Protect- 
ed Mode addressing, on a per task basis. Through 
the use of paging, the one megabyte address space 
of the Virtual Mode task can be mapped to any- 
where in the 4-Gbyte linear address space of the 
Intel486 processor. Like Real Mode, Virtual Mode 
effective addresses (i.e., segment offsets) that ex- 


_ceed 64 Kbyte will cause an exception 13. However, - 


these restrictions should not prove to be important, 
because most tasks running in Virtual 8086 Mode 
will simply be existing 8086 application programs. 


6.5.3 PAGING IN VIRTUAL MODE 


The paging hardware allows the concurrent running 
of multiple Virtual Mode tasks, and provides protec- 
tion and operating system isolation. Although it is 
not strictly necessary to have the paging hardware 
enabled to run Virtual Mode tasks, it is needed in 
order to run multiple Virtual Mode tasks or to relo- 
cate the address space of a Virtual Mode task to 
physical address space greater than one Mbyte. 


The paging hardware allows the 20-bit linear ad- 
dress produced by a Virtual Mode program to be 
divided into up to 256 pages. Each one of the pages 
can be located anywhere within the maximum 
4-Gbyte physical address space of the Intel486 
processor. In addition, because CR3 (the Page Di- 
rectory Base Register) is loaded by a task switch, 
each Virtual Mode task can use a different mapping 
scheme to map pages to different physical locations. 
Finally, the paging hardware allows the sharing of 
the 8086 operating system code between multiple 
8086 applications. Figure 6-24 shows how the 
Intel486 processor paging hardware enables multi- 
ple 8086 programs to run under a virtual memory 
demand paged system. 
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Figure 6-24. Virtual 8086 Environment Memory Management 


6.5.4 PROTECTION AND I/O PERMISSION 
BITMAP 


All Virtual 8086 Mode programs execute at privilege 
level 3, the level of least privilege. As such, Virtual 
8086 Mode programs are subject to all of the protec- 
tion checks defined in Protected Mode. (This is dif- 
ferent from Real Mode which implicitly is executing 
at privilege level 0, the level of greatest privilege.) 
Thus, an attempt to execute a privileged instruction 
when in Virtual 8086 Mode will cause an exception 
13 fault. 


The following are privileged instructions, which may 
be executed only at Privilege Level 0. Therefore, at- 
tempting to execute these instructions in Virtual 
8086 Mode (or anytime CPL > 0) causes an excep- 
tion 13 fault: 


LIDT; MOV DRn,reg; MOV reg,DRn; 
LGDT; MOV TRn,reg; MOV reg,TRn; 
LMSW; MOV CRn,reg; MOV reg,CRn. 
CLTS ; 
HLT; 
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Several instructions, particularly those applying to 
the multitasking model and protection model, are 
available only in Protected Mode. Therefore, at- 
tempting to execute the following instructions in 
Real Mode or in Virtual 8086 Mode generates an 
exception 6 fault: 


LTR; STR; 
LLDT ; SLDT ; 
LAR; VERR ; 
‘LSL; VERW ; | 
ARPL. 


The instructions that are IOPL-sensitive in Protected 
Mode are: 


IN; sii: 
OUT; CLI 
INS; 

OUTS; 

~ REP INS; 

- REP OUTS; 


In Virtual 8086 Mode, a slightly different set of in- 
structions are made IOPL-sensitive. The following in- 
structions are [OPL-sensitive in Virtual 8086 Mode: 


INT n; SEL 
PUSHF ; CLI ; 
POPF ; IRET © 


The PUSHF, POPF, and IRET instructions are lIOPL- 
sensitive in Virtual 8086 Mode only. This provision 
allows the IF flag (interrupt enable flag) to be virtual- 
ized to the Virtual 8086 Mode program. The INT n 
software interrupt instruction is also |OPL-sensitive 
in Virtual 8086 Mode. Note, however, that the INT 3 
(opcode OCCH), INTO, and BOUND instructions are 
not IOPL-sensitive in Virtual 8086 mode (they aren’t 
lIOPL sensitive in Protected Mode either). 


Note that the I/O instructions (IN, OUT, INS, OUTS, 
REP INS, and REP OUTS) are not IOPL-sensitive in 
Virtual 8086 mode. Rather, the I/O instructions be- 
come automatically sensitive to the I/O Permission 
Bitmap contained in the Intel486 processor Task 
State Segment. The |/O Permission Bitmap, auto- 
matically used by the Intel486 processor in Virtual 
8086 Mode, is illustrated by Figure 6-14 and Figure 
6-15. 


The I/O Permission Bitmap can be viewed as a 
0-64 Kbit string, which begins in memory at offset 
Bit_.Map__Offset in the current TSS. Bit_Map__ 
Offset must be < DFFFH so the entire bit map and 
the byte FFH which follows the bit map are all at 
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offsets < FFFFH from the TSS base. The 16-bit 
pointer Bit_Map__Offset (15:0) is found in the word 
beginning at offset 66H (102 decimal) from the TSS 
base, as shown in Figure 6-14. 


Each bit in the |/O Permission Bitmap corresponds 
to a single byte-wide I/O port, as illustrated in Figure 
6-14. If a bit is 0, 1/0 to the corresponding byte-wide 
port can occur without generating an exception. Oth- 
erwise the I/O instruction causes an exception 13 
fault. Because every byte-wide I/O port must be pro- 
tectable, all bits corresponding to a word-wide or 
dword-wide port must be 0 for the word-wide or 
dword-wide !/O to be permitted. If all the referenced 
bits are 0, the I/O will be allowed. If any referenced 
bits are 1, the attempted I/O will cause an exception 
13 fault. 


Due to the use of a pointer to the base of the I/O 
Permission Bitmap, the bitmap may be located any- 
where within the TSS, or may be ignored completely 
by pointing the Bit_Map__Offset (15:0) beyond the 
limit of the TSS segment. In the same manner, only 
a small portion of the 64K I/O space need have an 
associated map bit, by adjusting the TSS limit to 
truncate the bitmap. This eliminates the commitment 
of 8K of memory when a complete bitmap is not 
required, while allowing the fully general case if de- 
sired. 


Example of Bitmap for !/O Ports 0-255: Setting 
the TSS limit to {bit_Map__Offset + 31 + 1**} 
[** see note below] will allow a 32-byte bitmap for 
the |1/O ports #0-255, plus a terminator byte of all 
1’s [** see note below]. This allows the I/O bitmap 
to control 1/O Permission to 1/O port 0-255 while 
causing an exception 13 fault on attempted I/O to 
any I/O port 80256 through 65,565. 


**IMPORTANT IMPLEMENTATION NOTE: 

Beyond the last byte of |/O mapping infor- 

mation in the I/O Permission Bitmap must 

be a byte containing all 1’s. The byte of all 

1’s must be within the limit of the Intel486 
_ processor TSS segment (see Figure 6-14). 


6.5.5 INTERRUPT HANDLING 


In order to fully support the emulation of an 8086 
machine, interrupts in Virtual 8086 Mode are han- 
dled in a unique fashion. When running in Virtual 
Mode all interrupts and exceptions involve a privi- 
lege change back to the host Intel486 processor op- 
erating system. The Intel486 processor operating 
system determines if the interrupt comes from a 


| 
intel. 
Protected Mode application or from a Virtual Mode 


program by examining the VM bit in the EFLAGS 
image stored on the stack. 


When a Virtual Mode program is interrupted and ex- 
ecution passes to the interrupt routine at level 0, the 
VM bit is cleared. However, the VM bit is still set in 
the EFLAG image on the stack. ; 


The Intel486 processor operating system in turn 
handles the exception or interrupt and then returns 
control to the 8086 program. The Intel486 processor 
operating system may choose to let the 8086 oper- 
ating system handle the interrupt or it may emulate 
the function of the interrupt handler. For example, 
many 8086 operating system calls are accessed by 
PUSHing parameters on the stack, and then execut- 
ing an INT n instruction. If the IOPL is set to 0 then 
all INT n instructions will be intercepted by the in- 
tel486 processor operating system. The Intel486 
processor operating system could emulate the 8086 
operating system’s call. Figure 6-25 shows how the 
Intel486 processor operating system could intercept 
an 8086 operating system’s call to “Open a File.” 


An Intel486 processor operating system can provide 
a Virtual 8086 Environment which is totally transpar- 
ent to the application software via intercepting and 
then emulating 8086 operating system’s calls, and 
intercepting IN and OUT instructions. 


6.5.6 ENTERING AND LEAVING VIRTUAL 8086 
MODE 


Virtual 8086 mode is entered by executing an IRET 
instruction (at CPL=0), or Task Switch (at any CPL) 
to an Intel486 processor task whose Intel486 proc- 
essor TSS has a FLAGS image containing a 1 in the 
VM bit position while the Intel486 processor is exe- 
cuting in Protected Mode. That is, one way to enter 
Virtual 8086 mode is to switch to a task with an In- 
tel486 processor TSS that has a 1 in the VM bit in 
the EFLAGS image. The other way is to execute a 
32-bit IRET instruction at privilege level 0, where the 
stack has a 1 in the VM bit in the EFLAGS image. 
POPF does not affect the VM bit, even if the Intel486 
processor is in Protected Mode or level 0, and so 
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cannot be used to enter Virtual 8086 Mode. PUSHF 
always pushes a 0 in the VM bit, even if the Intel486 
processor is in Virtual 8086 Mode, so that a program 
cannot tell if it is executing in REAL mode, or in Vir- 
tual 8086 mode. 


The VM bit can be set by executing an IRET instruc- 
tion only at privilege level 0, or by any instruction or 
Interrupt which causes a task switch in Protected 
Mode (with VM=1 in the new FLAGS image), and 
can be cleared only by an interrupt or exception in 
Virtual 8086 Mode. IRET and POPF instructions exe- 
cuted in REAL mode or Virtual 8086 mode will not 
change the value in the VM bit. | 


The transition out of virtual 8086 mode to Intel486 
processor protected mode occurs only on receipt of 
an interrupt or exception (such as due to a sensitive 
instruction). In Virtual 8086 mode, all interrupts and 
exceptions vector through the protected mode IDT, 
and enter an interrupt handler in protected Intel486 
processor mode. That is, as part of interrupt pro- 
cessing, the VM bit is cleared. 


Because the matching IRET must occur from level 0, 
if an Interrupt or Trap Gate is used to field an inter- 
rupt or exception out of Virtual 8086 mode, the Gate 
must perform an inter-level interrupt only to level 0. 
Interrupt or Trap Gates through conforming seg- 
ments, or through segments with DPL> 0, will raise a 
GP fault with the CS selector as the error code. 


6.5.6.1 Task Switches To and From Virtual 8086. 
Mode 


Tasks which can execute in virtual 8086 mode must 
be described by a TSS with the new Intel486 proces- 
sor format (TYPE 9 or 11 descriptor). 


A task switch out of virtual 8086 mode will operate 
exactly the same as any other task switch out of a 
task with an Intel486 processor TSS. All of the pro- 
grammer visible state, including the FLAGS register 
with the VM bit set to 1, is stored in the TSS. 


The segment registers in the TSS will contain 8086 
segment base values rather than selectors. 
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A task switch into a task described by an Intel486 


processor TSS will have an additional check to de- 


termine if the incoming task should be resumed in 
virtual 8086 mode. Tasks described by 80286 format 
TSSs cannot be resumed in virtual 8086 mode, so 
no check is required there (the FLAGS image in 


80286 format TSS has only the low order 16 FLAGS. 


bits). Before loading the segment register images 
from an Intel486 processor TSS, the FLAGS image 
is loaded, so that the segment registers are loaded 
from the TSS image as 8086 segment base values. 
The task is now ready to resume in virtual 8086 exe- 
cution mode. 


6.5.6.2 Transitions Through Trap and Interrupt 
Gates, and IRET 


A task switch is one way to enter or exit virtual 8086 
mode. The other method is to exit through a Trap or 
Interrupt gate, as part of handling an interrupt, and 
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. Figure 6-25. Virtual 8086 Environment Interrupt and Call Handling 


. 8086 Application makes “Open 
File Call,” causes General 
Protection Fault. 


. Virtual 8086 Monitor intercepts 
call. Calls Intel486 
processor OS. 


. Intel486 processor OS opens 
files. Returns control to 8086 
OS. 


4. 8086 OS returns control to 
Application. 


(Transparent to Application.) _ 
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to enter as part of executing an IRET instruction. 
The transition out must use an Intel486 processor 
Trap Gate (Type 14), or Intel486 processor Interrupt 
Gate (Type 15), which must point to a non-conform- 
ing level 0 segment (DPL=0) in order to permit the 
trap handler to IRET back to the Virtual 8086 pro- 
gram. The Gate must point to a non-conforming lev- 
el 0 segment to perform a level switch to level 0 so 
that the matching IRET can change the VM bit. In- 
tel486 processor gates must be used, because 
80286 gates save only the low 16 bits of the FLAGS 
register, so that the VM bit will not be saved on tran- 
sitions through the 80286 gates. Also, the 16-bit 
IRET (presumably) used to terminate the 80286 in- 
terrupt handler will pop only the lower 16 bits from 
FLAGS, and will not affect the VM bit. The action 
taken for an Intel486 processor Trap or Interrupt 
gate if an interrupt occurs while the task is executing 
in virtual 8086 mode is given by the following se- 
quence. 


intel. 


1. Save the FLAGS register in a temp to push later. 
Turn off the VM and TF bits, and if the interrupt is 
serviced by an Interrupt Gate, turn off IF also. 


2. Interrupt and Trap gates must perform a level 
switch from 3 (where the VM86 program exe- 
cutes) to level 0 (so IRET can return). This pro- 
cess involves a stack switch to the stack given in 
the TSS for privilege level 0. Save the Virtual 
8086 Mode SS and ESP registers to push in a 


later step. The segment register load of SS will be ~ 


done as a Protected Mode segment load, be- 
cause the. VM bit was turned off above. 


3. Push the 8086 segment register values onto the 
new stack, in the order: GS, FS, DS, ES. These 
are pushed as 32-bit quantities, with undefined 
values in the upper 16 bits. Then load these 4 
registers with null selectors (0). | 


4. Push the old 8086 stack pointer onto the new 
stack by pushing the SS register (as 32-bits, high 
bits undefined), then pushing the 32-bit ESP regis- 
ter saved above. 


5. Push the 32-bit FLAGS register saved in step 1. 


6. Push the old 8086 instruction pointer onto the 
new stack by pushing the CS register (as 32-bits, 
high bits undefined), then pushing the 32-bit EIP 
register. 


7. Load up the new CS:EIP value from the interrupt 
gate, and begin execution of the interrupt routine 
in protected Intel486 processor mode. 


The transition out of virtual 8086 mode performs a 
level change and stack switch, in addition to chang- 
ing back to protected mode. In addition, all of the 
8086 segment register images are stored on the 
stack (behind the SS:ESP image), and then loaded 
with null (0) selectors before entering the interrupt 
handler. This will permit the handler to safely save 
and restore the DS, ES, FS, and GS registers as 
80286 selectors. This is needed so that interrupt 
handlers which don’t care about the mode of the 
interrupted program can use the same prolog and 
epilog code for state saving (i.e., push all registers in 
prolog, pop all in epilog) regardless of whether or not 
a ‘native’ mode or Virtual 8086 mode program was 
interrupted. Restoring null selectors to these regis- 
ters before executing the IRET will not cause a trap 
in the interrupt handler. Interrupt routines which ex- 
pect values in the segment registers, or return val- 
ues in segment registers will have to obtain/return 
values from the 8086 register images pushed onto 
the new stack. They will need to know the mode of 
the interrupted program in order to know where to 
find/return segment registers, and also to know how 
to interpret segment register values. 
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~ The IRET instruction will perform the inverse of the 


above sequence. Only the extended Intel486 proc- 
essor IRET instruction (operand size=32) can be 
used, and must be executed at level 0 to change the 
VM bit to 1. 


1. If the NT bit in the FLAGs register is on, an inter- 
task return is performed. The current state is 
stored in the current TSS, and the link field in the 
current TSS is used to locate the TSS for the in- 
terrupted task which is to be resumed. Otherwise, 
continue with the following sequence. 


2. Read the FLAGS image from SS:8[ESP] into the 
FLAGS register. This will set VM to the value ac- 
tive in the interrupted routine. 


3. Pop off the instruction pointer CS:EIP. EIP is. 
popped first, then a 32-bit word is popped which 
contains the CS value in the lower 16 bits. If 
VM = 0, this CS load is done as a protected mode 
segment load. If VM=1, this will be none as an 
8086 segment load. 


4. ESP register by 4 to bypass the FLAGS image 
which was “popped” in step 7. 


5. If VM=1, load segment registers ES, DS, FS, and 
GS from memory locations SS:[ESP+ 8], 
SS:[ESP + 12], SS:[ESP + 16], and 
SS:[ESP + 20], respectively, where the new value 
of ESP stored in step 4 is used. Because VM= 1, 
these are done as 8086 segment register loads. 
Else if VM=0O, check that the selectors in ES, DS, 
FS, and GS are valid in the interrupted routine. 
Null out invalid selectors to trap if an attempt is 
made to access through them. 


6. If (RPL(CS) > CPL), pop the stack pointer 
SS:ESP from the stack. The ESP register is 
popped first, followed by 32-bits containing SS in 
the lower 16 bits. If VM=0, SS is loaded as a 
protected mode segment register load. If VM=1, 
an 8086 segment register load is used. 


7. Resume execution of the interrupted routine. The 
VM bit in the FLAGS register (restored from the 
interrupt routine’s stack image in step 1) deter- 
mines whether the Intel486 processor resumes 
the interrupted routine in Protected mode of Virtu- 
al 8086 mode. 
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7.0 ON-CHIP CACHE 


All members of the Intel486 processor family, except 
the IntelDX4 processor, contain an on-chip 8-Kbyte 


cache. (See section 7.1.2, ‘“IntelDX4 Processor On- 
Chip Cache,” for the IntelDX4 processor cache or- 
ganization.) The cache is software transparent to 
maintain binary compatibility with previous genera- 
tions of the Intel Architecture. 


The on-chip cache has been designed for maximum 
flexibility and performance. The cache has several 
_ operating modes offering flexibility during program 
execution and debugging. Memory areas can be de- 
fined as non-cacheable by software and external 
hardware. Protocols for cache line invalidations and 
- replacement are implemented in hardware, easing 
system design. 


intelDX4™ Processor Only 
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The on-chip cache is a unified code and data cache. 
The cache is used for both instruction and data ac- 
cesses and acts on physical addresses. (See sec- 
tion 7.1.2 for IntelDX4 processor details). 


7.1 Cache Organization 


The cache organization is 4-way set associative and 
each line is 16-bytes wide. The eight Kbytes of 
cache memory are logically organized as 128 sets, 
each containing four lines. 


The cache memory is physically split into four 
2-Kbyte blocks, each containing 128 lines. (See Fig- 


- ure.7-1.) There are 128 21-bit tags associated with 


I a 


each 2-Kbyte block. There is a valid bit for each line 
in the cache. Each line in the cache is either valid or 
not valid. There are no provisions for partially valid 
lines. | 


All Other Intel486™ Processors 


242202-53 


Figure 7-1. On-Chip Cache Physical Organization 
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For all Intel486 processors except the Write-Back 
Enhanced IntelDX2, the on-chip cache is write- 
through only. All writes will drive an external write 
bus cycle in addition to writing the information to the 
internal cache if the write was a cache hit. A write to 
an address not contained in the internal cache will 
only be written to external memory. Cache alloca- 
tions are not made on write misses. 


The Write-Back Enhanced IntelDX2 processor sup- 
ports two modes of operation with respect to internal 
cache configurations: Standard Bus Mode (write- 
through cache) and Enhanced Bus Mode (write- 
back cache). Standard Bus Mode operation for the 
Write-Back Enhanced IntelDX2 is the same as the 
write-through cache for all other Intel486 proces- 
sors. (See section 7.1.1, “Write-Back Enhanced 
IntelDX2 Processor Cache” and other write-back en- 
hanced sections below for write-back cache infor- 
mation.) 


7.1.1 WRITE-BACK ENHANCED INTELDX2 
PROCESSOR CACHE 


The Write-Back Enhanced IntelDX2 processor im- 
plements a unified cache, with a total cache size of 8 
Kbytes. The processor’s on-chip cache supports a 
modified MESI (modified/exclusive/shared/invalid) 
write-back cache consistency protocol. 


The Write-Back Enhanced IntelDX2 processor inter- 
nal cache is configurable as write-back or write- 
through on a line by line basis, provided the cache is 
enabled for write-back operation. The cache is en- 
abled for write-back operation by driving the WB/ 
WT # pin to a high state for at least two clocks be- 
fore and two clocks after the falling edge of the RE- 
SET. Cache write-back and invalidations can be ini- 
tiated by hardware or software. Protocols for cache 
consistency and line replacement are implemented 
in hardware to ease system design. 


Once the cache configuration is selected, the 
Write-Back Enhanced IntelDX2 processor will 
continue to operate in the selected configura- 
tion and can only be changed to a different configu- 
ration by starting the RESET process again. Asser- 
tion of SRESET will not change the operating mode 
of the processor. WB/WT# has an internal pull 
down; If WB/WT# is unconnected, the processor 
will be in Standard Bus Mode, i.e., the on-chip cache 
is write-through. Table 7-1 lists the two modes of 
operation and the differences between the two 
modes. 
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Unless specifically noted, the following sections 


apply to the Write-Back Enhanced IntelDX2 in 
standard Bus Mode (Write-Through Cache) and 


~ all other Intel486 processors. 


7.1.2 INTELDX4 PROCESSOR CACHE 


The IntelDX4 processor contains a 16-Kbyte write- 
through cache. The 16 Kbytes of cache memory are 
logically organized as 256 sets, each containing four 
lines. 


The cache memory is physically split into four 
4-Kbyte blocks, each containing 256 lines. (See Fig- 
ure 7-1.) There are 256 20-bit tags associated with 
each 2-Kbyte block. 


All other details listed in section 7.1 for the 8-Kbyte 
on-chip cache also apply to the IntelDX4 on-chip 
cache. . 


7.2 Cache Control 


Control of the cache is provided by the CD and NW 
bits in CRO. CD enables and disables the cache. NW 
controls memory write-through and invalidates. 


The CD and NW bits define four operating modes of 
the on-chip cache as given in Table 7-2. These 
modes provide flexibility in how the on-chip cache is 
used. ~ 


CD=1, NW=1 


The cache is completely disabled by setting 
CD=1 and NW=1 and then flushing the 
cache. This mode may be useful for debug- 
ging programs where it is important to see 
all memory cycles at the pins. Writes that hit 
in the cache will not appear on the external 
bus. 


It is possible to use the on-chip cache as 
fast static RAM by “pre-loading’’ certain 
memory areas into the cache and then set- 
ting CD=1 and NW=1. Pre-loading can be 
done by careful choice of memory refer- 
ences with the cache turned on or by use of 
the testability functions. (See section 11.2, 
“On-Chip Cache Testing.’’) When the cache 
is turned off, the memory mapped by the 
cache is “frozen” into the cache because 
fills and invalidates are disabled. 
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State of WB/WT # : | | | 
at Falling Edge of Effect on Write-Back Enhanced IntelDX2™ Processor Operation 
RESET 


| WB/WT# = LOW | Processor is in Standard Bus Mode (Write-Through Cache) 
| . When FLUSH # is asserted, the internal cache will be invalidated in one system 

CLK. : 

. No Special FLUSH # Acknowledge Cycles appear on the bus after the assertion of 
the FLUSH # pin. : 

. All write-back specific inputs are ignored (INV, WB/WT #) 

. SRESET does not clear the SMBASE register. It behaves much like a RESET 
(invalidating the on-chip cache and resetting the CRO register, for example). 
SRESET is NOT an interrupt. 


WB/WT# = HIGH | Processor is in Enhanced Bus Mode (Write-Back Cache) 
1. Write backs will be performed when a cache flush is requested (via the FLUSH # 
pin or the WBINVD instruction). The system must watch for the FLUSH # special 

_ cycles to determine the end of the flush. 

. The special FLUSH # Acknowledge Cycles will appear on the bus after the 
assertion of the FLUSH# and after all the cache write backs (if any) are completed 
on the bus. | | 

. WB/WT# is a sampled on a line by line basis to determine the state of a line to be 
allocated in the cache (as a Write Through (S state) or as Write Back (E state)). 

. The WB/WT# and INV inputs are no longer ignored. HITM# and CACHE # will be: 
driven during appropriate bus cycles. 

. PLOCK ¢ is always driven inactive. 

. SRESET is an interrupt. SRESET does not reset the SMBASE register or flush the 
on-chip cache. The CRO register gets the same values as after RESET with the 
exception of the CD and NW bits. These two bits retain their previous status. (See 
section 9.2.18.4, “Soft Reset (SRESET)” and Table 3-7 for details on SRESET for 
write-back enhanced mode.) 


Table 7-2. Cache Operating Modes the same as if the KEN# pin was 

: strapped HIGH disabling cache fills. 
Write-throughs and invalidates may still 
a fille di 7m occur to keep the cache valid. This mode 

ea NE ame * is useful if the software must disable the 
g cache for a short period of time, and then 

disabled re-enable it without flushing the original 


Cache fills disabled, write- contents. 
through and invalidates CD=0, NW=+1 
enabled | Invalid. If CRO is loaded with this bit con- 


INVALID. If CRO is loaded figuration, a General Protection fault with 
error code of 0 will occur. 


with this configuration of bits, 
a GP fault with error code of 0 CD=0, NW=0 
is raised. | . This is the normal operating mode. 


Cache fills enabled, we: Completely disabling. the cache is a two-step pro- 
through and invalidates cess. First, CD and NW must be set to 1, and then 
enabled the cache must be flushed. If the cache is not 
flushed, cache hits on reads will still occur and data 
will be read from the cache. 


Cache fills are disabled but write-throughs 
and invalidates are enabled. This mode is 
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7.2.1 WRITE-BACK ENHANCED INTELDX2 
PROCESSOR CACHE CONTROL AND 
OPERATING MODES | 


The Write-Back Enhanced IntelDX2 processor re- 
tains the usage of CRO.CD and CRO.NW,, in which 
the 1,1 state forces a cache-off condition after RE- 
SET, and the 0,0 state is the normal run state. Table 
7-3 defines these control bits when the cache is en- 
abled for write-back operation. Table 7-3 is also val- 
id when the cache is in write-back mode and some 
lines are in a write-through state. 


CD=1, NW=1 


The 1,1 state is best used when no lines 
are allocated, which occurs naturally after 
RESET (but not SRESET), but must be 
forced (e.g., by instruction WBINVD) if en- 
tered during normal operation. In these 
cases, the Write-Back Enhanced IntelDX2 
processor will operate as if it had no 
cache at all. 


If the 1,1 state is exited, lines that are allo- 
cated as write back will be written back 
-upon a snoop hit or replacement cycle. 
Lines that were allocated as write-through 
(and later modified while in the 1,1 state) 
will never appear on the bus. 


NW=0 


The only difference from the normal 0,0 
“run” state is that new line fills (and the 
line replacements that result from capaci- 
ty limitations) do not occur. This causes 
the contents of the cache to be locked in, 
unless lines are invalidated using snoops. 


CD=1, 
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7.3 Cache Line Fills 


Any area of memory can be cached in the Intel486 
processor. Non-cacheable portions of memory can 
be defined by the external system or by software. 
The external system can inform the Intel486 proces- 
sor that a memory address is non-cacheable by re- 
turning the KEN# pin inactive during a memory ac- 
cess. (Refer to section 10.2.3, ‘““Cacheable Cycles.’”’) 
Software can prevent certain pages from being 
cached by setting the PCD bit in the page table en- 


try. 


A read request can be generated from program op- 
eration or by an instruction pre-fetch. The data will 
be supplied from the on-chip cache if a cache hit 
occurs on the read address. If the address is not in 
the cache, a read request for the data is generated 
on the external bus. 


If the read request is to a cacheable portion of mem- 
ory, the Intel486 processor initiates a cache line fill. 
During a line fill a 16-byte line is read into the In- 
tel486 processor. Cache line fills will only be gener- 
ated for read misses. Write misses will never cause 
a line in the internal cache to be allocated. If a cache 
hit occurs on a write, the line will be updated. Cache 
line fills can be performed over 8- and 16-bit buses 
using the dynamic bus sizing feature. Refer to sec- 
tion 10.1.2, “Dynamic Data Bus Sizing” for a de- 
scription of dynamic bus sizing and section 10.2.3, 
“Cacheable Cycles” for further information on 
cacheable cycles. . 


Table 7-3. Write-Back Enhanced IntelDX2T™ Processor Write-Back Cache Operating Modes 


1,0 read cache read bus 
(no fill) 


; read bus, 
(state DURING line fill 


normal operation) 


read cache 


NOTE: 


WRITE 
1 
1,1 read cache read bus write cache write bus 
(state after reset) (no fill) (no write-through) accepted 


write cache, write bus if S 


0,1 This is a fault-protected disallowed state. A GP(0) will occur if an attempt is made to 
load CRO with this state. | 
0,0 


write cache, write bus if S 


not 


normal 
operation 


write bus 


3 


normal 
operation 


1. Normal MESI state transitions occur on write hits in all legal states. 
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7.4 Cache Line Invalidations 


_ The Intel486 processor contain both a hardware and 
software mechanism for invalidating lines in its inter- 
nal cache. Cache line invalidations are needed to 
keep the Intel486 processor cache contents consist- 
ent with external memory. 


Refer to section 10.2.8, ‘‘Invalidate Cycles” for fur- 
ther information on cache line invalidations. 


7.4.1 WRITE-BACK ENHANCED INTELDX2 
PROCESSOR SNOOP CYCLES AND | 
_WRITE-BACK MODE INVALIDATION 


In Enhanced bus mode, the Write-Back Enhanced 
IntelDX2 processor performs invalidations differently 
than other Intel486 processors. Snoop Cycles are 
initiated by the system to determine if a line is pres- 
ent in the cache, and what the state is. Snoop cycles 
may further be classified as Inquire cycles or Invali- 
date cycles. Inquire cycles are driven to the Write- 
Back Enhanced IntelDX2 processor when another 
bus master initiates a memory read cycle, to deter- 
mine if the processor cache contains the latest data. 
. If the snooped line is in the Write-Back Enhanced 
IntelDX2 processor cache and has the most recent 
information, the processor must schedule a write 
back of the data. Inquire cycles are driven with 
INV = “0”. Invalidate cycles are driven to the Write- 
Back Enhanced IntelDX2 processor when the other 
bus master initiates a memory write cycle to deter- 
mine if the Write-Back Enhanced IntelDX2 proces- 
sor cache contains the snooped line. The Invalidate 
cycles are driven with INV = “1”, so that if the 


Table 7-4. Encoding of the Special Cycles for Write-Back Cache 


| 

intel. 
snooped line is in the on-chip cache, the line is inval- 
idated. Snoop cycles are described in detail in the 


“Bus Functional Description’’ section. 


The Write-Back Enhanced IntelDX2 processor has 


control mechanisms (including snooping) for writing 


back the modified write-back lines and invalidating 
the cache. There are special bus cycles associated 
with write-backs and invalidation. All of the Write- 


- Back Enhanced IntelDX2 processor special cycles 


require acknowledgment by RDY # or BRDY #. Dur- 
ing the special cycles, the addresses shown in the 
Table 7-4 are driven onto the address bus and the 
data bus is left undefined. : 


7.5 Cache Replacement 


When a line needs to be placed in its internal cache. 
the Intel486 processor first checks to see if there is 
a non-valid line in the set that can be replaced. If all 
four lines in the set are valid, a pseudo least-recent- 
ly-used mechanism is used to determine which line 
should be replaced. 


A valid bit is associated with each line in the cache. 
When a line needs to be placed in a set, the four 
valid bits are checked to see if there is a non-valid 
line that can be replaced. If a non-valid line is found, 
that line is marked for replacement. 


The four lines in the set are labeled IO, 11, 12, and 13. 
The order in which the valid bits are checked during 
an invalidation is 10, 11, 12 and 13. All valid bits are 
cleared when the processor is reset or when the 
cache is flushed. . 


* For the Write-Back Enhanced IntelDX2 processor only. FLUSH is present on all Intel486 processors, but differs for Stan- 
dard Mode. Refer to appropriate sections. . 


2-118 


| 

intel. 

Replacement in the cache is handled by a pseudo 
least recently used (LRU) mechanism when all four 
lines in a set are valid. Three bits, BO, B1 and B2, 
are defined for each of the 128 sets in the cache. 


These bits are called the LRU bits. The LRU bits are 
updated for every hit or replace in the cache. 


If the most recent access to the set was to 10 or 11, 
BO is set to 1. BO is set to 0 if the most recent ac- 
cess was to 1/2 or I3. If the most recent access to 


10:11 was to 10, B1 is set to 1, else Bi is set to 0. If . 


the most recent access to 12:I3 was to |2, B2 is set to 
1, else B2 is set to 0. 


The pseudo LRU mechanism works in the following 
manner. When a line must be replaced, the cache 
will first select which of 10:11 and 12:13 was least re- 
cently used. Then the cache will determine which of 
the two lines was least recently used and mark it for 
replacement. This decision tree is shown in Figure 
7-2. 


All four lines 
in the set valid? 


No Replace 
non-vailid line 


Yes: 10 or I1 No: 12 or 13 
least recently least recently 
used 8 used 


—> 


B2 =0? 


ves//\vo ANG 


Replace Replace Replace Replace 
10 14 12 13 
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B1 =0? 


Figure 7-2. On-Chip Cache 
Replacement Strategy 


7.6 Page Cacheability 


Two bits for cache control, PWT and PCD, are de- 
fined in the page table and page directory entries. 
The state of these bits are driven out on the PWT 
and PCD pins during memory access cycles. 
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The PWT bit controls the write policy for second lev- 
el caches used with the Intel486 processor. Setting 
PWT =1 defines a write-through policy for the cur- 
rent page while PWT=0 defines the possibility of 
write-back. The state of PWT is ignored internally by 
the Intel486 processor for on-chip cache in write 
through mode. 


The PCD bit controls cacheability on a page by page 
basis. The PCD bit is internally AND’ed with the 
KEN# signal to control cacheability on a cycle by 
cycle basis (see Figure 7-3). PCD=0 enables cach- 
ing while PCD = 1 forbids it. Note that cache fills are 
enabled when PCD=0 AND KEN# =0. This logical 
AND is implemented physically with a NOR gate. 


The state of the PCD bit in the page table entry is 
driven on the PCD pin when a page in external mem- 
ory is accessed. The state of the PCD pin informs 
the external system of the cacheability of the re- | 
quested information. The external system then re- 
turns KEN # telling the Intel486 processor if the area 
is cacheable. The Intel486 processor initiates a 
cache line fill if PCD and KEN# indicate inet the 
requested information is cacheable. 


The PCD bit is OR’ed with the CD (cache disable) bit 
in control register 0 to determine the state of the 
PCD pin. If CD=1, the Intel486 processor forces the 
PCD pin HIGH. If CD=0O, the PCD pin is driven with 
the value for the page table entry/directory. (See 
Figure 7-3.) 


The PWT and PCD bits for a bus cycle are obtained 
from either CR3, the page directory or page table 
entry. These bits are assumed to be zero during real 
mode, whenever paging is disabled, or for cycles 
that bypass paging, (I/O references, interrupt ac- 
knowledge and HALT cycles). 


When paging is enabled, the bits from the page table 
entry are cached in the TLB, and are driven any time 


the page mapped by the TLB entry is referenced. 


For normal memory cycles, PWT and PCD are taken 
from the page table entry. During TLB refresh cycles 
where the page table and directory entries are read, 
the PWT and PCD bits must be obtained elsewhere. 
During page table updates the bits are obtained from 
the page directory. When the page directory is up- 
dated the bits are obtained from CR3. PCD and PWT 
bits are initialized to zero at reset, but can be modi- 
fied by level 0 software. 
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Figure 7-3. Page Cacheability 


7.6.1 Write-Back Enhanced IntelDX2 | 
PROCESSOR PAGE CACHEABILITY 


In Write-Back Enhanced IntelDX2 processor-based 
system, both the processor and the system hard- 
ware must determine the cacheability and the con- 
figuration (write-back or write-through) on a line by 
line basis. The system hardware’s cacheability is de- 
termined by KEN# and the configuration by 
WB/WT #. The processor’s indication of cacheabili- 
ty is determined by PCD and the configuration by 
PWT. The PWT bit controls the write policy for the 
second level caches used with the Write-Back En- 
hanced IntelDX2 processor. Setting PWT to 1 de- 
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fines a write-through policy for the current page, 
while clearing PWT to 0 defines a write-back policy 
for the current page. 


7.7 Cache Flushing 


The on-chip cache can be flushed by external hard- 
ware or by software instructions. Flushing the cache 
clears all valid bits for all lines in the cache. The 
cache is flushed when external parawale asserts the 
FLUSH # pin. 


a 

intel : 

The FLUSH# pin needs to be asserted for one 
clock if driven synchronously or for two clocks if driv- 
en asynchronously. FLUSH# is asynchronous, but 
setup and hold times must be met for recognition in 
a particular cycle. FLUSH# should be de-asserted 
before the cache flush is complete. Failure to de-as- 
sert the pin will cause execution to stop as the proc- 
essor will be repeatedly flushing the cache. If exter- 
nal hardware activates flush in response to an I/O 
write, FLUSH# must be asserted for at least two 
clocks prior to ready being returned for the I/O write. 
This ensures that the flush completes before the 
processor begins execution of the instruction follow- 
ing the OUT instruction. 


The instructions INVD and WBINVD cause the on- 
chip cache to be flushed. External caches connect- 
ed to the Intel486 processor are signaled to flush 
their contents when these instructions are executed. 


WBINVD will also cause an external write-back 
cache to write back dirty lines before flushing its 
contents. The external cache is signaled using the 
bus cycle definition pins and the byte enables (refer 
to section 9.2.6 “Bus Cycle Definition” for the bus 
cycle definition pins and section 10.2.11 “Special 
Bus Cycles” for special bus cycles). Refer to the 
Intel486™ Processor Programmers Reference Man- 
ual for detailed instruction definitions. 


The results of the INVD and.WBINVD instructions 
are identical for the operation of the non-write-back 
enhanced Intel486 processor on-chip cache be- 
cause the cache is write-through. 


7.7.1 WRITE-BACK ENHANCED INTELDX2 
PROCESSOR CACHE FLUSHING 


The on-chip cache can be flushed by external hard- 
ware or by software instructions. 


Flushing the cache through hardware is accom- 
plished by driving the FLUSH # pin low. This causes 
the cache to write back all modified lines in the 
cache and mark the state bits invalid. The First Flush 
Acknowledge cycle is driven by the Write-Back En- 
hanced InteiDX2 processor followed by the Second 
Flush Acknowledge cycle after all write-backs and 
invalidations are complete. The two special cycles 
are issued even if there are no dirty lines to write 
back. 
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The INVD and WBINVD instructions cause the on- 
chip cache to be invalidated. WBINVD causes the 
modified lines in the internal cache to be written 
back, and all lines to be marked invalid. After execu- 
tion of the WBINVD instruc-tion, the Write-back and 
Flush special cycles are driven to indicate to any 
external cache that it should write back and invali- 
date its contents. These two special cycles are is- 
sued even if there are no dirty lines to be written 
back. INVD causes all lines in the cache to be invali- 
dated, so modified lines in the cache are not written 
back. The Flush special cycle is driven after the 
INVD instruction is executed to indicate to any exter- 
nal cache that it should invalidate its contents. Care 
should be taken when using the INVD instruction to 
avoid creating cache consistency problems. — 


NOTE: 
It is recommended to use the WBINVD in- 
struction instead of the INVD instruction if 
the on-chip cache is configured in the write- 
back mode. 


The assertion of the RESET pin invalidates the en- 
tire cache without writing back the modified lines. No 
special cycles are issued after the invalidation is 
complete. 


Snoop cycles with invalidation (INV=1) cause the 
Write-Back Enhanced IntelDX2 processor to invali- 
date an individual cache line. If the snooped line is a 
modified line, then the processor schedules a write- 
back cycle. Inquire cycles with no-invalidation cause 
the Write-Back Enhanced IntelDX2 processor to 
only write-back the line, if the inquired line is in 
M-state, and not invalidate the line. 


SRESET, STPCLK#, INTR, NMI and SMI# are rec- 
ognized and latched, but not serviced during the full- 
cache, modified-line write-backs, caused either by 
WBINVD instruction or the FLUSH#. However, 
BOFF#, AHOLD and HOLD are recognized 
DURING the full-cache, modified-line write-backs. 


7.8 Write-Back Enhanced IntelDX2 
Processor Write-Back Cache 
Architecture 


This section describes additional features pertaining 


to the write-back mode of the Write-Back Enhanced 
IntelDX2 processor. 
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7.8.1 WRITE-BACK CACHE COHERENCY 
PROTOCOL . 


The Write-Back Enhanced IntelDX2 processor 
cache protocol supports a cache line in one of the 
following four states: 


e whether a line is valid and defined as write-back 
during allocation (E-state), 


e if it is valid and defined as write-through during. 


. allocation (S-state), 
e if it has been modified (M-state), 
e if it is invalid (I-state). 


These four states are the M (Modified line), E (write- 
back line), S (write-through line) and the | (Invalid 
line) states and the protocol is re-ferred to as the 
“Modified MESI protocol.” A definition of the States 
is given below: 


M - Modified: An M-state line is modified (different 


from main memory) and can be ac- 


cessed (read/written to) without 
sending a cycle out on the bus. 


An E-state line is a “write-back’”’ 
line, but the line is not modified (i.e., 
it is coherent with main memory). 
An E-state line can be accessed 
(read/written to) without generating 
a bus cycle and a write to an E-state 
line will cause the line to become 
modified. 


An S-state line is a ‘“‘write-through’”’ 
line, and is coherent with main 
memory. A read hit to an S-state 
line will not generate bus activity, 
but a write hit to an S-state line will 
generate a write-through cycle on 
the bus. A write to an S-state line 
will update the cache and the main 
memory. 


This state indicates that the line is 
not in the cache. A read fo this line 
will be a miss and may cause the 
Write-Back Enhanced _ IntelDX2 
processor to execute a line fill (fetch 
the whole line into the cache from 


E - Exclusive: 


S - Shared: 


| - Invalid: 
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main memory). A write to an invalid 
line will cause the Write-Back En- 
hanced IntelDX2 processor to exe- 
cute a write-through cycle on the 
bus. 


Every line in the Write-Back Enhanced IntelDX2 
processor cache is assigned a state dependent on 
both Write-Back Enhanced IntelDX2 processor gen- 
erated activities and activities generated by the sys- 
tem hardware. As the Write-Back Enhanced In- 
telDX2 processor is targeted for uniprocessor sys- 
tems, a subset of MESI protocol, namely MEI, is 
used in the Write-Back Enhanced IntelDX2 proces- 
sor to maintain cache coherency. 


With the modified MESI protocol, it is assumed that 
in a uniprocessor system lines are defined as write- 
back or write-through at allocation time. This proper- 
ty associated with a line is never altered. The lines 
allocated as write-through go to S-state and remain 
in S-state. A cache line that is allocated as write- 
back never enters the S-state. The WB/WT# pinis | 
sampled during line allocation and is used strictly to 
characterize a line as write-back or write-through. 


7.8.1.1 State Transition Tables 


State transi-tions are caused by processor-generat- 
ed transactions (memory reads/writes) and by a set 
of external input signals and internally-generated 
variables. The Write-Back Enhanced IntelDX2 proc- 
essor also drives certain pins as a consequence of 
the Cache Consistency Protocol. 


Read Cycles 


Table 7-5 shows the state transitions for lines in the 
cache during unlocked read cycles. 


Write Cycles 
The state transitions of cache lines during Write- 


Back Enhanced IntelDX2 processor-generated write 
cycles are described in Table 7-6. 
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Table 7-5. Cache State Transitions for Write-Back Enhanced IntelDX2™ 
Processor Initiated Unlocked Read Cycles 


n/a Read hit; data is provided to processor core by 
_ cache. No bus cycle is generated. 
n/a Read hit; data is pro-vided to processor core by 
cache. No bus cycle is generated. 
n/a S Read hit; Data is pro-vided to the processor by the 
cache. No bus cycle is generated. 


CACHE # low 
AND 
KEN # low 


Data item does not exist in cache (MISS). A line 

fill cycle (read) will be gen-erated by the Write-Back 
Enhanced IntelDX2™ processor. This state 
transition will occur if WB/WT # is sampled high 
with first BRDY #. 


AND 
WB/WT # high 
AND 
PWT low 


CACHE # low Same as previous read miss case except that 
AND WB/WT # is sampled low with first BRDY # or PWT 
KEN # low is high. 
AND 
(WB/WT # low 


OR PWT high) 


CACHE # high 
OR 
KEN # high 


KEN # pin inactive; the line is not intended to be 
cached in the Write-Back Enhanced IntelDX2 
processor. 


NOTES: 


Locked accesses to the cache will cause the accessed line to transition to the Invalid state. 

PCD can also be used by the processor to determine the cacheability, but using the CACHE# pin is recommended. The 
transition from | to E or S states (based on WB/WT#) occurs only if KEN# is sampled low one clock prior to the first 
BRDY # and then one clock prior to the last BRDY #, and the cycle is transformed into a line fill cycle. lf KEN# is sampled 
high, the line is not cached and remains in the | state. 


Table 7-6. Cache State Transitions for Write-Back Enhanced IntelIDX2™ 
Processor-Initiated Write Cycles 


Present Pin Description 
a= eee i 


| Mm | Write hit; update cache. No bus cycle generated to update memory. 


pea ie Write hit; update cache only. No bus cycle generated; line is now modified. 


Write hit; cache updated with write data item. A write-through cycle is 
generated on the bus to update memory. Subsequent writes to E-state or 
M-state lines are held up until this write-through cycle is completed. 


Write miss; a write-through cycle is generated on the bus to update external 
memory. No allocation is done. Subsequent writes to the E or M lines are 
blocked until the write-miss is completed. 
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Note that even though memory writes are buffered 
while 1/O writes are not, these writes appear at the 
pins in the same order as they were generated by 
the processor. Write-Back cycles caused by the re- 
placement of M-state lines are buffered, while Write- 
Backs due to Snoop hit to M-state lines are not buff- 
ered. 


Cache Consistency Cycles (Snoop Cycles) 
_ The purpose of Snoop cycles is to check whether 


the address being presented by another bus master 
is con-tained within the cache of the Write-Back En- 


hanced IntelDX2 processor. Snoop cycles may be © 


initiated with or without an invalidation request (INV 
= 1 or 0). If a snoop cycle is initiated with INV = 0 
(usually during memory read cycles by another mas- 
ter), it is referred to as an Inquire cycle. If a snoop 
cycle is initiated with INV = 1 (usually during memo- 
ry write cycles), it is referred to as an Invalidate cy- 
cle. If the address hits a modified line in the cache, 
the HITM# pin is asserted, and the modified line is 
written back onto the bus. Table 7-7 describes state 
transitions for Snoop cycles. 


* 


7.8.2 DETECTING ON-CHIP WRITE-BACK 


CACHE OF THE WRITE-BACK ENHANCED 


INTELDX2 PROCESSOR 


The write-back policy of the on-chip cache of the 
Write-Back Enhanced IntelDX2 processor may be 
detected by software or hardware. The software 


mechanism makes use of the CPUID instruction.- 


(See Appendix B, ‘Feature Determination,” for use 
of the CPUID instruction.) The hardware mechanism 


makes use of a write-back related output signal from 


the processor. 


B 

intel. 
A software mechanism to determine if a given proc- 
essor has write-back support for the on-chip cache 
should drive the WB/WT # pin to “1” during RESET. 
This pin will be sampled by the processor during the 
falling edge of the RESET. Execute the CPUID in- 
struction, which returns the model number in the 
EAX register, EAX[7:4]. If the model number re- 
turned is 7 (Write-Back Enhanced IntelDX2 proces- 
sor) and the family number is 4, the on-chip cache 
supports the write-back policy. If the model number 
returned is in the range 0 through 6 or 8, the on-chip 
cache only supports the write-through policy. 


The following pseudo code/steps give an example 
of the initialization BIOS that can be used to detect 
the presence of the write-back on-chip cache: 


¢ Boot Address Cold start 
e Load Segment Registers and null IDTR 


e Execute CPUID instruction and determine the 
Family ID and Model ID. 


¢ Compare the Family ID to 4 and the Model ID 
returned to 7. When the Family ID is 4, and the 
model ID is 7, the processor supports on-chip 
write-back caching. If the Family ID does not 
match 7, the processor only supports on-chip 
write-through caching. | 


The hardware mechanism involves using the HITM # 
signal. For the Write-Back Enhanced IntelDX2 proc- 
essor, this signal is driven inactive (high) during 
RESET. The chipset can sample this output on the 
falling edge of RESET. If HITM# is sampled high on 
the falling edge of RESET, the processor supports 
on-chip write-back cache configuration. For those 
processors that do not support internal write-back 
caching, this signal is an INC, and this output is not 
driven. 


Table 7-7. Cache State Transitions During Snoop Cycles 


Present 
State 
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Next ee Next State Description 
ee 1 INV=0 P 


Snoop hit to a modified line indicated by HITM # pin low. Write-Back 
Enhanced IntelDX2 Processor schedules the write back of the 

modified line to memory. The state of the line changes to E provided 
-INV = 0 and changes to | if INV = 1. 


Snoop hit, no bus cycle generated. State remains unaltered if 
INV = 0, and changes to | if INV = 1. There | is no external indication 
of this snoop hit. 


Snoop hit, no bus cycle generated. State remains unaltered if 
INV = 0, and changes to | if INV = 1. There is no external indication 
of this snoop hit. 


pot | tt] | Address not in cache ) 
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8.0 SYSTEM MANAGEMENT MODE 
(SMM) ARCHITECTURES 


8.1 SMM Overview 


The Intel486 processor supports four modes: Real, 
Virtual-86, Protected, and System Management 
Mode (SMM). As an operating mode, SMM has a 
distinct processor environment, interface and hard- 
ware/software features. 


SMM provides system designers with a means of 
adding new software-controlled features to comput- 
er products that operate transparently to the operat- 
ing system and software applications. SMM is 
intended for use only by system firmware, not by 
applications software or general purpose systems 
software. 


The SMM architectural extension consists of the fol- 
lowing elements: 


1. System Management Interrupt (SMI #) hardware 
interface. 


2. Dedicated and secure memory space (SMRAM) 
for SMI# handler code and processor state (con- 
text) data with a status signal for the system to 
decode access to that memory space, SMIACT #. 
(The SMBASE address is relocatable and could 
also be relocated to non-cacheable address 
space.) 


3. Resume (RSM) instruction, for exiting the System 
Management Mode. 


4. Special Features such as |/O-Restart, for trans- 
parent power management of I/O peripherals, 
and Auto HALT Restart. 


8.2 Terminology 


The following terms are used throughout the discus- 
sion of System Management Mode. 


SMM: System Management Mode. This is the oper- 
ating environment that the processor (system) en- 
ters when the System Management Interrupt is be- 
ing serviced. 


SMI #: System Management Interrupt. This is part of 
the SMM interface. When SMI# is asserted (SMI # 
pin asserted low) it causes the processor to invoke 
SMM. The SMI# pin is the only means of enter- 
ing SMM. 
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SMM handler: System Management Mode handler. 
This is the code that will be executed when the proc- 
essor is in SMM. An example application that this 
code might implement is a power management con- 
trol or a system control function. 


RSM: Resume instruction. This instruction is used by 
the SMM handler to exit the SMM and return to the 
interrupted operating system or application process. 


SMRAM: This is the physical memory dedicated to 
SMM. The SMM handler code and related data re- 
side in this memory. This memory is also used by the 
processor to store its context before executing the 
SMM handler. The operating system and applica- 
tions do not have access to this memory space. 


SMBASE: Control register that contains the address 
of the SMRAM space. 


Context: This term refers to the processor state. 
The SMM discussion refers to the context, or proc- 
essor State, just before the processor invokes SMM. 
The context normally consists of the processor reg- 
isters that fully represent the processor state. 


Context Switch: A context switch is the process of 
either saving or restoring the context. The SMM dis- 
cussion refers to the context switch as the process 
of saving/restoring the context while invoking/ 
exiting SMM, respectively. : 


8.3 System Management Interrupt 
Processing 


The system interrupts the normal program execution 
and invokes SMM by generating a System Manage- 
ment Interrupt (SMI #) to the processor. The proces- 
sor will service the SMI# by executing the following 
sequence (see Figure 8-1): 


1. The processor asserts the SMIACT # signal, indi- 
cating to the system that it should enable the 
SMRAM. 


2. The processor saves its state (context) to 
SMRAM, starting at default address location 
SFFFFH, proceeding downward in a stack-like 
fashion. 


3. The processor switches to the System Manage- 
ment Mode processor environment (a pseudo- 
real mode). 
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SMIi# 


eae 


Instr Instr | Instr; 


SMIi# 


SMIACT# 


Active during bus cycles in SMM 


242202-—56 


Figure 8-1. Basic SMI # Interrupt Service 


4. The processor will then jump to the default abso- 
lute address of 38000H in SMRAM to execute the 
SMI# handler. This SMI# handler performs the 
system management activities. 


5. The SMI# handler will then execute the RSM in- 


struction which restores the processors context 
from. SMRAM, de-asserts the SMIACT# signal, 
and then returns control to the previously inter- 
rupted program execution. 


NOTE: 
The above sequence is valid for the default 
SMBASE value only. See the following sec- 
tions for a description of the SMBASE regis- 
ter and SMBASE relocation. | 


The System Management Interrupt hardware inter- 
face consists of the SMI# interrupt request input 
and the SMIACT # output used by the system to de- 
code the SMRAM. 
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SMIACT# 


SMI 


SMI# Interface 
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Figure 8-2. Basic SMI # Hardware Interface — 


8.3.1 SYSTEM MANAGEMENT INTERRUPT 
(SMI #) 


SMI # is a falling-edge triggered, non-maskable in- 
terrupt request signal. SMI # is an asynchronous sig- 
nal, but setup and hold times, top and to;,-must be 
met in order to guarantee recognition on a specific — 
clock. The SMI# input need not remain active until 
the interrupt is actually serviced. The SMI# input 
only needs to remain active for a single clock if the 
required setup and hold times are met. SMI# will 


- also work correctly if it is held active for an arbitrary 


number of clocks. 


intel. 


The SMI# input must be held inactive for at least 
four external clocks after it is asserted to reset the 
edge triggered logic. A subsequent SMI # might not 
be recognized if the SMI# input is not held inactive 
for at least four clocks after being asserted. 


SMI#, like NMI, is not affected by the IF bit in the 
EFLAGS register and is recognized on an instruction 
boundary. An SMI# will not break locked bus cycles. 
The SMI# has a higher priority than NMI and is not 
masked during an NMI. In order for SMI# to be rec- 
ognized with respect to SRESET, SM!# should not 
be asserted until two (2) clocks after SRESET be- 
comes inactive. 


After the SMI# interrupt is recognized, the SMI# 
signal will be masked internally until the RSM in- 
struction is executed and the interrupt service rou- 
tine is complete. Masking the SMI# prevents recur- 
sive SMI# calls. SMI# must be de-asserted for at 
least 4 clocks to reset the edge triggered logic. If 
another SMI# occurs while the SMI# is masked, 
the pending SMI# will be recognized and executed 
on the next instruction boundary after the current 
SMI# completes. This instruction boundary occurs 
before execution of the next instruction in the inter- 
rupted application code, resulting in back to back 
SMM handlers. Only one SMI# can be pending 
while SMI# is masked. 
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The SMI # signal is synchronized internally and must 
be asserted at least three (3) CLK periods prior to 
asserting the RDY# signal in order to guarantee 
recognition on a specific instruction boundary. This 
is important for servicing an |/O trap with an SMI # 
handler. (See Figure 8-3.) | 


8.3.2 SMI# ACTIVE (SMIACT #) 


SMIACT # indicates that the processor is operating 
in System Management Mode. The processor as- 
serts SMIACT# in response to an SMI# interrupt 
request on the SMI# pin. SMIACT # is.driven active 
after the processor has completed all pending write 
cycles (including emptying the write buffers), and be- 
fore the first access to SMRAM when the processor 
saves (writes) its state (or context) to SMRAM. 
SMIACT# remains active until the last access to 


SMRAM when the processor restores (reads) its fies 


state from SMRAM. The SMiACT # signal does not 
float in response to HOLD. The SMIACT # signal is 
used by the system logic to decode SMRAM (See 
Figure 8-2). 


The number of CLKs required to complete the SMM 
state save and restore is very dependent on-system 
memory performance. The values listed in Table 8-1 
assume 0 wait-state memory writes (2 CLK cycles), 
2-1-1-1 burst read cycles, and 0 wait-state non-burst 
reads (2 CLK cycles). Additionally, it is assumed that 
the data read during the SMM state restore se- 
quence is not cacheable. 


A: Setup time for recognition on I/O instruction boundary 
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Figure 8-3. SMI# Timing for Servicing an I/O Trap 
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Figure 8-4 and Table 8-1 can be used for latency 
calculations. As shown, the minimum time required 
to enter an SMI # handler routine for the Intel486 DX 
processor (from the completion of the memupio’ in- 
struction) is given by: 


Latency to beginning of SMI# handler = 
A+B+ C= 153 CLKs 


CLK 
SMI# | 


 ADS# 


BRDY# 


~SMIACT# 


Normal State 
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and the minimum time required to return to the inter- 
rupted application (following the final SMM instruc- . . 


_tion before RSM) is given by: 


Latency to continue interrupted application = 
E+F+G = 243 CLKs 


State SMM State 
Save Handier Restore . 


System Management Mode Normal State 
/242202-59 


Figure 8-4. Intel486™ Processor SMIACT# Timing 
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Table 8-1. Intel486™ Processor SMIACT # Timing 


A:Last RDY # from non- 
SMM transfer to 
SMIACT # assertion 


B:SMIACT # assertion to 
first ADS # for SMM 
state save 


C:SMM state save 
(dependent on 
memory performance) 


Approx 
139 CLKs 


Approx. 
139 CLKs 


E:SMM state restore 
(dependent on 
memory performance) 


F:Last RDY # from SMM 
transfer to de- 
assertion of SMIACT # 


G: SMIACT # de- 
assertion to first non- 
SMM ADS # 


Approx. 
236 CLKs 


Approx. 
236 CLKs 


8.3.3 SMRAM 


The Intel486 processor uses the SMRAM space for 
state save and state restore operations during an 
SMI# and RSM. The SM! # handler, which also re- 
sides in SMRAM, uses the SMRAM space to store 
code, data and stacks. In addition, the SMI# han- 
dler can use the SMRAM for system management 
information such as the system configuration, con- 
figuration of a powered-down device, and system 
designer-specific information. 


The processor asserts the SMIACT # output to indi- 
cate to the memory controller that it is operating in 
System Management Mode. The system logic 
should ensure that only the processor has access to 
this area. Alternate bus masters or DMA devices try- 
ing to access the SMRAM space when SMIACT # is 
active should be directed to system RAM in the re- 
spective area. 


The system logic is minimally required to decode the 
physical memory address range from 38000H- 
SFFFFH as SMRAM area. The processor will save 
its state to the state save area from 3FFFFH down- 
ward to 3FEOOH. After saving its state the processor 


Intel486 SX/| IntelSX2T | Intel486 DX |IntelDX2™ | IntelDX4™ IntelDX4 
Processor | Processor Processor | Processor | Processor 3X Processor 2X 
2CLK 1 CLK 2 CLK 1 CLK 1 CLK 

minimum minimum minimum minimum minimum 

4OCLK  |20CLK  |40CLK 20CLK  113CLK 20 CLK 
minimum minimum minimum minimum minimum minimum 


Approx. 
139 CLKs 


D:SMM handler User User User User User User 
determined | determined | determined | determined | determined determined 


Approx. 
236 CLKs 


4CLK 2 CLK 4CLK 2 CLK 1 CLK 

minimum minimum minimum minimum minimum 
20CLK  |10CLK  |20CLK 10CLK  |6CLK ~—T40CLK~ 
minimum minimum minimum minimum minimum minimum 


Approx. 
139 CLKs 


Approx. 
139 CLKs 


Approx. 
236 CLKs 


Approx. 


1 CLK 
minimum 


will jump to the address location 38000H to begin 
executing the SMI# handler. The system logic can 
choose to decode a larger area of SMRAM as need- 
ed. The size of this SMRAM can be between 
32 Kbytes and 4 Gbytes. 


The system logic should provide a manual method 
for switching the SMRAM into system memory 
space when the processor is not in SMM. This will. 
enable initialization of the SMRAM space (i.e., load- 
ing SMI# handler) before executing the SMI# han- 
dier during SMM. (See Figure 8-5.) 


8.3.3.1 SMRAM State Save Map 


When the SMI# is recognized on an instruction 
boundary, the processor core first sets the 
SMIACT # signal LOW indicating to the system logic 
that accesses are now being made to the system- 
defined SMRAM areas. The processor then writes 
its state to the state save area in the SMRAM. The 
state save area starts at CS Base + [8000H + 
7FFFH]. The default CS Base is 30000H, therefore 
the default state save area is at 3FFFFH. In this 
case, the CS Base can also be referred to as the 
SMBASE. 
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lf SMBASE Relocation is enabled, then the SMRAM 
addresses can change. The following formula is 
used to determine the relocated addresses where 
the context is saved. The context will reside at CS 
Base + [8000H + Register Offset], where the de- 
fault initial CS Base is 30000H and the Register Off- 
set is listed in the SMRAM state save map (Table 
8-2). Reserved spaces will be used to accommodate 
new registers in future processors. The state save 
area starts at 7FFFH and continues downward in a 
stack-like fashion. | 


Some of the registers in the SMRAM state save area 
may be read and changed by the SMI# handler, 
with the changed values restored to the processor 
registers by the RSM instruction. Some register im- 
ages are read-only, and must not be modified (modi- 
fying these registers will result in unpredictable be- 
havior). The values stored in the areas marked re- 
served may change in future processors. An SMM 
handler should not rely on any values stored in an 
area that is marked as reserved. 


System memory 
accesses redirected ———> 
to SMRAM 


System memory 
accesses not 
redirected to 

SMRAM 


Processor accesses 
to system address 


space used for 
loading SMRAM 
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Figure 8-5. Redirecting System Memory 
Addresses to SMRAM 
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Table 8-2. SMRAM State Save Map 


Register at | 
wFC | CROC: NOC 
FFB | RS | NC 


7FF4 EFLAGS E 
7FFO EIP YES 
7FEC YES 
7FE8 YES 
7FE4 
7FEO 
7FDC 
7FD8 EDX YES 
7FD4 ECX YES 
7FDO A YES 
7FCC 
7FC8 DR7 
7FC4 TR* 
7FCO LDTR* 
7FBC “ Gs* 
7FB8 FS* 
7FB4 DS* 
7FBO Ss* 
7FAC cs* 
7FA8 ES* 
7FA7-7F98 
7F94 
7F93-7F8C 
7F88 
7F87-7F04 
7F02 


~< 
” 


m 
9 


m 


Mm | m 

w 
" 
<j< 

m 
NIM 


0 


S 


m 


m 
x 


z 


z 


Zz = 


Zz 


=< 


Reserved 
IDT Base 
Reserved 
GDT Base 
Reserved 


Auto HALT Restart YES 
Slot (Word) 


1/O Trap Restart Slot YES 
(Word) 


SMM Revision 
Identifier (DWord) 


SMBASE Slot (Dword) 
Reserved 


7F00 


7EFC 


7EF8 
7EF7~7E00 


YES 


NOTES: 

*Upper two bytes are reserved. 

Modifying a value that is marked as not writeable will result 
in unpredictable behavior. 

Words are stored in two consecutive bytes in memory with 
the low-order byte at the lowest address and the high-order 
byte in the high address. 


a 
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The following registers are saved and restored (in 
areas of the state save that are marked reserved), 


but are not visible to the system software program- 
mer: 


CR1, CR2 and CR4, hidden descriptor registers for 
CS, DS, ES, FS, GS, and SS. 


lf an SMI# request is issued for the purpose of pow- 
ering down the processor, the values of all reserved 
locations in the SMM state save must be saved to 
non-volatile memory. 


The following registers are not automatically saved 
and restored by SMI# and RSM: 


DR5-DRO, TR7-TR3, FPU registers: STn, FCS, 
FSW, tag word, FP instruction pointer, FP opcode, 
and operand pointer. 


For all SMI# requests except for suspend/resume, 
these registers do not have to be saved because 
their contents will not change. However, during a 
power down suspend/resume, a resume reset will 
clear these registers back to their default values. In 
this case, the suspend SMI# handler should read 
these registers directly to save them and restore 
them during the power up resume. Anytime the 
SMI# handler changes these registers in the proc- 
essor, it must also save and restore them. 


8.3.4 EXIT FROM SMM 


The RSM instruction is only available to the SMI # 
handler. The opcode of the instruction is OFAAH. 
Execution of this instruction while the processor is 
executing outside of SMM will cause an invalid op- 
code error. The last instruction of the SM! # handler 
will be the RSM instruction. 


The RSM instruction restores the state save image 
from SMRAM back to the processor, then returns 
control back to the interrupted program execution. 
There are three SMM features that can be enabled 
by writing to control “slots” in the SMRAM state 
save area. 


Auto HALT Restart. It is possible for the SMI# re- 
quest to interrupt the HALT state. The SMI# handler 
can tell the RSM instruction to return control to the 
HALT instruction or to return control to the instruc- 
tion following the HALT instruction by appropriately 
setting the Auto HALT Restart slot. The default op- 
eration is to restart the HALT instruction. 
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I/O Trap Restart. If the SMI# interrupt was gener- 
ated on an I/O access to a powered-down device, 
the SMI# handler can tell the RSM instruction to re- 
execute that I/O instruction by setting the !/O Trap 
Restart slot. | 


SMBASE Relocation. The system can relocate the 
SMRAM by setting the SMBASE Relocation slot in 
the state save area. The RSM instruction will set the 
SMBASE in the processor based on the value in the 
SMBASE relocation slot. The SMBASE must be 32K 
aligned. 


For further details on these SMM features, see sec- 
tion 8.5. 


lf the processor detects invalid state information, it 
enters the shutdown state. This happens only in the 
following situations: | 


e The value stored in the SMBASE slot is not a 
32-Kbyte-aligned address. 


e A reserved bit of CR4 is set to 1. 


e A combination of bits in CRO is illegal; namely, 
(PG=1 and PE=0) or (NW=1 and CD=0). 


In shutdown mode, the processor stops executing 
instructions until an NMI interrupt is received or reset 
initialization is invoked. The processor generates a 
special bus cycle to indicate it has entered shutdown 
mode. 


NOTE: | 
INTR and SMI# will also bring the processor 
out of a shutdown that is encountered due to 
invalid state information from SMM _ execu- 
tion. Make sure that INTR and SMI# are not 
asserted if SMM routines are written such 
that a shutdown occurs. 


8.4 System Management Mode 
Programming Model 


8.4.1 ENTERING SYSTEM MANAGEMENT MODE 


SMM is one of the major operating modes, on a level 
with Protected mode, Real address mode or virtual- 
86 mode. Figure 8-6 shows how the processor can 
enter SMM from any of the three modes and then 
return. 
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Real Mode 
Reset or RSM 


Reset or =: PE=1 
PE= Suit ____y, 


Protected Mode > 


System 
Management 
Mode 


—{ Virtual - 86 Mode 
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NOTE: 
Reset could occur by asserting the RESET or SRESET 


pin. 


Figure 8-6. Transition to and from System 
‘Management Mode 


The external signal SMI# causes the processor to 
switch to SMM. The RSM instruction exits SMM. 
SMM is transparent to applications programs and 
operating systems because of the following: 


e The only way to enter SMM is via a type of non- 
maskable interrupt triggered by an external sig- 
nal. 


e The processor begins executing SMM code from 
a separate address space, referred to earlier as 
system management RAM (SMRAM). 


e Upon entry into SMM, the processor saves the 
register state of the interrupted program in a part 
of SMRAM called the SMM context save space. 


e All interrupts normally handled by the operating 
system or by applications are disabled upon entry 
into SMM 


e A special instruction, RSM, restores processor 
registers from the SMM context save space and 
returns control to the interrupted program. 


SMM is similar to Real address mode in that there 
are no privilege levels or address mapping. An SMM 
program can execute all |/O and other system in- 
structions and can address up to four Gbytes of 
memory. 


8.4.2 PROCESSOR ENVIRONMENT 


When an SMI# signal is recognized on an instruc- 
tion execution boundary, the processor waits for all 
stores to complete, including emptying of the write 
buffers. The final write cycle is complete when the 


system returns RDY# or BRDY#. The processor 
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then drives SMIACT # active, saves its register state 


to SMRAM space, and begins to execute the SMM 
handler. | 


SMI# has greater priority than debug exceptions 
and external interrupts. This means that if more than 
one of these conditions occur at an instruction 


‘boundary, only the SMI# processing occurs, not a 


debug exception or external interrupt. Subsequent 
SMI# requests are not acknowledged while the 
processor is in SMM. The first SMI# interrupt re- 
quest that occurs while the processor is in SMM is 
latched, and serviced when the processor exits 
SMM with the RSM instruction. Only one SMI# will 
be latched by the processor while it is in SMM. 


When the processor invokes SMM, the processor 
core registers are initialized as shown in Table 8-3. 


Table 8-3. SMM Initial Processor 
Core Register Settings 


Resistor [—_Contenta 
[er | 000080008 


CS Base SMM Base 
| (default 30000H) 


0000H 


DS, ES, FS, GS, 
SS Selectors 


DS, ES, FS, GS, 000000000H 
SS Bases 

DS, ES, FS, GS, OFFEFFFFEH 
SS Limits | 


Bits 0,2,3 & 31 cleared 
(PE, EM, TS & PG); 
others are Bamodined 


The following is a summary of the key features in the | 
SMM environment: 


1. Real mode style address calculation 
2. 4-Gbyte limit checking 
3. IF flag is cleared 
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4. NMI is disabled 


5. TF flag in EFLAGS is cleared; single step traps 
are disabled 


6. DR7 is cleared, except for bits 12 and 13; debug 
traps are disabled. 


7. The RSM instruction no longer generates an in- 
valid opcode error 


8. Default 16-bit opcode, register and stack use. 


All bus arbitration (HOLD, AHOLD, BOFF #) inputs 
and bus sizing (BS8#, BS16#) inputs operate nor- 
mally while the processor is in SMM. 


8.4.2.1 Write-Back Enhanced IntelDX2 
Processor Environment 


When the Write-Back Enhanced IntelDX2 processor 
is in Enhanced Bus Mode, SMI# has greater priority 
than debug exceptions and external interrupts, ex- 
cept for FLUSH# and SRESET. (See section 4.8.6.) 


8.4.3 EXECUTING SYSTEM MANAGEMENT 
MODE HANDLER 


The processor begins execution of the SMM handler 
at offset 8000H in the CS segment. The CS Base is 
initially 30000H. However, the CS Base can be 
changed by using the SMM Base relocation feature. 


When the SMM handler is invoked, the processors 
PE and PG bits in CRO are reset to 0. The processor 
is in an environment similar to Real mode, but with- 
out the 64-Kbyte limit checking. However, the de- 
fault operand size and the default address size are 
set to 16 bits. 


The EM bit is cleared so that no exceptions are gen- 
erated. (if the SMM was entered from Protected 
mode, the Real mode interrupt and exception sup- 
port is not available.) The SMI# handler should not 
use floating point unit instructions until the FPU is 
properly detected (within the SMI# handler) and the 
exception support is initialized. 


Because the segment bases (other than CS) are 
cleared to 0 and the segment limits are set to 4 
Gbytes, the address space may be treated as a sin- 
gle flat 4-Gbyte linear space that is unsegmented. 
The processor is still in Real mode and when a seg- 
ment selector is loaded with a 16-bit value, that val- 
ue is then shifted left by 4 bits and loaded into the 
segment base cache. The limits and attributes are 
not modified. 
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In SMM, the processor can access or jump any- 
where within the 4-Gbyte logical address space. The 
processor can also indirectly access or perform a 


near jump anywhere within the 4-Gbyte logical ad- 


dress space. 


8.4.3.1 Exceptions and Interrupts within neem 
Management Mode 


When the processor enters SMM, it disables INTR 
interrupts, debug and single-step traps by clearing 
the EFLAGS, DR6 and DR7 registers. This is done 
to prevent a debug application from accidentally 
breaking into an SMM handler. This is necessary be- 
cause the SMM handler operates from a distinct ad- 
dress space (SMRAM), and hence, the debug trap 
will not represent the normal system memory space. 


lf an SMM handler wishes to use the debug trap 
feature of the processor to debug SMM handler 
code, it must first ensure that an SMM compliant 
debug handler is available. The SMM handler must 
also ensure DRO-DR3 is saved to be restored later. 
The debug registers DRO-DR3 and DR7 must then 
be initialized with the appropriate values. 


If the processor wishes to use the single step fea- 
ture of the processor, it must ensure that an SMM 
compliant single step handler is available and then 
set the trap flag in the EFLAGS register. 


If the system design requires the processor to re- 
spond to hardware INTR requests while in SMM, it 
must ensure that an SMM compliant interrupt han- 
dler is available and then set the interrupt flag in the 
EFLAGS register (using the STI instruction). Soft- 
ware interrupts are not blocked upon entry to SMM, 
and the system software designer must provide an 
SMM compliant interrupt handler before attempting 
to execute any software interrupt instructions. Note 
that in SMM mode, the interrupt vector table has the 
same properties and location as the Real mode vec- 
tor table. | 


NMI interrupts are blocked upon entry to the SMM 
handler. If an NMI request occurs during the SMM 
handler, it is latched and serviced after the proces- 
sor exits SMM. Only one NMI request will be latched 
during the SMM handler. If an NMI request is: pend- 
ing when the processor executes the RSM instruc- 
tion, the NMI is serviced before the next instruction 
of the interrupted code sequence. 


2-133 


intel486™ PROCESSOR FAMILY 


Although NMI requests are blocked when the proc- 
essor enters SMM, they may be enabled through 


software by executing an IRET instruction. If the | 


SMM handler requires the use of NMI interrupts, it 
should invoke a dummy interrupt service routine for 
the purpose of executing an IRET instruction. Once 
an IRET instruction is executed, NMI interrupt re- 
quests are serviced in the same “real mod’”’ manner 
in which they are handled outside of SMM. 


8.5 SMM Features 


8.5.1 SMM REVISION IDENTIFIER 


The SMM revision identifier is used to indicate the 
version of SMM and the SMM extensions that are 
supported by the processor. The SMM revision iden- 
- tifier is written during SMM entry and can be exam- 
ined in SMRAM space at Register Offset 7EFCH. 
_ The lower word of the SMM revision identifier refers 
to the version of the base SMM architecture. The 
upper word of the SMM revision identifier refers to 
the extensions available. (See Figure 8-7.) 


Register Offset 


SMBASE Reiocation | 


SMM Revision Level 
1/O Trap with Restart 


242202-62 


Figure 8-7. SMM Revision Identifier 
Table 8-4. Bit Values for SMM Revision Identifier 


Processor does not support I/O 
Trap Restart 


Processor supports |/O Trap 
Restart 


| Processor does not support 
| SMBASE relocation 


Processor supports SMBASE 


relocation 
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Bit 16 of the SMM revision identifier is used to indi- 
cate to the SMM handler that this processor sup- 
ports the SMM I/O trap extension. If this bit is high, — 
then this processor supports the SMM |/O trap ex- 
tension. If this bit is low, then this processor does 


not support I/O trapping using the I/O trap slot 
mechanism. (See Table 8-4.) 


Bit 17 of this slot indicates whether the processor 
supports relocation of the SMM jump vector and the 
SMRAM base address. (See Table 8-4.) 


The Intel486 processor supports both the I/O Trap 
Restart and the SMBASE relocation features. 


~ 8.5.2 AUTO HALT RESTART 


The Auto HALT Restart slot at register offset (word 
location) 7F02H in SMRAM indicates to the SMM 
handler that the SMI# interrupted the processor 
during a HALT state (bit 0 of slot 7FO2H is set to 1 if 
the previous instruction was a HALT). If the SMI# 
did not interrupt the processor in a HALT state, then 
the SMI# microcode will set bit 0 of the Auto HALT 
Restart slot to a value of 0. If the previous instruction 
was a HALT, the SMM handler can choose to either 
set or reset bit 0. If this bit is set to 1, the RSM micro 
code execution will force the processor to re-enter 
the HALT state. If this bit is set to O when the RSM 
instruction is executed, the processor will continue 
execution with the instruction just after the interrupt- 
ed HALT instruction. Note that if the interrupted in- 
struction was not a HALT instruction (bit 0 is set to 0 
in the Auto HALT Restart slot upon SMM entry), set- 
ting bit 0 to 1 will Cause unpredictable behavior 
when the RSM instruction is executed. (See Figure 
8-8 and Table 8-5.) a 


Register Offset 
7F02H 


= Auto HALT 
Restart 
242202-63 


Figure 8-8. Auto HALT Restart 
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Table 8-5. Bit Values for Auto HALT Restart 


Value of 
Bit 0 at Comments 
Returns to next instruction 
in interrupted program. 


Value of 
Bit 0 at 


Unpredictable 


Returns to next instruction 
after HALT 
Returns to HALT state 


lf the HALT instruction is restarted, the processor 
will generate a memory access to fetch the HALT 
instruction (if it is not in the internal cache), and exe- 
cute a HALT bus cycle. 


8.5.3 1/0 INSTRUCTION RESTART 


The {/O instruction restart slot (register offset 
7FOOH in SMRAM) gives the SMM handler the op- 
tion of causing the RSM instruction to automatically 
re-execute the interrupted I/O instruction. When the 
RSM instruction is executed, if the I/O instruction 
restart slot contains the value OFFH, then the proc- 
essor will automatically re-execute the I/O instruc- 
tion that the SMI# trapped. If the I/O instruction re- 
start slot contains the value 00H when the RSM in- 
struction is executed, then the processor will not re- 
execute the I/O instruction. The processor automati- 
cally initializes the I/O instruction restart slot to OOH 
during SMM entry. The |/O instruction restart slot 
should be written only when the processor has gen- 
erated an SMI# on an //O instruction boundary. 
Processor operation is unpredictable when the |/O 
instruction restart slot is set when the processor is 
servicing an SMI# that originated on a non-!/O in- 
struction boundary. (See Figure 8-9 and Table 8-6.) 
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Register Offset 
7FOOh 


I/O Instruction 
Restart Slot 
242202-64 


Figure 8-9. I/O Instruction Restart 


Table 8-6 I/O Instruction Restart Value 


Value at Value Comments 
Entry at Exit 
OOH OOH 


Do not restart trapped I/ O 
instruction 
OOH OFFH Restart trapped I/O 
instruction 


If the system executes back-to-back SMI# re- 
quests, the second SMM handler must not set 
the I/O instruction restart slot (see section 8.6.6 
“Nested SMI#s and I/O Restart’’). 


8.5.4 SMM BASE RELOCATION 


The Intel486 processor provides a control register, 
SMBASE. The address space used as SMRAM can 
be modified by changing the SMBASE register be- 
fore exiting an SMI# handler routine. SMBASE can 
be changed to any 32K aligned value (values that 
are not 32K aligned will cause the processor to enter 
the shutdown state when executing the RSM in- 
struction). SMBASE is set to the default value of 
30000H on RESET, but is not changed on SRESET. 
If the SMBASE register is changed during an SMM 
handler, all subsequent SMI # requests will initiate a 
state save at the new SMBASE. (See Figure 8-10.) 
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Register Offset 


7EF8H 


a fas . 
SMM Base 
242202-65 


Figure 8-10. SMM Base Location 


The SMBASE slot in the SMM state save area is a 
feature used to indicate and change the SMI# jump 
vector location and the SMRAM save area. When bit 
17 of the SMM Revision Identifier is set then this 
feature exists and the SMRAM base and conse- 
quently the jump vector are as indicated by the SMM 
Base slot. During the execution of the RSM instruc- 
‘tion, the processor will read this slot and initialize the 
processor to use the new SMBASE during the next 
SMI#. During an SMI#, the processor will do its 
context save to the new SMRAM area pointed to by 
the SMBASE, store the current SMBASE in the 
SMM Base slot (offset 7EF8H), and then start exe- 
cution of the new jump vector based on the current 
SMBASE. 


The SMBASE must be a 32-Kbyte aligned, 32-bit in- 
teger that indicates a base address for the SMRAM 
context save area and the SMI# jump vector. For 


example when the processor first powers up, the 


minimum SMRAM area is from 38000H-3FFFFH. 
The default SMBASE is 30000H. Hence the starting 
address of the jump vector is calculated by: 


SMBASE + 8000H 


While the starting address for the SMRAM state 
save area is calculated by: 


SMM Base + [8000H + 7FFFH] 


Hence, when this feature is enabled, the SMRAM 
register map is addressed according to the above 
formulas. (See Figure 8-11.) 


To change the SMRAM base address and SMM 
jump vector location, the SMM handler should modi- 
fy the SMBASE slot. Upon executing an RSM in- 
struction, the processor will read the SMBASE slot 
and store it internally. Upon recognition of the next 
SMI# request, the processor will use the new 
SMBASE slot for the SMRAM dump and SMI # jump 
vector. 
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If the modified SMBASE slot does not contain a 
32-Kbyte aligned value, the RSM microcode will © 


- cause the processor to enter the shutdown state. 


SMBASE + 8000H 


+ 7FFEH Start a State Save 


SMM Handler Entry 


SMBASE + 8000H 


SMBASE 


242202-66 


Figure 8-11. SMRAM Usage 


8.6 SMM System Design 
Considerations 


8.6.1 SMRAM INTERFACE 


The hardware designed to control the SMRAM 
space must follow these guidelines: 


1. A provision should be made to allow for initializa- 
tion of SMRAM space during system boot up. This 
initialization of SMRAM space must happen 
before the first occurrence of an SMI# interrupt. 
Initializing the SMRAM space must include instal- 
lation of an SMM handler, and may include instal- 
lation of related data structures necessary for par- 
ticular SMM applications. The memory controller . 
providing the interface to the SMRAM should pro- 
vide a means for the initialization code to manual- — 
ly open the SMRAM space. 


2. A minimum initial SMRAM address space of | 
38000H-3FFFFH should be decoded by the mem- 
ory controller. 


3. Alternate bus masters (such as DMA Sertholees) 
should not be allowed to access SMRAM space. 
Only the processor, either through SMI# or dur- 
ing initialization, should be allowed access to 
SMRAM. 


4. |In order to implement a zero-volt suspend func- 
tion, the system must have access to all of normal 
system memory from within an SMM _ handler 
routine. If the SMRAM is going to overlay normal 
system memory, there must be a method of ac- 
cessing any system memory that is located under- 
neath SMRAM. 


| 
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There are two potential schemes for locating the 
SMRAM, either overlaid to an address space on top 
of normal system memory, or placed in a distinct 
address space. (See Figure 8-12.) When SMRAM is 
overlaid on the top of normal system memory, the 
processor output signal SMIACT # must be used to 
distinguish SMRAM from main system memory. Ad- 
ditionally, if the overlaid normal memory is cache- 
able, both the processor internal cache and any sec- 
ond level caches must be empty before the first read 
of an SMM handler routine. If the SMM memory is 
cacheable, the caches must be empty before the 
first read of normal memory following an SMM han- 
dler routine. This is done by flushing the caches, and 
is required to maintain cache coherency. When the 
default SMRAM location is used, SMRAM is overlaid 
on top of system main memory (at 38000H through 
SFFFFH). 


lf SMRAM is located in its own distinct memory 
space, which can be completely decoded with only 
the processor address signals, it is said to be non- 
overlaid. In this case, there are no new requirements 
for maintaining cache coherency. 


Normal 
Memory 


y Shadowed Region 


Normal 
Memory 


Overlaid 
(caches must 
be flushed) 


242202-67 


Non-overlaid 
(no need to 
flush caches) 


Figure 8-12. SMRAM Location 
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8.6.2 CACHE FLUSHES 


The processor does not unconditionally flush its 
cache before entering SMM (this option is left to the 
system designer). If SMRAM is shadowed in a 
cacheable memory area that is visible to the applica- 
tion or operating system, it is necessary for the sys- 
tem to empty both the processor cache and any 
second level cache before entering SMM. That is, if 
SMRAM is in the same physical address location as 
the normal cacheable memory space, then an SMM 
read may hit the cache which would contain normal 
memory space code/data. If the SMM memory is 
cacheable, the normal read cycles after SMM may 
hit the cache, which may contain SMM code/data. In 
this case the cache should be empty before the first 
memory read cycle during SMM and before the first 
normal cycle after exiting SMM. (See Figure 8-13.) 


The FLUSH# and KEN# signals can be used to 
ensure cache coherency when switching between 
normal and SMM modes. Cache flushing during 
SMM entry is accomplished by asserting the 
FLUSH# pin when SMI# is driven active. Cache 
flushing during SMM exit is accomplished by assert- 
ing the FLUSH# pin after the SMIACT# pin is de- 
asserted (within 1 CLK). To guarantee this behavior, 
the constraints on setup and hold timings on the in- 
teraction of FLUSH # and SMIACT # as specified for 
a processor should be followed. 


If the SMRAM area is overlaid over normal memory 
and if the system designer does not want to flush 
the caches upon leaving SMM then references to 
the SMRAM area should not be cached. It is the 
obligation of the system designer to ensure that the 
KEN# pin is sampled inactive during all references 
to the SMRAM area. Figures 8-14 and 8-15 illustrate 
a cached and non-cached SMM using FLUSH # and 
KEN #. 
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SMI# 


SMIACT# 
242202~-68 


SMM Normal 
Handler 
SMI# 


SMIACT# 


FLUSH# 


242202-69 
State Normal 


Resume , Cycle 


FLUSH# 


242202-70 


Figure 8-15. Non-Cached SMM 
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8.6.2.1 Write-Back Enhanced IntelDX2 | 
Processor System Management Mode 
and Cache Flushing 


Regardless of the on-chip cache mode (i.e., either 
write-through or write-back) it is recommended that 
SMRAM be non-overlaid. This provides the greatest 
freedom for caching of both SMRAM and normal 
memory, provides a simplified memory controller de- 
sign, and eliminates the performance penalty of 
flushing. 


In general, cache flushing is not required when the 
SMRAM and normal memory are not overlaid. Table 
8-7 gives the cache flushing requirements for enter- 
ing and exiting SMM, when the SMRAM is not over- 
laid with normal memory space. 


SMRAM can not be cached as write-back lines. If 
SMRAM is cached, it should be cached only as 
write-through lines. This is because dirty lines can 
not be written back to SMRAM upon exit from SMM. 
The de-assertion of SMIACT # signals that the proc- 
essor is exiting SMM, and is used to assert 
FLUSH #. By the time the write back of dirty lines 
occurs, SMIACT # would already be inactive, so the 
SMRAM can no longer be decoded. When the 
SMRAN is cached as write-through, this problem will 
not occur. 


Table 8-7. Cache Flushing 
(Non-Overlaid SMRAM) 


Normal 
Memory 
Cacheablie 


SMRAM 


Cacheable | FLUSH Entering SMM 


No, but Snoop WBs 
must go to Normal 
Memory Space. 


No, but Snoop and 
Replacement WBs 
must go to normal 
memory space. 


Coherency requirements must be met when the nor- 
mal memory is cached in write-back mode. In this 
case, the snoop and replacement write-backs that 


Intel486™ PROCESSOR FAMILY 


occur during SMM must go to normal memory, even 
though SMIACT # is active. This requirement is com- 
patible with SMM security requirements, because 
these write backs can not decode the SMRAM, and 
the memory system must be able to handle this situ- 
ation properly. 


lf SMRAM is overlaid with normal memory space, 
additional system design features are needed to en- 
sure that cache coherency is maintained. Table 8-8 
lists the cache flushing requirements for entering 
and exiting the SMM when the SMRAM is overlaid 
with normal memory space. 


Table 8-8. Cache Flushing (Overlaid SMRAM) 


Normal 
Memory 
Cacheable 


WT or WB 


WT or WB 


lf SMI|# and FLUSH# are asserted together, the 
Write-Back Enhanced IntelDX2 processor guaran- 
tees that the FLUSH# will be recognized first, fol- 
lowed by the SMI #. If the cache is configured in the 
write-back mode, the modified lines will be written 
back to the normal user space, followed by the two 
speciai cycles. The SMI# will then be recognized 
and the transition to SMM will occur, as shown in 
Figure 8-16. 


SMRAM 
Cacheable 


Cache flushing during SMM exit is accomplished by 
asserting the FLUSH # pin after the SMIACT # pin is 
de-asserted (within 1 CLK). To guarantee this be- 
havior, the constraints on setup and hold timings on 
the interaction of FLUSH# and SMIACT # as speci- 
fied for the Write-Back Enhanced IntelDX2 proces- 
sor should be followed. 


The WBINVD instruction should not be used to flush 
the cache when exiting SMM. Instead, the FLUSH # 
pin should be asserted after the SMIACT # pin is de- 
asserted (within 1 CLK). The cache coherency re- 
quirements associated with SMM and write-through 
vs. write-back caches apply to second level cache 
control designs as well. The appropriate second lev- 
el cache flushing is also required upon entering and 
exiting the SMM. 
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Write - 
Back State 
Flush Cache ,Cycles Save 
SMI# | 


SMIACT# 


Cache must be empty 


Normal 


SMM _ State 
Handler Resume 


Cache must 
be empty 
242202-71 


Figure 8-16. Write-Back Enhanced IntelDX2™ Processor Cache Flushing for 
~ Overlaid SMRAM upon Entry and Exit of Cached SMM 


Snoops During SMM 


Snoops cycles are allowed during SMM. However, 
because the SMRAM is always cached as a write- 
through, there can never be a snoop hit to a modi- 
fied line in the SMRAM address space. Consequent- 
ly, if there is a snoop hit to a modified line, it will 
correspond to the normal address space. In this 
case, even though SMIACT # is asserted, the mem- 
ory controller must drive the snoop write-back cycle 
to the normal memory space and not to the SMRAM 
address space. ; 


If the overlaid normal memory is_ cacheable, 


FLUSH# must be asserted when entering SMM, 
causing all modified lines of normal memory to be 
written back. As a result, there can not be a snoop 
hit to a modified line in the cacheable normal memo- 
ry space that is overlaid with the SMRAM space. 


If the overlaid normal memory is not cacheable, no 
flushing is necessary when entering SMM. If normal 
memory is not overlaid with SMRAM, no flushing is 
required upon entering SMM and it is possible that a 
snoop can hit a modified line cached from anywhere 
in normal memory space while the processor is in 
SMM. 
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8.6.3 A20M# PIN AND SMBASE RELOCATION 


Systems based on a PC-compatible architecture | 
contain a feature that enables the processor ad- — 
dress bit A20 to be forced to 0. This limits physical 
memory to a maximum of 1 Mbyte, and is provided 
to ensure compatibility with those programs that re- 
lied on the physical address wrap around functionali- 
ty of the 8088 processor. The A20M# pin on In- 
tel486 processors provides this function. When 


_ A20M# is active, all external bus cycles will drive 


A20M # low, and all internal cache accesses will be 
performed with A20M # low. 


The A20M # pin is recognized while the processor is 
in SMM. The functionality of the A2Z0M# input must 
be recognized in the following two instances: 


1. If the SMM handler needs to access system 
memory space above 1 Mbyte (for example, when 
saving memory to disk for a zero-volt suspend), 
the A20M# pin must be de-asserted before the 
memory above 1 Mbyte is addressed. 


2. If SMRAM has been relocated to address space 
above 1 Mbyte, and A20M# is active upon enter- 
ing SMM, the processor will attempt to access 
SMRAM at the relocated address, but with A20 
low. This could cause the system to crash, be- 
cause there would be no valid SMM interrupt han- 
dler at the accessed location. 
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In order to account for the above two situations, the 
system designer must ensure that A20M# is de-as- 
serted on entry to SMM. A20M# must be driven in- 
active before the first cycle of the SMM state save, 
and must be returned to its original level after the 
last cycle of the SMM state restore. This can be 


done by blocking the assertion of A2Q0M# whenever 
SMIACT # is active. 


8.6.4 PROCESSOR RESET DURING SMM 


The system designer should take into account the 
following restrictions while implementing the proces- 
sor RESET logic. 


1. When running software written for the 80286 
processor a processor SRESET is used to switch 
the processor from Protected mode to Real 
mode. Note that SRESET has a higher interrupt 
priority than SMIACT #. When the processor is in 
SMM, the SRESET to the processor during SMM 
should be blocked until the processor exits SMM. 
SRESET must be blocked beginning from the 
time when SMI# is driven active and ending at 
least 20 CLK cycles after SMIACT # is de-assert- 
ed. Be careful not to block the global system 
RESET, which may be necessary to recover from 
a system crash. 


2. During execution of the RSM instruction to exit 
SMM, there is a small time window between the 
de-assertion of SMIACT # and the completion of 
the RSM microcode. If SRESET is asserted dur- 
ing this window, it is possible that the SMRAM 
space will be violated. The system designer must 
guarantee that SRESET is blocked until at least 
20 processor clock cycles after SMIACT# has 
been driven inactive. 


3. Any request for a processor SRESET for the pur- 
pose of switching the processor from Protected 
mode to Real mode must be acknowledged after 
the processor has exited SMM. In order to main- 
tain software transparency, the system logic must 
latch any SRESET signals that are blocked during 
SMM. 


8.6.5 SMM AND SECOND LEVEL WRITE 
BUFFERS 


Before an Intel486 processor enters SMM, it emp- 
ties its internal write buffers. This is necessary so 
that the data in the write buffers is written to normal 
memory space, not SMM space. Once the proces- 
sor is ready to begin writing an SMM state save to 
SMRA\M, it asserts the SMIACT # signal. SMIACT # 
may be driven active by the processor before the 
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system memory controller has had an opportunity to 
empty the second level write buffers. 


To prevent the data from these second level write 
buffers from being written to the wrong location, the 
system memory controller needs to direct the mem- 
ory write cycles to either SMM space or normal 
memory space. This can be accomplished by saving 
the status of SMIACT# along with the aacless for 
each word in the write buffers. 


8.6.6 NESTED SMI#s AND I/O RESTART 


Special care must be taken when executing an SMM 
handler for the purpose of restarting an I/O instruc- 
tion. When the processor executes a RSM instruc- 
tion with the |/O restart slot set, the restored EIP is 
modified to point to the instruction immediately pre- 
ceding the SMI # request, so that the I/O instruction 
can be re-executed. If a new SMI# request is re- 
ceived while the processor is executing an SMM 
handler, the processor will service this SMI#_ re- 
quest before restarting the original |/O instruction. If 
the I/O restart slot is set when the processor exe- 
cutes the RSM instruction for the second SMM han- 
dler, the RSM microcode will decrement the 
restored EIP again. EIP now points to an address 
different from the originally interrupted instruction, 
and the processor will begin execution of the inter- 
rupted application code at an incorrect entry point. 


To prevent this from occurring, the SMM handler 
routine must not set the I/O restart slot during 
the second of two consecutive SMM handlers. 


8.7 SMM Software Considerations 


8.7.1 SMM CODE CONSIDERATIONS 


The default operand size and the default address 
size are 16 bits; however, operand-size override and 
address-size override prefixes can be used as need- 
ed to directly access data anywhere within the 
4-Gbyte logical address space. 


With operand-size override prefixes, the SMM han- 
dler can use jumps, calls, and returns, to transfer 
control to any location within the 4-Gbyte space. 
Note, however, the following restrictions: 


e Any control transfer that does not have an oper- 
and-size override prefix truncates EIP to 16 low- 
order bits. 
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° Due to the Real mode style of base-address for- 
mation, a far jump or call cannot transfer control 


to a segment with a base address of more eas 


20 bits (one megabyte). 


8.7.2 EXCEPTION HANDLING 


Upon entry into SMM, external interrupts that require © 


handlers are disabled (the IF bit in the EFLAGS is 
cleared). This is necessary because, while the proc- 
essor is in SMM, it is running in a separate memory 
space. Consequently the vectors stored in the inter- 
rupt descriptor table (IDT) for the prior mode are not 
applicable. Before allowing exception handling (or 
software interrupts), the SMM program must initial- 
ize new interrupt and exception vectors. The inter- 
rupt vector table for SMM has the same format as 
for Real mode. Until the interrupt vector table is cor- 


rectly initialized, the SMM handler must not generate 


an exception (or software interrupt). Even though 
hardware interrupts are disabled, exceptions and 
software interrupts can still occur. Only a correctly 
written SMM handler can prevent internal excep- 
tions. When new exception vectors are initialized, in- 
ternal exceptions can be serviced. The following are 
the restrictions: 


1. Due to the Real mode style of base address for- 
mation, an interrupt or exception cannot transfer 
control to a segment with a base address of more 
that 20 bits. 


2. An interrupt or exception cannot transfer control 
to a segment offset of more than 16 bits 
(64 Kbytes). 


3. If exceptions or interrupts are allowed to occur, 
only the low order 16 bits of the return address 
(EIP) are pushed onto the stack. If the offset of 
the interrupted procedure is greater than 
64 Kbytes, it is not possible for the interrupt/ 
exception handler to return control to that proce- 
dure. (One work-around could be to perform soft- 
ware adjustment of the return address on the 
stack.) 
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4. The SMBASE Relocation feature affects the way 


the processor will return from an. miSHUR: or ex- - 


ception during an SMI # handler. 


8.7.3 HALT DURING SMM 


HALT should not be executed during SMM, unless 
interrupts have been enabled (see section 8.7.2. 
‘Exception Handling‘). Interrupts are disabled in 
SMM and INTR, NMI, and SMI# are the only events 
that take the processor out of HALT. | 


8.7.4 RELOCATING SMRAM TO AN ADDRESS 
ABOVE ONE MEGABYTE 


Within SMM (or Real mode), the segment base reg- 
isters can only be updated by changing the segment 
register. The segment registers contain only 16 bits, 
which allows only 20 bits to be used for a segment 
base address (the segment register is shifted left 
four bits to determine the segment base address). If 
SMRAM is relocated to an address above one 
megabyte, the segment registers can no ngs be 
initialized to point to SMRAM. 


_ These areas can still be accessed by using address 


override prefixes to generate an offset to the correct 
address. For example, if the SMBASE has been re- 
located immediately below 16M, the DS and ES reg- 
isters are still initialized to OOOO OOOOH. We can still 
access data in SMRAM by using 32-bit a cl 
registers: 


364K segment 
simmediately 
sbelow 16M 


mov esi, OOFFxxxxH 


mov ax,ds:[esil] 
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9.0 HARDWARE INTERFACE 


9.1 Introduction 


The Intel486 processor has separate parallel buses 
for data and addresses. The bidirectional data bus is 
32 bits in width. The address bus consists of two 
components: 30 address lines (A2-—A31) and 4-byte 
enable lines (BEO #-—-BE3#). The address lines form 
the upper 30 bits of the address and the byte en- 
ables select individual bytes within a 4-byte location. 
The address lines are bidirectional for use in cache 
line invalidations. (See Figure 9-1.) 


The Intel486 processor’s burst bus mechanism en- 
ables high-speed cache fills from external memory. 
Burst cycles can strobe data into the processor ata 
rate of one item every clock. Non-burst cycles have 
a maximum rate of one item every two clocks. Burst 
cycles are not limited to cache fills: all read bus cy- 
cles requiring more than a single data cycle can be 
bursted. The Write-Back Enhanced IntelDX2™ 
processor can also burst write cycles. 


During bus hold, the Intel486 processor relinquishes 
control of the local bus by floating its address, data 
and control buses. The Intel486 processor has an 
address hold feature in addition to bus hold. During 
address hold, only the address bus is floated, the 
data and control buses can remain active. Address 
hold is used for cache line invaiidations. 


The Intel486 supports the IEEE 1149.1 boundary 
scan. 


This section provides a brief description of the In- | 


tel486 processor input and output signals arranged 
by functional groups. The # symbol at the end of a 
signal name indicates that the active or asserted 
state occurs when the signal is at a low voltage. 
When a # is not present after the signal name, the 
signal is active at high voltage level. The term 
“ready” is used to indicate that the cycle is terminat- 
ed with RDY # or BRDY#. 
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This section and section 10, “Bus Operation,” de- 
scribe bus cycles and data cycles. A bus cycle is at 
least two-clocks long and begins with ADS# active 
in the first clock and RDY # and/or BRDY # active in 


~ the last clock. Data is transferred to or from the In- 
tel486 processor during a data cycle. A bus cycle 


contains one or more data cycles. 


9.2 Signal Descriptions 


9.2.1 CLOCK (CLK) 


CLK provides the fundamental timing and the inter- 
nal operating frequency for the Intel486 processor. 
All external timing parameters are specified with re- 
spect to the rising edge of CLK. 


The Intel486 processor can operate over a wide fre- 
quency range, however the CLK frequency cannot 
change rapidly while RESET is inactive. The CLK 
frequency must be stable for proper chip operation 


_ because a single edge of CLK is used internally to 


generate two phases. CLK only needs TTL levels for 
proper operation. Figure 9-2 illustrates the CLK 
waveform. 


9.2.2 INTELDX4 PROCESSOR CLOCK 
MULTIPLIER SELECTABLE INPUT 
(CLKMUL) 


The IntelDX4 processor differs from the IntelDX2 
processor in that it provides for two internal clock 
multiplier ratios: speed doubled mode and speed tri- 
pled mode. Speed doubled mode is identical to the 
IntelDX2 processor mode of operation where the in- 
ternal core is operating at twice the external bus fre- 
quency. Selecting speed tripled mode causes the in- 
ternal core frequency to operate at three times the 
external bus frequency. The IntelDX4 processor de- 
termines the desired clock multiplier ratio by sam- 
pling the status of the CLKMUL input during cold 
(power on) processor resets. The clock multiplier 
ratio cannot be changed during warm resets. 
Also, SRESET cannot be used to select the clock 
multiplier ratio. 
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Figure 9-1. Functional Signal Groupings 
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242202-73 
tx = input setup times 
ty = input hold times, output float, valid and hold times 


Figure 9-2. CLK Waveform 


To determine which clock multiplier is desired, the 
IntelDX4 processor samples the status of CLKMUL 
while RESET is active. If the CLKMUL input is driven 
low during RESET, the frequency of the core will be 
twice the external bus frequency (speed doubled 
mode). If driven high or left floating, speed tripled 
mode is selected. (See Table 9-1.) In order to allow 
maximum flexibility, CLKMUL can be jumper-config- 
urable to either Voc (speed tripled mode) or Vss 
(speed doubled mode). (See Figure 9-3.) 


Table 9-1. Clock Multiplier Selection 


Internal 
Clock 
Freq. 
(MHz) 


External 
Clock 
Freq. 
(MHz) 


CLKMUL 
Multiplier 


Vcc or 3 25 75 
Not Driven 33 100 


Intel486™ PROCESSOR FAMILY 


intelDX4 
Processor 


242202-74 | 


Figure 9-3. Voitage Detect (VOLDET) Sense Pin 


The clock multiplier selection method is fully back- 
ward compatible with Intel486 processor-based sys- 
tem designs. The CLKMUL signal occupies a pin 
which is labeled as an ‘INC‘ on other Intel486 proc- 
essors. Therefore, this pin is not driven in other In- 
tel486 processor system designs. The IntelDX4 
processor contains an internal pull-up resistor on the 
CLKMUL signal. As shown in Table 9-1, when 
CLKMUL is not driven, the internal core frequency 
defaults to speed tripled mode. 


The internal pull-up resistor on the CLKMUL pin is 
disabled while the IntelDX4 processor is in the Stop 
Grant or Stop Clock modes. This prevents a low lev- 
el DC current path from drawing current while in the 
Stop Grant or Stop Clock states on a system with 
CLKMUL connected to Vss. 


9.2.3 ADDRESS BUS (A31-A2, BEO# -BE3+#) 


A31-A2 and BEO#-BE3# form the address bus 
and provide physical memory and |/O port address- 
es. The Intel486 processor is capable of addressing 
4 gigabytes of physical memory space (O0000Q000H 
through FFFFFFFFH), and 64 Kbytes of I/O address 
space (OOO0O0000H through OOOOFFFFH). A31-A2 
identify addresses to a 4-byte location. BEO#- 
BE3# identify which bytes within the 4-byte location 
are involved in the current transfer. 
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Addresses are driven back into the Intel486 proces- © 


sor over A31-—A4 during cache line invalidations. 
The address lines are active HIGH. When used as 
inputs into the processor, A31-A4 must meet the 
setup and hold times, too and te3. A31—-A2 are not 
driven during bus or address hold. 


The byte enable outputs, BEO#-BE3#, determine 
which bytes must be driven valid for read and write 
cycles to external memory. 

e BE3# applies to D24-D31 

e BE2# applies to D16-—D23 

e BE1# applies to D8-D15 

e BEO# applies to DO-D7 


BEO#-—BE3# can be decoded to generate AO, A1 
and BHE# signals used in 8- and 16-bit systems 
(see Table 10-5). BEO#-BE3# are active LOW and 
are not driven during bus hold. 


9.2.4 DATA LINES (D31-D0) 


The bidirectional lines, D31—D0, form the data bus 
for the Intel486 processor DO-D7 define the least 
significant byte and D24-D31 the most significant 
byte. Data transfers to 8- or 16-bit devices are possi- 
ble using the data bus sizing feature controlled by 


the BS8# or BS16# input pins. D31—D0 are active 


HIGH. For reads, D31-D0 must meet the setup and 
hold times, too and te3. D31—D0 are not driven dur- 
ing read cycles and bus hold. 


9.2.5 PARITY 
Data Parity Input/Outputs (DPO-DP3) 


DPO-DP3 are the data parity pins for the processor. 
There is one pin for each byte of the data bus. Even 
parity is generated or checked by the parity genera- 
tors/checkers. Even parity means that there are an 
even number of HIGH inputs on the eight corre- 
sponding data bus pins and parity pin. — 


Data parity is generated on ail write data cycles with 
the same timing as the data driven by the Intel486 
processor. Even parity. information must be driven 
back to the Intel486 processor on these pins with 
the same timing as read information to insure that 
the correct parity check status is indicated by the 
Intel486 processor. 


The values read on these pins do not affect program 


execution. It is the responsibility of the system to 
take appropriate actions if a parity error occurs. 
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Input signals on DPO-—DP3 must meet setup and 
hold times too and tog for proper operation. 


Parity Status Output (PCHK #) 


Parity status is driven on the PCHK # pin, and a pari- 
ty error is indicated by this pin being LOW. PCHK # 
is driven the clock after ready for read operations to 
indicate the parity status for the data sampled at the 
end of the previous clock. Parity is checked during 
code reads, memory reads and !/O reads. Parity is 
not checked during interrupt acknowledge cycles. 
PCHK# only checks the parity status for enabled 
bytes as indicated by the byte enable and bus size 
signals. It is valid only in the clock immediately after 
read data is returned to the Intel486 processor. At all 
other times it is inactive (HIGH). PCHK# is never 
floated. 3 


Driving PCHK # is the only effect that bad input pari- 


_ ty has on the Intel486 processor. The Intel486 proc- 


essor will not vector to a bus error interrupt when - 
bad data parity is returned. In systems that will not 
employ parity, PCHK# can be ignored. In:systems 
not using parity, DPO-DP3 should be connected to 
Vcc through a pull-up resistor. 


9.2.6 BUS CYCLE DEFINITION 
M/lO#, D/C#, W/R# Outputs 


M/lO#, D/C# and W/R# are the primary bus cycle 
definition signals. They are driven valid as the ADS # 
signal is asserted. M/IO# distinguishes between 
memory and |/O cycles, D/C# distinguishes be- 
tween data and control cycles and W/R# distin- 
guishes between write and read cycles. 


Bus cycle definitions as a function of M/IO#, D/C# 
and W/R# are given in Table 9-2. Note there is a 
difference between the Intel486 processor and In- 
tel386™ processor bus cycle definitions. The halt 
bus cycle type has been moved to location 001 in 
the Intel486 processor from location 101 in the In- 
tel386 processor. Location 101 is now reserved and 
will never be generated by the Intel486 processor. 


Special bus cycles are discussed in section 10.2.11, 
“Special Bus Cycles’’. , 
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Table 9-2. ADS # Initiated Bus Cycle Definitions 


0 | 0 | 0 _| Interrupt Acknowledge | 
a oe ee Halt/Special Cycle 

poo ft | o | vORead 
po ft ft | vOWrte 
ae clk es anc 


Bus Lock Output (LOCK #) 


LOCK # indicates that the Intel486 processor is run- 
ning a read-modify-write cycle where the external 
bus must not be relinquished between the read and 
write cycles. Read-modify-write cycles are used to 
implement memory-based semaphores. Multiple 
reads or writes can be locked. 


When LOCK # is asserted, the current bus cycle is 
locked and the Intel486 processor should be al- 
lowed exclusive access to the system bus. LOCK # 
goes active in the first clock of the first locked bus 
cycle and goes inactive after ready is returned indi- 
cating the last locked bus cycle. 


The Intel486 processor will not acknowledge bus 
hold when LOCK # is asserted (though it will allow 
an address hold). LOCK # is active LOW and is float- 
ed during bus hold. Locked read cycles will not be 
transformed into cache fill cycles if KEN # is. re- 
turned active. Refer to section 10.2.6, “Locked Cy- 
cles,” for a detailed discussion of Locked bus cy- 
cles. 


Pseudo-Lock Output (PLOCK #) 


The pseudo-lock feature allows atomic reads and 
writes of memory operands greater than 32 bits. 
These operands require more than one cycle to 
transfer. The Intel486 processor asserts PLOCK# 
during segment table descriptor reads (64 bits) and 
cache line fills (128 bits). 


When PLOCK # is asserted no other master will be 
given control of the bus between cycles. A bus hold 
request (HOLD) is not acknowledged during pseudo- 
locked reads and writes, with one exception. During 
non-cacheable non-bursted code prefetches, HOLD 
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is recognized on memory cycle boundaries even 
though PLOCK# is asserted. The Intel486 proces- 
sor will drive PLOCK# active until the addresses for 
the last bus cycle of the transaction have been driv- 
en regardless of whether BRDY # or RDY# are re- 
turned. 


A pseudo-locked transfer is meaningful only if the 
memory operand is aligned and if its completely con- 
tained within a single cache line. 


Because PLOCK # is a function of the bus size and 
KEN # inputs, PLOCK# should be sampled only in 
the clock ready is returned. This pin is active LOW 
and is not driven during bus hold. Refer to section 
10.2.7, “Pseudo-Locked Cycles.” 


9.2.6.1 PLOCK # Floating Point Considerations 


For processors with an on-chip FPU, the following 
must be noted for PLOCK # operation. A 64-bit float- 
ing point number must be aligned to an 8-byte 
boundary to guarantee an atomic access. Normally 
PLOCK# and BLAST# are inverse of each other. 
However, during the first cycle of a 64-bit floating 
point write, both PLOCK# and BLAST # will be as- 
serted. Intel486 processors with on-chip FPUs also 
assert PLOCK# during floating point long reads and 
writes (64 bits), segmentable description reads (64 
bits) and code line fills (128 bits). 


9.2.7 BUS CONTROL 


The bus control signals allow the Intel486 processor 
to indicate when a bus cycle has begun, and allow 
other system hardware to control burst cycles, data 
bus width and bus cycle termination. 


Address Status Output (ADS #) 


The ADS# output indicates that the address and 
bus cycle definition signals are valid. This signal will 
go active in the first clock of a bus cycle and go 
inactive in the second and subsequent clocks of the 
cycle. ADS# is also inactive when the bus is idle. 


ADS# is used by the external bus circuitry as the 
indication that the Intel486 processor has started a 
bus cycle. The external circuit must sample the bus 
cycle definition pins on the next rising edge of the 
clock after ADS # is driven active. 


ADS # is active LOW and is not driven during bus 
hold. 
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Non-burst Ready Input (RDY #) 


RDY # indicates that the current bus cycle is com- 
plete. In response to a read, RDY# indicates that 
the external system has presented valid data on the 
data pins. In response to a write request, RDY # indi- 
cates that the external system has accepted the In- 
tel486 processor data. RDY# is ignored when the 
bus is idle and at the end of the first clock of the bus 
cycle. Because RDY# is sampled during address 
hold, data can be returned to the processor when 
AHOLD is active. 


RDY# is active LOW, and is not provided with an 
internal pull-up resistor. This input must satisfy setup 
and hold times ty¢ and t;7 for proper chip operation. 


9.2.8 BURST CONTROL 
Burst Ready Input (BRDY #) 


BRDY # performs the same function during a burst 
cycle that RDY # performs during a non-burst cycle. 
BRDY # indicates that the external system has pre- 
sented valid data on the data pins in response to a 
read or that the external system has accepted the 
Intel486 processor data in response to a write. 
’ BRDY # is ignored when the bus is idle and at the 
end of the first clock in a bus cycle. 


During a burst cycle, BRDY # will be sampled each 
clock, and if active, the data presented on the data 


bus pins will be strobed into the Intel486 processor. . 


ADS# is negated during the second through last 
data cycles in the burst, but address lines A2-A3 
and byte enables will change to reflect the next data 
item expected by the Intel486 processor. 


If RDY# is returned simultaneously with BRDY#, 
BRDY # is ignored and the burst cycle is premature- 
ly aborted. An additional complete bus cycle will be 
initiated after an aborted burst cycle if the cache line 
fill was not complete. BRDY # is treated as a normal 
ready for the last data cycle in a burst transfer or for 
non-burstable cycies. Refer to section 10.2.2, “Multi- 
ple and Burst Cycle Bus Transfers,”’ for burst cycle 
timing. 


BRDY # is active LOW and is provided with a small — 


internal pull-up resistor. BRDY# must satisfy the 
_setup and hold times ty, and t7. 
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BLAST # indicates that the next time BRDY # is re- 
turned it will be treated as a normal RDY #, terminat- 
ing the line fill or other multiple-data-cycle transfer. . 
BLAST # is active for all bus cycles regardless of 
whether they are cacheable or not. This pin is active 
LOW and is not driven during bus hold. 


Burst Last Output (BLAST #) 


9.2.9 INTERRUPT SIGNALS 


The interrupt signals can interrupt or suspend exe- 
cution of the processor’s current instruction stream. — 


Reset Input (RESET) 


The RESET input must be used at power-up to ini- 
tialize the processor. The Reset input forces the 
processor to begin execution at a known state. The 
processor cannot begin execution of instructions un- | 
til at least 1 ms after Vcc and CLK have reached 
their proper DC and AC specifications. The RESET 
pin should remain active during this time to ensure 
proper processor operation. However, for warm 
boot-ups RESET should remain active for at least 15 
CLK periods. RESET is active HIGH. RESET is asyn- 
chronous but must meet setup and hold times too 
and to; for recognition in any specific clock. 


RESET will reset SMBASE to the default value of 
30000H. If SMBASE relocation is not used, the RE- 
SET signal can be used as the only reset. (See sec- 
tion 8, “System Management Mode Architecture.”’) 


The Intel486 processor will be placed in the Power 


Down Mode if UP# is sampled active at the falling 
edge of RESET. 


Soft Reset Input (SRESET) — 


The SRESET (Soft RESET) input, has the same 
functions as RESET, but does not change the 
SMBASE, and UP# is not sampled on the falling 
edge of SRESET. If SMBASE relocation is used by 
the system, the soft resets should be handled using 
the SRESET input. The SRESET signal should not 
be used for the cold boot-up power-on reset. 


The SRESET input pin is provided to save the status 
of SMBASE during intel 286 processor-compatible 
mode change. SRESET leaves the location of 


| 
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SMBASE intact while resetting other units, including 
the on-chip cache. (See section 9.2.18.4, “Soft Re- 
set,” for Write-Back Enhanced IntelDX2 processor 
differences.) For compatibility, the system should 
use SRESET to flush the on-chip cache. The 
FLUSH# input pin should be used to flush the on- 


chip cache. SRESET should not be used to initiate 
test modes. 


System Management Interrupt Request Input 
(SMI #) | 


SMI# is the system management mode interrupt re- 
quest signal. The SMI# request is acknowledged by 
the SMIACT # signal. After the SMI # interrupt is rec- 
ognized, the SMI# signal will be masked internally 
until the RSM instruction is executed and the inter- 
rupt service routine is complete. SMI# is falling- 
edge sensitive after internal synchronization. 


The SMI# input must be held inactive for at least 
four clocks after it is asserted to reset the edge trig- 
gered logic. SMI! # is provided with a pull-up resistor 
to maintain compatibility with designs which do not 
use this feature. SMI# is an asynchronous signal, 
but setup and hold times, tag and to1, must be met in 
order to guarantee recognition on a specific clock. 


System Management 
(SMIACT #) 


Mode Active Output 


SMIACT # indicates that the processor is operating 
in System Management Mode. The processor as- 
serts SMIACT # in response to an SMI interrupt re- 
quest on the SMI# pin. SMIACT # is driven active 
after the processor has completed all pending write 
cycles (including emptying the write buffers), and be- 
fore the first access to SMRAM when the processor 
saves (writes) its state (or context) to SMRAM. 
SMIACT# remains active until the last access to 
SMRAM when the processor restores (reads) its 
state from SMRAM. The SMIACT # signal does not 
float in response to HOLD. The SMIACT # signal is 
used by the system logic to decode SMRAM. 


Maskable Interrupt Request Input (INTR) 


INTR indicates that an external interrupt has been 
generated. Interrupt processing is initiated if the IF 
flag is active in the EFLAGS register. 


The Intel486 processor will generate two locked in- 
terrupt acknowledge bus cycles in response to as- 
serting the INTR pin. An 8-bit interrupt number will 
be latched from an external interrupt controller at 
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the end of the second interrupt acknowledge cycle. 
INTR must remain active until the interrupt acknowl- 
edges have been performed to assure program in- 
terruption. Refer to section 10.2.10, “Interrupt Ac- 
knowledge,’ for a detailed discussion of interrupt ac- 
knowledge cycles. 


The INTR pin is active HIGH and is not provided with 
an internal pull-down resistor. INTR is asynchro- 
nous, but the INTR setup and hold times, tao and toy, 
must be met to assure recognition on any specific 
clock. 


Non-maskable Interrupt Request Input (NMI) 


NMI is the non-maskable interrupt request signal. 
Asserting NMI causes an interrupt with an internally 
supplied vector value of 2. External interrupt ac- 
knowledge cycles are not generated because the 
NMI interrupt vector is internally generated. When 
NMI processing begins, the NMI signal will be 
masked internally until the IRET instruction is exe- 
cuted. 


NMI is rising edge sensitive after internal synchroni- 
zation. NMI must be held LOW for at least four CLK 
periods before this rising edge for proper operation. 
NMI is not provided with an internal pull-down resis- 
tor. NMI is asynchronous but setup and hold times, 
top and to; must be met to assure recognition on any 
specific clock. 


Stop Clock Interrupt Request Input (STPCLK #) 


The Intel486 processor provides an interrupt mecha- 
nism, STPCLK#, that allows system hardware to 
control the power consumption of the processor by 
stopping the internal clock (output of the PLL) to the 
processor core in a controlled manner. This low- 
power state is called the Stop Grant state. In addi- 
tion, the STPCLK# interrupt allows the system to 
change the input frequency within the specified 
range or completely stop the CLK input frequency 
(input to the PLL). If the CLK input is completely 
stopped, the processor enters into the Stop Clock 
state-the lowest power state. If the frequency is 
changed or stopped, the Intel486 processor will not 
return to the Stop Grant state until the CLK input has 
been running at a constant frequency for the time 
period necessary to stabilize the PLL (minimum of 
1 ms). 


The Intel486 processor will generate a Stop Grant 


bus cycle in response to the STPCLK # interrupt re- 
quest. STPCLK# is active LOW and is provided 
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with an internal pull-up resistor. STPCLK# is an 
asynchronous signal, but must remain active until 
the processor issues the Stop Grant bus cycle. (Re- 
_fer to section 10.2.11.3, “Stop Grant Indication Cy- 
cle.”’) 


9.2.10 BUS ARBITRATION SIGNALS 


This section describes the mechanism by which the 
processor relinquishes control of its local bus when 
requested by another bus master. 


Bus Request Output (BREQ) 


The Intel486 processor asserts BREQ whenever a 
bus cycle is pending internally. Thus, BREQ is al- 
ways asserted in the first clock of a bus cycle, along 
with ADS #. Furthermore, if the Intel486 processor is 
currently not driving the bus (due to HOLD, AHOLD, 
or BOFF#), BREQ is asserted in the same clock 
that ADS # would have been asserted if the Intel486 
processor were driving the bus. After the first clock 
of the bus cycle, BREQ may change state. It will be 
asserted if additional cycles are necessary to com- 
plete a transfer (via BS8#, BS16#, KEN#), or if 
more cycles are pending internally. However, if no 
additional cycles are necessary to complete the cur- 
rent transfer, BREQ can be negated before ready 
comes back for the current cycle. External logic can 
use the BREQ signal to arbitrate among multiple 
processors. This pin is driven regardless of the state 
of bus hold or address hold. BREQ is active HIGH 
and is never floated. During a hold state, internal 
events may cause BREQ to be de-asserted prior to 
any bus cycles. 


Bus Hold Request Input (HOLD) 


HOLD allows another bus master complete control 
of the Intel486 processor bus. The Intel486 proces- 
sor will respond to an active HOLD signal by assert- 
ing HLDA and placing most of its output and input/ 
output pins in a high impedance state (floated) after 
completing its current bus cycle, burst cycle, or se- 
quence of locked cycles. In addition, if the Intel486 
processor receives a HOLD request while perform- 
ing a code fetch, and that cycle is backed off 
(BOFF#), the Intel486 processor will recognize 


HOLD before restarting the cycle. The code fetch . 


can be non-cacheable or cacheable and non-burst- 
ed or bursted. The BREQ, HLDA, PCHK# and 


FERR# pins are not floated during bus hold. The | 


Intel486. processor will maintain its bus in this state 
until the HOLD is de-asserted. Refer to section 
10.2.9, “Bus Hold,” for timing diagrams for bus hold 
cycles and HOLD request acknowledge during 
BOFF#. 
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Unlike the Intel386 processor, the Intel486 proces- 
sor will recognize HOLD during reset. Pull-up resis- 
tors are not provided for the outputs that are floated . 
in response to HOLD. HOLD is active HIGH and is 
not provided with an internal pull-down resistor. 
HOLD must satisfy setup and hold times t;g and ty9 
for proper chip operation. 


Bus Hold Acknowledge Output (HLDA) 


- HLDA indicates that the Intel486 processor has giv- 


en the bus to another local bus master. HLDA goes 
active in response to a hold request presented on 
the HOLD pin. HLDA is driven active in the same 
clock that the Intel486 processor floats its bus. 


HLDA will be driven inactive when leaving bus hold 


and the Intel486 processor will resume driving the 
bus. The Intel486 processor wiil not cease internal 
activity during bus hold because the internal cache 
will satisfy the majority of bus requests. HLDA is ac- 
tive HIGH and remains driven during bus hold. 


Backoff input (BOFF #) 


Asserting the BOFF # input forces the Intel486 proc- 
essor to release control of its bus in the next clock. 
The pins floated are exactly the same as in re- 
sponse to HOLD. The response to BOFF¥ differs 
from the response to HOLD in two ways: First, the 
bus is floated immediately in response to BOFF# 
while the Intel486 processor completes the current 
bus cycle before floating its bus in response to 
HOLD. Second the Intel486 processor does not as- 
sert HLDA in response to BOFF #. 


The Intel486 processor remains in bus hold until 
BOFF# is negated. Upon negation, the Intel486 
processor restarts the bus cycle aborted when 
BOFF # was asserted. To the internal execution en- 
gine the effect of BOFF # is the same as inserting a 
few wait states to the original cycle. Refer to section 
10.2.12, ‘Bus Cycle Restart,‘ for a description of bus 
cycle restart. 


Any data returned to the Intel486 processor while 
BOFF # is asserted is ignored. BOFF# has higher 
priority than RDY# or BRDY#. If both BOFF# and 
ready are returned in the same clock, BOFF# takes 
effect. If BOFF # is asserted while the bus is idle, the 
Intel486 processor will float its bus in the next clock. 
BOFF # is active LOW and must meet setup and 
hold times t;g and tyg for proper chip operation. 
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9.2.11 CACHE INVALIDATION 


The AHOLD and EADS# inputs are used during 
cache invalidation cycles. AHOLD conditions the In- 
tel486 processor address lines, A4—A31, to accept 
an address input. EADS # indicates that an external 
address is actually valid on the address inputs. Acti- 
vating EADS# will cause the Intel486 processor to 
read the external address bus and perform an inter- 
nal cache invalidation cycle to the address indicat- 
ed. Refer to section 10.2.8, “Invalidate Cycle,” or 
cache invalidation cycle timing. 


Address Hold Request Input (AHOLD) 


AHOLD is the address hold request. It allows anoth- 
er bus master access to the Intel486 processor ad- 
dress bus for performing an internal cache invalida- 
tion cycle. Asserting AHOLD will force the Intel486 
processor to stop driving its address bus in the next 
clock. While AHOLD is active only the address bus 
will be floated, the remainder of the bus can remain 
active. For example, data can be returned for a pre- 
viously specified bus cycle when AHOLD is active. 
The Intel486 processor will not initiate another bus 
cycle during address hold. Because the Intel486 
processor floats its bus immediately in response to 
AHOLD, an address hold acknowledge is not re- 
quired. If AHOLD is asserted while a bus cycle is in 
progress, and no readies are returned during the 
time AHOLD is asserted, the Intel486 processor will 
redrive the same address (that it originally sent out) 
once AHOLD is negated. 


AHOLD is recognized during reset. Because the en- 
tire cache is invalidated by reset, any invalidation 
cycles run during reset will be unnecessary. AHOLD 
is active HIGH and is provided with a small internal 
pull-down resistor. It must satisfy the setup and hold 
times tig and tig for proper chip operation. AHOLD 
also determines whether or not the built in self test 
features of the Intel486 processor will be exercised 
on assertion of RESET. (See section 11.1, “Built-In 
Self Test.’’) 


External Address Valid Input (EADS #) 


EADS # indicates that a valid external address has 
been driven onto the Intel486 processor address 
pins. This address will be used to perform an internal 
cache invalidation cycle. The external address will 
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be checked with the current cache contents. If the 
address specified matches any areas in the cache, 
that area will immediately be invalidated. 


An invalidation cycle may be run by asserting 
EADS# regardless of the state of AHOLD, HOLD 
and BOFF #. EADS # is active LOW and is provided 
with an internal pull-up resistor. EADS # must satisfy 
the setup and hold times t;2 and t13 for proper chip 
operation. 


9.2.12 CACHE CONTROL 
Cache Enable Input (KEN #) 


KEN# is the cache enable pin. KEN# is used to 
determine whether the data being returned by the 


current cycle is cacheable. When KEN# is active J 


and the Intel486 processor generates a cycle that 


can be cached (most any memory read cycle), the ™ 


cycle will be transformed into a cache line fill cycle. 


A cache line is 16 bytes long. During the first cycle of 
a cache line fill the byte-enable pins should be ig- 
nored and data should be returned as if all four byte 
enables were asserted. The Intel486 processor will 
run between 4 and 16 contiguous bus cycles to fill 
the line depending on the bus data width selected by 
BS8# and BS16#. Refer to section 10.2.3, ‘““Cache- 
able Cycles,” for a description of cache line fill cy- 
cles. 


The KEN # input is active LOW and is provided with 
a small internal pull-up resistor. It must satisfy the 
setup and hold times t;,4 and ty5 for proper chip op- 
eration. 


Cache Flush Input (FLUSH #) 


The FLUSH # input forces the Intel486 processor to 
flush its entire internal cache. FLUSH# is active 
LOW and need only be asserted for one ciock. 
FLUSH # is asynchronous but setup and hold times 
tog and to; must be met for recognition on any spe- 
cific clock. 


FLUSH# also determines whether or not the tri- 
state test mode of the Intel486 processor will be in- 
voked on assertion of RESET. (See section 11.4, 
“Tri-State Output Test Mode.”’) 


2-151 


Intel486™ PROCESSOR FAMILY 


9.2.13 PAGE CACHEABILITY (PWT, PCD) 


The PWT and PCD output signals correspond to two 
user attribute bits in the page table entry. When pag- 
' ing is enabled, PWT and PCD correspond to bits 3 
and 4 of the page table entry respectively. For cy- 
cles that are not paged when paging is enabled (for 
example !/O cycles) PWT and PCD correspond to 
bits 3 and 4 in control register 3. When paging is 
disabled, the Intel486 processor ignores the PCD 
and PWT bits and assumes they are zero for the 
purpose of caching and driving PCD and PWT. 


PCD is masked by the CD (cache disable) bit in con- 
trol register O (CRO). When CD=1 (cache line fills 
disabled) the Intel486 processor forces PCD HIGH. 
When CD=0, PCD is driven with the value of the 
page table entry/directory. 


The purpose of PCD is to provide a cacheable/non- 
cacheable indication on a page by page basis. The 
Intel486 processor will not perform a cache fill to 
any page in which bit 4 of the page table entry is set. 
PWT corresponds to the write-back bit and can be 
used by an external cache to provide this functionali- 
ty. PCD and PWT bits are assigned to be zero during 
real mode or whenever paging is disabled. Refer to 
section 7.6, ‘Page Cacheability,, for a discussion of 
non-cacheable pages. 


PCD and PWT have the same timing as the cycle 

definition pins (M/IO#, D/C#, W/R#). PCD and 

PWT are active HIGH and are not driven during bus 
hold. 


NOTE: 
The PWT and PCD bits function differently i in 
the write-back mode of the Write-Back En- 
hanced IntelDX2 processor. (See section 
7.6.1.) 


9.2.14 UPGRADE PRESENT (UP#) 


The Upgrade Present input detects the presence of 
the upgrade processor, then powers down the core, 
and tri-states all outputs of the original processor, so 
that the original processor consumes very low cur- 
rent. This state is known as Upgrade Power Down 


~ Mode. UP # is active LOW and sampled at all times, 


including after power-up and during reset. 
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9.2.15 NUMERIC ERROR REPORTING (FERR#, 
_IGNNE #) 


To allow PC-type floating point error reporting, In- 


tel486 DX, IntelDX2, and IntelDX4 processors pro- 
vide two pins, FERR# and IGNNE#. 


Floating Point Error Output (FERR #) 


- The processor asserts FERR# whenever an un- 


masked floating point error is encountered. FERR# 


‘is similar to the ERROR # pin on the Intel387 math 


coprocessor. FERR# can be used by external logic 
for PC-type floating point error reporting in systems 
with an Intel486 DX, IntelDX2, or IntelDX4 proces- 
sor. FERR# is active LOW and is not floated during 
bus hold. 


In some cases, FERR# is asserted when the next 
floating point instruction is encountered. In other 
cases, it is asserted before the next floating point 
instruction is encountered, depending on the execu- 
tion state of the instruction that caused the excep; 
tion. 


The following class of floating point exceptions drive 
FERR # at the time the exception occurs (i.e., before 
encountering the next floating point instruction): 


1. The stack fault, invalid operation, and denormal 
exceptions on all transcendental instructions, in- 
teger arithmetic instructions, FSQRT, FSCALE, 
FPREM(1), FXTRACT, FBLD, and FBSTP. 


_ 2. Any exceptions on store instructions (including in- 


teger store instructions). 


The following class of floating point exceptions drive 
FERR# only after encountering the next mene 
point instruction: 


1. Exceptions other than on all transcendental in- 


structions, integer arithmetic instructions, FSQRT, 
FSCALE, FPREM(1), FXTRACT, FBLD, and 
FBSTP. 


2. Any exception on all basic arithmetic, load, com- 
pare, and control instructions (i.e., all other in- 
structions). 


Ignore Numeric Error Input (IGNNE #) 


Intel486 DX, IntelDX2, and IntelDX4 processors will 
ignore a numeric error and continue executing non- 
control floating point instructions when IGNNE# is 
asserted, and FERR# is still activated. When de- 
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asserted, the processor will freeze on a non-control 
floating point instruction if a previous instruction 


caused an error. IGNNE# has no effect when the 
NE bit in control register 0 is set. 


The IGNNE # input is active LOW and provided with 
a small internal pull-up resistor. This input is asyn- 
chronous, but must meet setup and hold times tog 
and to; to insure recognition on any specific clock. 


9.2.16 BUS SIZE CONTROL (BS16#, BS8#) 


The BS16# and BS8 # inputs allow external 16- and 
8-bit buses to be supported with a small number of 
external components. The Intel486 processor sam- 
ples these pins every clock. The value sampled in 
the clock before ready determines the bus size. 
When asserting BS16# or BS8# only 16 or 8 bits of 
the data bus need be valid. If both BS16# and 
BS8# are asserted, an 8-bit bus width is selected. 


When BS16# or BS8# are asserted, the Intel486 
processor will convert a larger data request to the 
appropriate number of smailer transfers. The byte 
enables will also be modified appropriately for the 
bus size selected. 


BS16# and BS8# are active LOW and are provided 
with small internal pull-up resistors. BS16# and 
BS8# must satisfy the setup and hold times t;4 and 
t15 for proper chip operation. 


9.2.17 ADDRESS BIT 20 MASK (A20M#) 


Asserting the A20M# input causes the Intel486 
processor to mask physical address bit 20 before 
performing a lookup in the internal cache and before 
driving a memory cycle to the outside world. When 
A20M# is asserted, the Intel486 processor emu- 
lates the 1-Mbyte address wraparound that occurs 
on the 8086. A20M# is active LOW and must be 
asserted only when the processor is in real mode. 
The A20M# is not defined in Protected Mode. 
A20M # is asynchronous but should meet setup and 
hold times tog and to; for recognition in any specific 
clock. For correct operation of the chip, A20M# 
should not be active at the falling edge of RESET. 


A20M# exhibits a minimum 4 clock latency, from 
time of assertion to masking of the A20 bit. AZOM# 
is ignored during cache invalidation cycles. |/O 
writes require A20M# to be asserted a minimum of 
2 clocks prior to RDY being returned for the I/O 
write. This insures recognition of the address mask 
before the Intel486 processor begins execution of 
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the instruction following OUT. If A20M# is asserted 
after the ADS# of a data cycle, the A20 address 
signal is not masked during this cycle but is masked 
in the next cycle. During a prefetch (cacheable or 
not), if A20M # is asserted after the first ADS #, A20 
is not masked for the duration of the prefetch; even 
if BS16# or BS8# is asserted. 


9.2.18 WRITE-BACK ENHANCED INTELDX2 
PROCESSOR SIGNALS AND OTHER 
ENHANCED BUS FEATURES 


This section describes the pins that interface with 
the system to support the Enhanced Bus mode 
write-back features at system level. 


9.2.18.1 Cacheability (CACHE #) 


The CACHE # output indicates the internal cachea- 
bility on read cycles and a burst write-back on write 
cycles. CACHE # is asserted for cacheable reads, 
cacheable code fetches and write-backs. It is driven 
inactive for non-cacheable reads, special cycles, |/O 
cycles and write-through cycles. This is different 
from the PCD (page cache disable) pin. The opera- 
tional differences between CACHE# and PCD are 
listed in Table 9-3. See Table 9-4 for operational 
differences between CACHE# and other Intel486 
processor signals. 


Table 9-3. Differences between 
CACHE # and PCD 


[Bus Operation | caches | pod 
PCD(3) PCD(3) 

Replacement witetback| tow [low 

[snoopforced wrteback | tow | tow 


S-state write-through high same as 
PCD(3) 


I-state write-through (2) high same as 
PCD(3) 


a 
. Includes line fills and non-cacheable reads. During 
locked read cycles CACHE # is inactive. The non-cache- 
able reads may or may not be burst. 

2. Due to the non-allocate on write policy, this includes 
both cacheable and non-cacheable writes. PCD will dis- 
tinguish between the two, but CACHE # does not. 

3. This behavior is the same as the existing specification of 
the Intel486 processor in write-through mode. 
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Table 9-4. Write-Back Enhanced IntelDX2T™ 
Processor CACHE # vs. Other Intel486T™ 
Processor Signals 


rm Relation To This Signal. 
ADS # CACHE # is driven to valid state with 
ADS # 
RDY#, CACHE # is de-asserted with the first 
BRDY# | RDY# or BRDY# 
HLDA, CACHE # floats under these signals. 
BOFF # 


| KEN# The combination of CACHE # and 
KEN # determines if a read miss is 
_ converted into a cache line fill. 


9.2.18.2 Cache Flush (FLUSH #) 


FLUSH # is an existing pin that operates differently if 
the processor is configured as Enhanced Bus mode 
(write-back). In Enhanced Bus mode, it acts similar 
to the WBINVD instruction. In Enhanced Bus mode, 
FLUSH # is treated as an interrupt. It is sampled at 
each clock, but is recognized only on instruction 
boundary. Pending writes are completed before 
FLUSH # is serviced, and all prefetching is stopped. 
Depending on the number of modified lines in the 
cache, the flush could take up to a minimum of 1280 
bus Clocks or 2560 processor clocks and a maxi- 
mum of 5000+ bus clocks to scan the cache, per- 
form the write backs, invalidate the cache and run 
two special cycles. After all modified lines are written 
back to memory, two special bus cycles, ‘First Flush 
ACK Cycle” and “Second Flush ACK Cycle,” are 
issued, in that order. These cycles differ from the 
special cycles issued after WBINVD only in that 
A2=1 (address line 2 = 1). SRESET, STPCLK#, 


INTR, NMI and SMI# are not recognized during a . 


flush write-back, while BOFF#, AHOLD and HOLD 
are recognized. 


FLUSH # may be asserted just for a single clock, or 
may be retained asserted, but should be de-assert- 
ed at or prior to the RDY# returned from the “First 
Flush ACK” special bus cycle. If asserted during 
INVD or WBINVD, FLUSH# will be recognized. If 
asserted simultaneously with SMI#, then SMI# is 
recognized after FLUSH # is serviced. 


FLUSH # may be driven at any time. If driven during 


SRESET, it must be held for one clock after 
SRESET is de-asserted to be recognized. 
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9.2.18.3 Hit/Miss to a Modified Line (HITM #) 


HITM# is a cache coherency protocol pin that is 
driven only in Enhanced Bus mode. When a snoop 
cycle is run by the system (with INV = “0” or INV = 
1”), HITM# indicates if the processor contains the 
snooped line in the M-state. Assertion of HITM # in- 
dicates that the line will be written back in total, un- 
less the processor is already in the process of doing 
a replacement write-back of the same line. 


HITM# will be valid on the bus two system clocks 
after EADS# is asserted on the bus. If asserted, 
HITM# remains asserted until the last RDY# or 
BRDY # of the snoop write-back cycle is returned. It 
will be de-asserted before the next following ADS #. 
(See Table 9-5.) 


Table 9-5. HITM # vs. Other Intel486™ Signals 


Pin | : ee 
Symbol Relation To This adi 


EADS#_ | HITM# is asserted-due to an EADS#- 
driven snoop, provided the snooped 
line is in the M-state in the cache. 


HLDA, HITM# does not float under these 
BOFF# | signals. 


ADS#, The beginning of a snoop write-back 
CACHE # | cycle is identified by the assertion of 
ADS#, CACHE #, and HITM#. 


9.2.18.4 Soft Reset (SRESET) 


When in Enhanced Bus mode, SRESET has the fol- 
lowing differences: SRESET, unlike RESET, does 
not cause AHOLD, A20M#, FLUSH#, UP#, and 
WB/WT# pins to be sampled (i.e., special test 


‘modes and on-chip cache configuration can not be 


accessed with SRESET.) 


On SRESET, the internal SMRAM base register re- 
tains its previous value and the processor does not 
flush, write-back or disable the internal cache. 
CRO.CD and CRO.NW retain previous values, CRO.4. 
is set to ‘1, and the remaining bits are cleared. Be- 


~ cause SRESET is treated as an interrupt, it is possi- 


ble to have a bus cycle while SRESET is asserted. A 
bus cycle could be due to an on-going instruction, 
emptying the write buffers of the processor, or 
snoop write-back cycles if there is a snoop hit to an 
M-state line while SRESET is asserted. | 
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For both Standard Bus ade and Enhanced 
Bus mode: 


e SMI# must be blocked during SRESET. It must 
also be blocked for a minimum of 2 clocks after 
SRESET is de-asserted. 


e SRESET must be blocked during SMI#. It must 
also be blocked for a minimum of 20 clocks after 
SMIACT # is de-asserted. 


9.2.18.5 invalidation Request (INV) 


INV is a cache coherency protocol pin that is used 
only in Enhanced Bus mode. It is sampled by the 
processor on EADS#-driven snoop cycles. It is 
necessary to assert this pin to simulate the stan- 


dard mode processor invalidate cycle on write- - 


through-only lines. INV also invalidates the write- 
back lines. However, if the snooped line is in the M- 
state, the line will be written back and then invalidat- 
ed. 


INV is sampled when EADS # is asserted. If INV is 
not asserted with EADS#, the snoop cycle will have 


no effect on a write-through-only line or a line allo- | 


cated as write-back, but not yet modified. If the line 
is write-back and modified, it will be written back to 
memory, but will not be de-allocated (invalidated) 
from the internal cache. The address of the snooped 
cache line is provided on the address bus. (See Ta- 
ble 9-6.) 


Table 9-6. INV vs. Other Intel486™ Signals 


Pin . a 
Symbol Relation To This Signal 
EADS# | EADS# determines when INV is 
sampled. 
A31-A4 | The address of the snooped cache line 
is provided on these pins. 


9.2.18.6 Write-Back/Write-Through (WB/WT #) 


WB/WT # enables Enhanced Bus mode (write-back 
cache). It also allows the system to define a cached 
line as write-through or write-back 


WB/WT # is sampled at the falling edge of RESET 
to determine if Enhanced Bus mode is enabled 
(WB/WT # must be driven for two clocks before and 
two clocks after RESET for recognition by the proc- 
essor). If sampled low or floated, the Write-Back 
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Enhanced IntelDX2 processor operates in the In- 
tel486 processor standard mode. For write-through 
only operation, i.e. standard mode, WB/WT # does 
not need to be connected. 


In Enhanced Bus mode, WB/WT# allows the sys- 
tem-hardware to force any allocated line to be treat- 
ed as write-through or write-back. As with cacheabil- 
ity, both the processor and the external system must 
agree that a line may be treated as write-back for 
the internal cache to be allocated as write-back. The 
default is always write-through. The processor’s in- 
dication of write-back vs. write-through is from 
the PWT pin, in which function and timing are the 
same as in the standard mode intel486 proces- 
sor. 


To define write-back or write-through configuration 
of a line, WB/WT # is sampled in the same clock as 
the first RDY # or BRDY # is returned during a line 
fill (allocation) cycle. (See Table 9-7.) © 


Table 9-7. WB/WT # vs. Other Intel486™ 
Processor Signals 


Pin : oa ee 
Symbol Relation to This Signal 
RDY #, WB/WT # is sampled with the first 
BRDY# | RDY# or BRDY# 


The combination of WB/WT # and 
PWT determine if the Write-Back 
Enhanced IntelDX2™ processor will 
treat the line as WB. 


PCD, The state of WB/WT # does not matter 


CACHE #, | if PCD, CACHE # or KEN # define the 
KEN # line to be non-cacheable. 


W/R# WB/WT # is significant only on read fill 
cycles. 


WB/WT # is sampled on the falling 
edge of RESET to define the cache 
configuration. 


9.2.18.7 Pseudo-Lock Output (PLOCK #) 


In the Enhanced bus mode, PLOCK # is always driv- 
en inactive. In this mode, a 64-bit data read (caused 
by an FP operand access or a segment descriptor 
read) is treated as a multiple cycle read request, 
which may be a burst or a non-burst access based 
on whether BRDY # or RDY # is returned by the sys- 
tem. Because only write-back cycles (caused by 
Snoop write-back or replacement write-back) are 
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burstable, a 64-bit write will be driven out as two 
non-burst bus cycles. BLAST# is asserted during 
both writes. Refer to section 10.2, “Bus Functional 
Description” for details on Pseudo-Locked bus cy- 
cles. | 


9.2.19 INTELDX4 PROCESSOR VOLTAGE 
DETECT SENSE OUTPUT (VOLDET) 


A voltage detect sense pin (VOLDET) has been add- 
ed to the IntelDX4 processor PGA package. This pin 
allows external system logic to distinguish between 
a 5V Intel486 DX or IntelDX2 processor and the 3.3V 
IntelDX4 processor. The pin passively indicates to 
external logic whether the installed PGA processor 
requires 5V (in the case of the Intel486 DX or In- 
telDX2 processor) or 3.3V (in the case of the In- 
telDX4 processor). Pin S4 has been defined as the 
~ VOLDET pin because this pin is defined as an INC 
pin on the Intel486 DX and IntelDX2 processor. This 
pin is only provided in PGA package. 


To utilize this feature, a weak, external pull-up resis- 
tor should be connected to the VOLDET pin. This 
pin samples high (logic 1) if the installed processor is 
a 5V Intel486 DX or IntelDX2 processor. This pin 
samples low (logic 0) if a IntelDX4 processor is in- 
stalled. Upon sampling the logic level of this pin, ex- 
ternal logic can then enable the proper Vcc level to 
the processor. In power sensitive applications, an 
active element is preferred for the pull-up device be- 
cause it could be disabled after sampling, thereby 
eliminating the resulting DC current path when the 
installed processor is the IntelDX4 processor. 


Figure 9-4 shows a logical representation of the 
Voltage Detect sense mechanism. 


This pin can remain not connected for those system 
designs that do not wish to utilize this voltage detect 
feature. 


9.2.20 BOUNDARY SCAN TEST SIGNALS 


The following boundary scan test signals are avail- 
able on all Intel486 processors except the Intel486 
SX processor in PGA packages. 
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Figure 9-4. Voltage Detect (VOLDET) Sense Pin 
Test Clock (TCK) 


TCK is an input to the Intel486 processor and pro- 
vides the clocking function required by the JTAG 
boundary scan feature. TCK is used to clock state 
information and data into and out of the component. 
State select information and data are clocked into 
the component on the rising edge of TCK on TMS 
and TDI, respectively. Data is clocked out of the part 
on the falling edge of TCK on TDO. : 


In addition to using TCK as a free running clock, it 
may be stopped in a low, O, state, indefinitely as | 
described in IEEE 1149.1. While TCK is stopped in 
the low state, the boundary scan latches retain their 
state. 


When boundary scan is not used, TCK should be 
tied high or left as a NC. (This is important during. 


- power up to avoid the possibility of glitches on the 


TCK which could prematurely initiate boundary scan 
operations.) TCK is supplied with an internal pull-up 
resistor. : 


TCK is a clock signal and is used as a reference for 
sampling other JTAG signals. On the rising edge of 
TCK, TMS and TDI are sampled. On the falling edge — 
of TCK, TDO is driven. | 


Test Mode Select (TMS) 


TMS is decoded by the JTAG TAP (Tap Access 
Port) to select the operation of the test logic, as de- 
scribed in section 11.5.4, ‘““Test Access Port Control- 
ler.” 
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To guarantee deterministic behavior of the TAP con- 


troller TMS is provided with an internal pull-up resis- 
tor. If boundary scan is not used, TMS may be tied 
high or left unconnected. TMS is sampled on the 
rising edge of TCK. TMS is used to select the inter- 
nal TAP states required to load boundary scan in- 


structions to data on TDI. For proper initialization of . 


the JTAG logic, TMS should be driven high, ‘‘1,” for 
at least four TCK cycles following the rising edge of 
RESET. 


Test Data Input (TDI) 


TDI is the serial input used to shift JTAG instructions 
and data into the component. The shifting of instruc- 
tions and data occurs during the SHIFT-IR and 
SHIFT-DR TAP controller states, respectively. 
These states are selected using the TMS signal as 
described in section 11.5.4, “Test Access Port Con- 
troller.” 


An internal pull-up resistor is provided on TDI to en- 
sure a known logic state if an open circuit occurs on 
the TDI path. Note that when “1” is continuously 
shifted into the instruction register, the BYPASS in- 
struction is selected. TDI is sampled on the rising 
edge of TCK, during the SHIFT-IR and the 
SHIFT-DR states. During all other TAP controller 
states, TDI is a “don’t care.” TDI is only sampled 
when TMS and TCK have been used to select the 
SHIFT-IR or SHIFT-DR states in the TAP controller. 
For proper initialization of JTAG logic, TDI should be 
driven high, “1,” for at least four TCK eyees follow- 
ing the rising edge of RESET. 


Test Data Output (TDO) 


TDO is the serial output used to shift JTAG instruc- 
tions and data out of the component. The shifting of 
instructions and data occurs during the SHIFT-IR 
and SHIFT-DR TAP controller states, respectively. 
These states are selected using the TMS signal as 
described in section 11.5.4, ‘“Test Access Port Con- 
troller’. When not in SHIFT-IR or SHIFT-DR state, 
TDO is driven to a high impedance state to allow 
connecting TDO of different devices in parallel. TDO 
is driven on the falling edge of TCK during the 
SHIFT-IR and SHIFT-DR TAP controller states. At 
all other times TDO is driven to the high impedance 
state. TDO is only driven when TMS and TCK have 
been used to select the SHIFT-IR or SHIFT-DR 
states in the TAP controller. 
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9.3 Interrupt and Non-Maskable 
Interrupt Interface 


The Intel486 processor provides four asynchronous 
interrupt inputs: INTR (interrupt request), NMI (non- 
maskable interrupt input), SMI# (system manage- 
ment interrupt) and STPCLK # (stop clock interrupt). 
This section describes the hardware interface be- 
tween the instruction execution unit and the pins. 
For a description of the algorithmic response to in- 
terrupts refer to section 4.7.6, “Interrupts”. For inter- 
rupt timings refer to section 10.2.10, ‘Interrupt Ac- 
knowledge’”’. 


9.3.1 INTERRUPT LOGIC 


The Intel486 processor contains a two-clock syn- 
chronizer on the interrupt line. An interrupt request 
will reach the internal instruction execution unit two 
clocks after the INTR pin is asserted, if proper setup 
is provided to the first stage of the synchronizer. 


There is no special logic in the interrupt path other 
than the synchronizer. The INTR signal is level sen- 
sitive and must remain active for the instruction exe- 
cution unit to recognize it. The interrupt will not be 
serviced by the Intel486 processor if the INTR signal 
does not remain active. 


The instruction execution unit will look at the state of 
the synchronized interrupt signal at specific clocks 
during the execution of instructions (if interrupts are 
enabled). These specific clocks are at instruction 
boundaries, or iteration boundaries in the case of 
string move instructions. Interrupts will only be ac- 
cepted at these boundaries. 


An interrupt must be presented to the Intel486 proc- 
essor INTR pin three clocks before the end of an 
instruction for the interrupt to be acknowledged. Pre- 
senting the interrupt 3 clocks before the end of an 
instruction allows the interrupt to pass through the 
two clock synchronizer leaving one clock to prevent 
the initiation of the next sequential instruction and to 
begin interrupt service. If the interrupt is not received 
in time to prevent the next instruction, it will be ac- 
cepted at the end of next instruction, assuming INTR 
is still held active. 


The longest latency between when an interrupt re- 
quest is presented on the INTR pin and when the 
interrupt service begins is: longest instruction used 
+ the two clocks for synchronization + one clock 
required to vector into the interrupt service micro- 
code. 
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9.3.2 NMI LOGIC 


The NMI pin has a synchronizer like that used on the 
INTR line. Other than the synchronizer, the NMI log- 
ic is different from that of the maskable interrupt. 


NMI is edge triggered as opposed to the level trig- 
gered INTR signal. The rising edge of the NMI signal 
is used to generate the interrupt request. The NMI 
input need not remain active until the interrupt is ac- 
tually serviced. The NMI pin only needs to remain 
active for a single clock if the required setup and 
hold times are met. NMI will operate properly if it is 
held active for an arbitrary number of clocks. 


The NMI input must be held inactive for at least four 
clocks after it is asserted to reset-the edge triggered 
logic. A subsequent NMI may not be generated if the 
NMI is not held inactive for at least four clocks after 
being asserted. 


The NMI input is internally masked whenever the 
NMI routine is entered. The NMi input will remain 
masked until an IRET (return from interrupt) instruc- 
tion is executed. Masking the NMI signal prevents 
‘recursive NMI calls. If another NMI occurs while the 
NMI is masked off, the pending NMI will be executed 
after the current NMI is done. Only one NMI can be 
pending while NMI is masked. 


9.3.3 SMI# LOGIC 


SMI# is edge triggered like NMI, but the interrupt 
request is generated on the falling-edge. SMI# is an 
asynchronous signal, but setup and hold times, tao 
and to;, must be met in order to guarantee recogni- 
tion on a specific clock. The SMI# input need not 
remain active until the interrupt is actually serviced. 
The SMI# input only needs to remain active for a 
single clock if the required setup and hold times are 
met. SMI # will also work correctly if it is held active 
for an arbitrary number of clocks. 


The SMI# input must be held inactive for at least 
four clocks after it is asserted to reset the edge trig- 
gered logic. A subsequent SMI # might not be recog- 
nized if the SMI# input is not held inactive for at 
least four clocks after-being asserted. 


SMI#, like NMI, is not affected by the IF bit in the 


EFLAGS register and is recognized on an instruction 
boundary. An SMI # will not break locked bus cycles. 
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The SMIi# has a higher priority than NMI and is not 
masked during an NMI. 


After the SMI# interrupt is recognized, the SMI # 
signal will be masked internally until the RSM 
instruction is executed and the interrupt service rou- 
tine is complete. Masking the SMI# prevents recur- 
sive SMI# calls. The SMI# input must be de-assert- 
ed for at least 4 clocks to reset the edge triggered 
logic. If another SMI# occurs while the SMI# is 
masked, the pending SMI# will be recognized and 
executed on the next instruction boundary after the 
current SMI# completes. This instruction boundary 
occurs before execution of the next instruction in the 
interrupted application code, resulting in back to 
back SMM handlers. Only one SMI# can be pend- 
ing while SMI # is masked. 


The SMI# signal is synchronized internally and 
should be asserted at least three (3) CLK periods 
prior to asserting the RDY# signal in order to guar- 
antee recognition on a specific instruction boundary. 
This is important for servicing an I/O trap with an 
SMI# handler. 


9.3.4 STPCLK# LOGIC 


STPCLK# is level triggered and active LOW. | 
STPCLK# is an asynchronous signal, but must re- 
main active until the processor issues the Stop 
Grant bus cycle. STPCLK# may be de-asserted at 
any time after the processor has issued the Stop 
Grant bus cycle. When the processor enters the 
Stop Grant state, the internal pull-up resistor of 
STPCLK#, CLKMUL (for IntelDX4 processor), and 
UP# are disabled so that the processor power con- 
sumption is reduced. The STPCLK# input must be 
driven high (not floated) in order to exit the Stop 
Grant state. STPCLK# must be de-asserted for a 
minimum of 5 clocks after RDY# or BRDY# is 
returned active for the Stop Grant Bus Cycle be- 
fore being asserted again. 


When the processor recognizes a STPCLK# inter- 
rupt, the processor will stop execution on the next 
instruction boundary (unless superseded by a higher 
priority interrupt), stop the pre-fetch unit, empty all 
internal pipelines and the write buffers, generate a 
Stop Grant bus cycle, and then stop the internal 
Clock. At this point the processor is in the Stop Grant 
state. 
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The processor cannot respond to a STPCLK# re- 
quest from an HLDA state because it cannot empty 
the write buffers and, therefore, cannot generate a 
Stop Grant cycle. 


The rising edge of STPCLK# will tell the processor 
that it can return to program execution at the instruc- 
tion following the interrupted instruction. 


Unlike the normal interrupts, INTR and NMI, the 
STPCLK# interrupt does not initiate acknowledge 
cycles or interrupt table reads. Among external inter- 
rupts, the STPCLK# order of priority is shown in 
section 4.7.6. 


9.4 Write Buffers 


The Intel486 processor contains four write buffers to 
enhance the performance of consecutive writes to 
memory. The buffers can be filled at a rate of one 
write per clock until all four buffers are filled. 


When all four buffers are empty and the bus is idle, a 
write request will propagate directly to the external 
bus bypassing the write buffers. If the bus is not 
available at the time the write is generated internally, 
the write will be placed in the write buffers and prop- 
agate to the bus as soon as the bus becomes avail- 
able. The write is stored in the on-chip cache imme- 
diately if the write is a cache hit. 


Writes will be driven onto the external bus in the 
same order in which they are received by the write 
buffers. Under certain conditions a memory read will 
go onto the external bus before the memory writes 
pending in the buffer even though the writes oc- 
curred earlier in the program execution. 


A memory read will only be reordered in front of all 
writes in the buffers under the following conditions: If 
all writes pending in the buffers are cache hits and 
the read is a cache miss. Under these conditions the 
Intel486 processor will not read from an external 
memory location that needs to be updated by one of 
the pending writes. 


Reordering of a read with the writes pending in the 
buffers can only occur once before all the buffers 
are emptied. Reordering read once only maintains 
cache consistency. Consider the following example: 
The processor writes to location X. Location X is in 
the internal cache, so it is updated there immediate- 
ly. However, the bus is busy so the write out to main 
memory is buffered (see Figure 9-5). At this point, 
any reads to location X would be cache hits and 
most up-to-date data would be read. 
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Intel486™ Write 
Processor Cache Buffer 


4 New | New New Data X 
Data X | New Data Y 
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Figure 9-5. Reordering of a Reads 
with Write Buffers 


The next instruction causes a read to location Y. 
Location Y is not in the cache (a cache miss). Be- 
cause the write in the write buffer is a cache hit, the 
read is reordered. When location Y is read, it is put 
into the cache. The possibility exists that location Y 
will replace location X in the cache. If this is true, | 
location X would no longer be cached (see Figure 
9-6). 


Intel486™ 
Processor Cache 
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Figure 9-6. Reordering of a Reads 
with Write Buffers 


Cache consistency has been maintained up to this 
point. If a subsequent read is to location X (now a 
cache miss) and it was reordered in front of the buff- 
ered write to location X, stale data would be read. 
This is why only 1 read is allowed to be reordered. 
Once a read is reordered, al! the writes in the write 
buffer are flagged as cache misses to ensure that no 
more reads are reordered. Because one of the con- 
ditions to reorder a read is that all writes in the write 
buffer must be cache hits, no more reordering is al- 
lowed until all of those flagged writes propagate to 
the bus. Similarly, if an invalidation cycle is run all 
entries in the write buffer are flagged as cache miss- 
es. 
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For multiple processor systems and/or systems us- 
ing DMA techniques, such as bus snooping, locked 
semaphores should be used to maintain cache con- 
sistency. 


9.4.1 WRITE BUFFERS AND I/O CYCLES 


. Input/Output (I/O) cycles must be handled in a dif- — 


ferent manner by the write buffers. 


|/O reads are never reordered in front of buffered 
memory writes. This insures that the Intel486 proc- 
essor will update all memory locations before read- 
ing status from an I/O device. 


The Intel486 processor never buffers single |/O 
writes. When processing an OUT instruction, internal 
execution stops until the I/O write actually com- 
pletes on the external bus. This allows time for the 
external system to drive an invalidate into the In- 


tel486 processor or to mask interrupts before the - 


processor progresses to the instruction following 
OUT. REP OUTS instructions will be buffered. 


A read cycle must be explicitly generated to a non- 
cacheable location in memory to guarantee that a 
read bus cycle is performed. This read will not be 
allowed to proceed to the bus until after the I/O 
write has completed because I/O writes are not 
buffered. The |/O device will have time to recover to 
accept another write during the read cycle. 


9.4.2 WRITE BUFFERS IMPLICATIONS ON 
LOCKED BUS CYCLES 


Locked bus cycles are used for read-modify-write 
accesses to memory. During a read-modify-write ac- 
- cess, a memory base variable is read, modified and 
then written back to the same memory location. It is 
important that no other bus cycles, generated by 
other bus masters or by the Intel486 processor itself, 
be allowed on the exiernal bus between the read 
and write portion of the locked sequence. 


During a locked read cycle, the Intel486 processor 
will always access external memory, it will never 
look for the location in the on-chip cache, but for 
_ ‘write cycles, data is written in the internal cache (if 
cache hit) and in the external memory. All data 
pending in the Intel486 processor’s write buffers will 
be written to memory before a locked cycle is al- 
lowed to proceed to the external bus. 


The Intel486 processor will assert the LOCK# pin 


after the write buffers are emptied during a locked 
bus cycle. With the LOCK# pin asserted, the 
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processor will read the data, operate on the data 
and place the results in a write buffer. The contents 


_ of the write buffer will then be written to external 


memory. LOCK # will become inactive after the write 
part of the locked cycle. 


9.5 Reset and Initialization 


The Intel486 processor has a built in self test (BIST) 
that can be run during reset. BIST is invoked if the 
AHOLD pin is asserted for 1 clock before and 1 
clock after RESET is de-asserted. RESET must be 
active for 15 clocks with or without BIST being en- 
abled. To ensure proper results, neither FLUSH 4 
nor SRESET can be asserted while BIST is execut- 
ing. Refer to section 11.0, ‘Processor Testability,’’ 
for information on Intel486 processor testability. 


The Intel486 processor registers have the values 
shown in Table 9-8 after RESET is performed. The 
EAX register contains information on the success or 
failure of the BIST if the self test is executed. The 
DX register always contains a component identifier 
at the conclusion of RESET. The upper byte of DX 
(DH) will contain 04 and the lower byte (DL) will con- 
tain the revision identifier. (See Table 9-9.) 


RESET forces the Intel486 processor to terminate 
all execution and local bus activity. No instruction or 
bus activity will occur as long as RESET is active. 


All entries in the cache are invalidated by RESET. 


9.5.1 FLOATING POINT REGISTER VALUES 


In addition to the register values listed above, In- 
tel486 DX, IntelDX2, and IntelDX4 processors have 
the floating point register values shown in Table 
9-10. 


The floating point registers are initialized as if the 
FINIT/FNINIT (initialize processor) instruction was 
executed if the BIST was performed. If the BIST is 
not executed, the floating point registers are un- 
changed. 


The Intel486 processor will start executing instruc- 
tions at location FFFFFFFOH after RESET. When 
the first Inter Segment Jump or Call is executed, ad- 
dress lines A20-—A31 will drop LOW for CS-relative © 
memory cycles, and the Intel486 processor will only 
execute instructions in the lower one Mbyte of physi- 
cal memory. This allows the system designer to use 
a ROM at the top of physical memory to initialize the 
system and take care of RESETs. 
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Table 9-8. Register Values after Reset 


Reaister initial Value Initial Value 
: (BIST) (No BIST) 
Zero (Pass) Undefined 


Undefined Undefined 


Revision ID Revision ID 
[—e8P | Undetines | Undefined 
Pee | oFFron | oFFFOn 


IDTR Base = 0, Base = 0, 


Limit = 3FFh Limit = 3FFh 


60000010h 60000010h 
00000000h 00000000h 


Table 9-9. Intel486™ Processor Revision ID 


Component ID ; Revision 
(DH) ID (DL) 

Intel486 SX Processor 
IntelSX2™ Processor} 042s | Sx 
Intel486™ DX 04 Ox 
Processor 1x 
intelDX2™ Processor 
Write-Back Enhanced 
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Table 9-10. Floating Point Values after Reset 


(BIST) (No BIST) 
on 
[rex | c0000000n | Unchanged | 


9.5.2 PIN STATE DURING RESET 


The Intel486 processor recognizes and can respond 
to HOLD, AHOLD, and BOFF # requests regardless 
of the state of RESET. Thus, even though the proc- 
essor is in reset, it can still float its bus in response 
to any of these requests. 


While in reset, the Intel486 processor bus is in the 
state shown in Figure 9-7 if the HOLD, AHOLD and 
BOFF # requests are inactive. The figure shows the 
bus state for the Intel486 processor. Note that the 
address (A31—A2, BE3#-BE0O#) and cycle defini- 
tion (M/IO#, D/C#, W/R#) pins are undefined 
from the time reset is asserted up to the start of the 
first bus cycle. All undefined pins (except FERR #) 
assume known values at the beginning of the first 
bus cycle. The first bus cycle is always a code fetch 
to address FFFFFFFOH. 
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Tx Tx Ix . ; Tx T T, T, T, 


CLK 
At teast 15 CLK periods ~217 CLK if no self-test 


RESET . LLL) (G) : A O ~2° CLK if self-test 
t2o 


tzo 


AHOLD LLL \\ 
FLUSH# ; AKO) 1]} 


(Syng 


inputs 


rrietst ANY Ns, 

“syn ENN | 

“payng (L/ \\ 
see TZ a NNN 
BREO | NNN TT) 


BEO-BE3#, PWT, PCD 


A3, A2, PLOCK UNDEFINED ANAK 
D/C#, W/R# 


ps Se ee 
LOCK# 
eee MESSRS )D)) : 
D31-D0, 
HLDA (s) 
SMIACT# [TT] 
WBIWT# * 7] @ ANN 
CACHE# ; 
_ HITM# ararnnann //, 
Notes: 


1. RESET is an asynchronous input. t2, must be met only to guarantee recognition on a specific clock edge. 

2a. When A20M+# is driven synchronously, it must be driven high (inactive) for the CLK edge prior to the falling edge of 
RESET to ensure proper operation. AZOM# setup and hold times must be met. 

2b. When A20Mé# is driven asynchronously, it should be driven low (active) for two CLKs prior to and two CLKs after the 
failing edge of RESET to ensure proper operation. 

3a. When FLUSH# is driven synchronously, it must be driven low (high) for the CLK edge prior to the falling edge of 
RESET to invoke the 3-state Output Test Mode. All outputs are guaranteed 3-stated within 10 CLKs of RESETR being 
deasserted. FLUSH# setup and hold times must be met. 


3b. When FLUSH# is driven asynchronously, it must be driven low (active) for two CLKs period prior to and two CLKs 


after the falling edge of RESET to invoke the 3-state Output Test Mode. All outputs are guaranteed 3-stated within 10 
CLKs of RESET being deasserted. 


A31-A4, MIO#, BLAST | UNDEFINED LL ; 


4. AHOLD shouid be driven high (active) for the CLK edge prior to the falling edge of RESET to invoke the Built-In-Seif- 


Test (BIST). AHOLD setup and hold times must be met. 


5. Hold is recognized normally during RESET. On power-up HLDA is indeterminate until RESET is ieconeed by the 
processor. 


6. 15 CLKs RESET pulse width for warm resets. Power-up resets require RESET to be asserted for at least 1 ms after 
Vcc and CLK are stable. 


7. WB/WT# should be driven high for at least one CLK before falling edge of RESET and at least one CLK after falling 
edge of RESET to enable the Enhanced Bus Mode. The Standard Bus Mode will be enabled if WB/WT# is sampled 
low or left floating at the falling edge of RESET. 


8. The system may sampie HITM# to detect the presence of the Enhanced Bus Mode. If HITM# is HIGH for one CLK 
after reset is inactive, the Enhanced Bus Mode is present. 


Figure 9-7. Pin States during RESET 
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9.5.2.1 Controlling the CLK Signal in the 
Processor during Power On 


The power on requirements of the Intel486 proces- 
sor with regards to allowable CLK input during the 
power on sequence have never been specified. 
Clocking the processor before Vcc has reached its 
normal operating level can cause unpredictable re- 
sults on Intel486 processors. While Intel will main- 
tain original clock and power specifications (none), 
this section reflects what Intel considers to be a 
good clock design. 


Intel strongly recommends that system designers 
ensure that a clock signal is not presented to the 
Intel486 processor until Vcc has stabilized at its nor- 
mal operating level. This design recommendation 
can easily be met by gating the clock signal with a 
POWERGOOD signal. The POWERGOOD signal 
should reflect the status of Vcc at the Intel486 proc- 
essor (which may be different from the power supply 
status in designs that provide power to the proces- 
sor through the use of a voltage regulator or con- 
verter). 


Most clock synthesizers and some clock oscillators 
contain on-board gating logic. If external gating logic 
is implemented, it should be done on the original 
clock signal output from the clock oscillator/synthe- 
sizer. Gating the clock to the processor indepen- 
dently of the clock to the rest of the motherboard will 
cause clock skew, which may violate processor or 
chipset timing requirements. If the clock signal to the 
motherboard is enabled with a POWERGOOD sig- 
nal, it is also important to verify that the motherboard 
logic does not require a clock input prior to this 
POWERGOOD signal. Some chipsets also gate the 
clock to the processor only after a POWERGOOD 
signal, which inherently meets the requirements of 
this design note. Designs should implement this de- 
sign note, so as to maintain maximum flexibility with 
all Intel486 processor steppings. 


9.5.2.2 FERR# Pin State During Reset for 
intel486 DX, IntelDX2, and IntelDX4 
Processors 


FERR# reflects the state of the ES (error summary 
status) bit in the floating point unit status word. The 
ES bit is initialized whenever the floating point unit 
state is initialized. The floating point unit’s status 
word register can be initialized by BIST or by execut- 
ing FINIT/FNINIT instruction. Thus, after reset and 
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before executing the first FINIT or FNINIT instruc- 
tion, the values of the FERR# and the numeric 
status word register bits 0-7 depends on whether or 
not BIST is performed. Table 9-11 shows the state 
of FERR# signal after reset and before the execu- 
tion of the FINIT/FNINIT instruction. 


Table 9-11. FERR # Pin State after Reset and 
before FP Instructions 


BIST __ | FPU Status Word 
Paper an Register Bits 0-7 


Inactive (High) Inactive (Low) 


Undefined Undefined 
(Low or High) (Low or High) 


After the first FINIT or FNINIT instruction, FERR # 
pin and the FPU status word register bits (07) will be 
inactive irrespective of the Built-In Self-Test (BIST). 


9.5.2.3 Power Down Mode (Upgrade Processor 
Support) 


The Power Down Mode on the Intel486 processor, 
when initiated by the upgrade processor, reduces 
the power consumption of the Intel486 processor 
(see Table 17-3 DC Specifications), as well as 
forces all of its output signals to be tri-stated. The 
UP# pin on the Intel486 processor is used for en- 
abling the Power Down Mode. 


Once the UP# pin is driven active by the upgrade 
processor upon power-up, the Intel486 processor’s 
bus is floated immediately. The Intel486 processor 
enters the Power Down Mode when the UP # pin is 
sampled asserted in the clock before the falling 
edge of RESET. The UP# pin has no effect on the 
power down status, except during this edge. The 
Intel486 processor then remains in the Power Down 
Mode until the next time the RESET signal is activat- 
ed. For warm resets, with the upgrade processor in 
the system, the Intel486 processor will remain tri- 
stated and re-enter the Power Down Mode once 
RESET is de-asserted. Similarly for power-up resets, 
if the upgrade processor is not taken out of the sys- 
tem, the Intel486 processor will tri-state its outputs 
upon sensing the UP # pin active and enter the Pow- 
er Down Mode after the falling edge of RESET. 
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9.6 Clock Control 


The Intel486 processor provides an interrupt mecha- 
nism (STPCLK#) that allows system hardware to 
control the power consumption of the processor by 
stopping the internal clock (output of the PLL) to the 
processor core in a controlled manner. This low- 
power state is called the Stop Grant state. In addi- 
tion, the STPCLK# interrupt allows the system to 
change the input frequency within the specified 
range or completely stop the CLK input frequency 
(input to the PLL). If the CLK input is completely 
stopped, the processor enters into the Stop Clock 
statethe lowest power state. . 


There are two targets for the low-power mode sup- 
ply current: 


¢ ~20-100 mA in the Stop Grant state (fast 
wake-up, frequency-and voltage-dependent), and 


e ~100-1000 A in the full Stop Clock state 
(slow wake-up, voltage-dependent). 


See section 9.6.4.2 and 9.6.4.3, for a detailed de- 
scription of the Stop Grant and Stop Clock states. 


9.6.1 STOP GRANT BUS CYCLE 


A special Stop Grant bus cycle will be driven to the 


bus after the processor recognizes the STPCLK# — 


interrupt. The definition of this bus cycle is the same 
as the HALT cycle definition for the standard 
Intel486 processor, with the exception that the Stop 
Grant bus cycle drives the value 0000 0010H on the 
address pins. The system hardware must acknowl- 
edge this cycle by returning RDY # or BRDY#. The 
processor will not enter the Stop Grant state un- 
til either RDY # or BRDY# has been returned. 


The Stop Grant bus cycle is defined as follows: 


M/IO# = 0, D/C# = 0, W/R# = 1, Address Bus 


= 0000 0010H (A, = 1), BES#-BEO# = 1011, | 


Data bus = undefined 


The latency between a STPCLK# request and the 
Stop Grant bus cycle is dependent on the current 
instruction, the amount of data in the processor write 
_ buffers,-and the system memory performance. (See 
Figure 9-8.) 
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9.6.2 PIN STATE DURING STOP GRANT 


During the Stop Grant state, most output and input/ 
output signals of the processor will maintain their 
previous condition (the level they held when entering 
the Stop Grant state). The data and data parity sig- 
nals will be tri-stated. In response to HOLD being 
driven active during the Stop Grant state (when the 
CLK input is running), the processor will generate 
HLDA and tri-state all output and input/output sig- 
nals that are tri-stated during the HOLD/HLDA state. 
After HOLD is de-asserted all signals will return to 
their prior state before the HOLD/HLDA sequence. 


In order to achieve the lowest possible power con- 
sumption during the Stop Grant state, the system 
designer must ensure the input signals with pull-up 
resistors are not driven LOW and the input signals 
with pull-down resistors are not driven HIGH. (See 
Table 3-11 in the Quick Pin Reference section for 
signals with internal pull-up and pull-down resistors.) 


All inputs, except the data bus pins must be driven to 
the power supply rails to ensure the lowest possible 
current consumption during Stop Grant or Stop 
Clock modes. For compatibility with future proces- 
sors, data pins should be driven low to achieve the 
lowest possible power consumption. Pull-down re- 
sistors/bus keepers are needed to minimize leakage 
current. 7 


If HOLD is asserted during the Stop Grant state, all 
pins that are normally floated during HLDA will still 
be floated by the processor. The floated pins should 
be driven to a low level. (See Table 9-12.) 


9.6.3 WRITE-BACK ENHANCED INTELDX2 PIN 
STATE DURING STOP GRANT SPECIFICS 


During the Stop Grant state, most output signals of 
the processor will maintain their previous condition, 
which is the level they held when entering the Stop 
Grant state. The data bus and data parity signals 
also maintain their previous state. In response to 
HOLD being driven active during the Stop Grant 


state when the CLK input is running, the Write-Back 


Enhanced intelDX2 processor will generate HLDA - 
and tri-state all output and input/output signals that 
are tri-stated during the HOLD/HLDA state. After 
HOLD is de-asserted all signals will return to the 
state they were in prior to the HOLD/HLDA se- 
quence. 


STPCLK# 


ADDR 
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a Ca) rant Bus Cycle — 


242202-79 


Figure 9-8. Stop Clock Protocol 


Table 9-12. Pin State during Stop 
Grant Bus State 


[Sisnat_~| Type | State 
A3-A2 Previous state 
eese-BE0* | 0 | Previousstate 
Win. D/G#,M/IO# | _O | Previous state 
rapse | | active 
ruocke,PLocK® | 0 | Inactive 
rarco «| O| Previous state 
PHBA —~S*d| OAs per HOLD 
TFERR® «| =O Previous state 
pop.pwr |__| Previous stato 
Pechke «tO _| Previous state 
Pewr.Pcd | _O_| Previous state 
Pswiacte | _O_| Previous state 


All inputs should be driven to the power supply rails 
to ensure the lowest possible current consumption 
during the Stop Grant or Stop Clock states. (See 
Table 9-13.) 


The Write-Back Enhanced IntelDX2 processor has 
bus keepers features. The data bus and data parity 
pins have bus keepers that maintain the previous 
state while in the Stop Grant state. External resistors 
are no longer required, which prevents excess cur- 
rent during the Stop Grant state. (If external resistors 
are present, they should be strong enough to ‘“‘flip”’ 
the bus hold circuitry and eliminate potential DC 
paths. Alternately, ‘“‘weak” resistors may also be 
added to prevent excessive current flow.) See sec- 
tion 17.3.3, “External Resistors Recommended to 
Minimize Leakage Currents,” for external register 

values. | 


In order to obtain the lowest possible power con- 
sumption during the Stop Grant state, system 
designers must ensure that the input signals with 
pull-up resistors are not driven LOW, and the input 
signals with pull-down resistors are not driven HIGH. 
(See the Table 3-11 for signals with internal pull-up 
and pull-down resisters). 
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Table 9-13. Write-Back Enhanced InteIDX2™ Pin 
State during =teP Grant Bus Cycle 


eel 
Gene] 0 | Prous ie 
/BES#-BEO# ——|_O _| Previousstate 
[Wini#, D/G#, M/iO# | 0 | Previous stato 
}ADS# =—=————s|_«O_| Inactive (high) 
[Lock, PLOGK® | _O_| Inactive (igh) 
feRea «|__| Previousstato 
[HDA ~SS*| | Asper HOLD 
BLAST # Previous state 
frenne | 0 | revousstate 
O Previous state 
rpw.P6D | 0 | Previous stato 
fcackee | _0 | nactvet righ) 
rite | 0 nactvet righ) 
[sMiacTe | © | Previous state 


NOTES: 

1. For the case of snoop cycles (via EADS#) during Stop 
Grant state, both HITM# and CACHE# may go active 
depending on the snoop hit in the internal cache. 

During Stop Grant state, AHOLD, HOLD, BOFF# and 
EADS # are serviced normally. 


9.6.4 CLOCK CONTROL STATE DIAGRAM 


The following state descriptions and diagram show 
the state transitions during a Stop Clock cycle for 
the Intel486 processor. (Refer to Figure 9-9 for a 
Stop Clock state diagram.) Refer to section 9.6.5 for 
Write-Back Enhanced IntelDX2 processor Clock 
State specifics. 


«9.6.4.1 Normal State 


This is the normal operating state of the processor. 
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The Stop Grant state provides a fast wake-up state 
that can be entered by simply asserting the external 
STPCLK# interrupt pin. Once the Stop Grant bus 
cycle has been placed on the bus, and either RDY # 
or BRDY # is returned, the processor is in this state 
(depending on the CLK input frequency). The proc- 


9.6.4.2 Stop Grant State 


“essor returns to the normal execution state 10-20 


clock periods after STPCLK# has been de-assert- 
ed. 


While in the Stop Grant state, the pull-up resistors 
on STPCLK#, CLKMUL (for the IntelDX4 processor) 
and UP# are disabled internally. The system must 
continue to drive these inputs to the state they were 
in immediately before the processor entered the 
Stop Grant state. For minimum processor power 
consumption, all other input pins should be driven to 
their inactive level while the processor is in the Stop 
Grant state. 


A RESET or SRESET will bring the processor from 
the Stop Grant state to the Normal state. The proc- 
essor will recognize the inputs required for cache 
invalidation’s (HOLD, AHOLD, BOFF # and EADS #) 
as explained later in this section. The processor will 
not recognize any other inputs while in the Stop 
Grant state. Input signals to the processor will not be 
recognized until 1 CLK after STPCLK# is de-assert- 
ed (see Figure 9-10). 


While in the Stop Grant state, the processor will not 
recognize transitions on the interrupt signals (SMI #, 
NMI, and INTR). Driving an active edge on either 
SMI# or NMI will not guarantee recognition and 
service of the interrupt request following exit from 


- the Stop Grant state. However, if one of the interrupt 


signals (SMI#, NMI, or INTR) is driven active while 
the processor is in the Stop Grant state, and held 
active for at least one CLK after STPCLK # is de-as- 
serted, the corresponding interrupt will be serviced. 
The Intel486 processor requires INTR to be held ac- 
tive until the processor issues an interrupt acknowl- 
edge cycle in order to guarantee recognition. (See 
Figure 9-10). 


When the processor is in the Stop Grant state, the 
system is allowed to stop or change the CLK input. 
When the CLK input to the processor is stopped (or 
changed), the Intel486 processor requires the CLK 
input to be held at a constant frequency for a mini- 
mum of 1 ms before de-asserting STPCLK#. This 
1-ms time period is necessary so that the PLL can 
stabilize, and it must be met before the plocesser 
will return to the Stop Grant state. 


4 Auto HALT 
Power Down State 


CLK Running 
20mA - 100mA 


HALT asserted and 
HALT Bus cycle 
generated 
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1 Normal State 


Normal Execution 


INTR, NMI, SMi#, 


_ RESET, SRESET 


STPCLK# de-asserted and 
HALT Bus cycle generated 


| STPCLK# asserted and 


STPCLK# asserted and 
Stop Grant Bus cycle 
generated 


Stop Grant Bus cycle generated 


Stop Clock Snoop State 


One Clock Powerup 
Perform Cache Invalidation 


2 Stop Grant State 


Clock Running 
Icc - 20mA - 100mA 


Start CLK 
+ PLL Startup Latency 


Stop CLK 


3 Stop Clock State 


Internal Powerdown 


CLK Changed * 
icc - 100uUA - 100UA 
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* The system can change the input frequency within the specified range or completely stop the CLK input frequency 


(input to PLL) 


Figure 9-9. Intel486™ Processor Family Stop Clock State Machine 


The processor will generate a Stop Grant bus cycle 
only when entering that state from the Normal or the 
Auto HALT Power Down state. When the processor 
enters the Stop Grant state from the Stop Clock 
state or the Stop Clock Snoop state, the processor 
will not generate a Stop Grant bus cycle. 


9.6.4.3 Stop Clock State 


Stop Clock state is entered from the Stop Grant 
state by stopping the CLK input (either logic nigh or 
logic low). None of the processor input signals 


should change state while the CLK input is stopped. 
Any transition on an input signal (with the exception 
of INTR, NMI and SMI#) before the processor has 
returned to the Stop Grant state will result in unpre- 
dictable behavior. If INTR is driven active while the 
CLK input is stopped, and held active until the proc- 
essor issues an interrupt acknowledge bus cycle, it 
will be serviced in the normal manner. The system 
design must ensure the processor is in the correct 
state prior to asserting cache invalidation or interrupt 
signals to the processor. 
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CLK 
Sampled 
STPCLK# 
i i 
SMI# 


* 


A.Earliest time at which NMI or SMI# will be recognized. 
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_ Figure 9-10. Recognition of npor when Exiting Stop Grant State 


The processor will return to the Stop Grant state 
after the CLK input has been running at a constant 
frequency for a period of time equal to the PLL start- 
up latency (see section 9.6.4.2). The CLK input can 


be restarted to any frequency between the minimum — 


and maximum frequency listed in the AC timing 
specifications. 


9.6.4.4 Auto HALT Power Down State 


The execution of a HALT instruction will also cause 
the processor to automatically enter the Auto HALT 
Power Down state. The processor will issue a nor- 
mal HALT bus cycle before entering this state. The 
processor will transition to the Normal state on the 
occurrence of INTR, NMI, SMI#, RESET, or 
SRESET. 


The system can generate a STPCLK# while the 
processor is in the Auto HALT Power Down state. 
The processor will generate a Stop Grant bus cycle 
when it enters the Stop Grant state from the HALT 
state. 


When the system de-asserts the STPCLK# inter- 
rupt, the processor will return execution to the HALT 
state. The processor will generate a new HALT bus 
__ cycle when it re-enters the HALT state from the Stop 
Grant state. 
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9. 6. 4.5 Stop Clock Snoop State (Cache 
Invalidations) 


When the processor is in the Stop Grant state or the | 


Auto HALT Power Down state, the processor will 
recognize HOLD, AHOLD, BOFF# and EADS# for 
cache invalidation. When the system asserts HOLD, 
AHOLD, or BOFF #, the processor will float the bus 
accordingly. When the system then asserts EADS #, 
the processor will transparently enter the Stop 
Clock Snoop state and will power up for 1 full core 
clock in order to perform the required cache snoop 
cycle. It will then re-freeze the clock to the proces- 
sor core and return to the previous state. The proc-. 
essor does not generate a bus cycle when it returns 
to the previous state. 


A FLUSH # event during the Stop Grant state or the 
Auto HALT Power Down state will be latched and 
acted upon by asserting the internal FLUSH # signal 
for one clock upon re-entering the Normal state. 
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9.6.4.6 Auto Idle Power Down State 


When the chip is known to be truly idle and waiting 
fora RDY# or BRDY# from a memory or !/O bus 
cycle read, the Intel486 processor will reduce its 
core clock rate to be equal to the external CLK fre- 
quency without affecting performance. When any 
RDY # or BRDY # is asserted, the part will return to 
clocking the core at the specified multiplier of the 
external CLK frequency. This functionality is trans- 
parent to software and external hardware. 


9.6.5 WRITE-BACK ENHANCED INTELDX2 
PROCESSOR CLOCK CONTROL STATE 
DIAGRAM 


Figure 9-11 (state diagram) shows the state tran- 
sitions during Stop Clock for the Write-Back En- 
hanced IntelDX2 processor. ; 


4 Auto HALT 
Power Down State 
CLK Running 
Halt Bus Cycle Generated 
Icc approximately 100uA 


STPCLK# de-asserted 


STPCLK# asserted 


5 Stop Clock Snoop State 


Clock Powerup 
Write Through: Cache invalidation 
Write back: Write, Invalidation 


FLUSH# 


6 Auto HALT Power 
Down Flush State. 


Write through: Cache Invalidation 
Write back: Writeback, Invalidation, 
2 flush Ack. cycles 


FLUSH# 
Done 


INTR, NMI, SMi#, 
RESET, SRESET 
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NOTE: | 
The Stop Clock State Machine in the Stan- 
dard bus configuration is identical to that of 
other Intel486 processors. (See _ section 
9.6.4, ’Clock Control State Diagram’’.) 


Normal StateThis is the normal operating state of 
the processor. When the processor is executing pro- 
gram/instruction and the STPCLK# pin is not as- 
serted, the processor is said to be in it’s normal 
state. | 


1 Normal State 


Normal Execution 
All Clocks Running 


STPCLK# 
asserted STPCLK# asserted 
and Stop Grant Bus 


cycle generated 


2 Stop Grant State 


Clock Running 
icc approximately 20 - 50mA 


Start CLK 
+ PLL Startup Latency 


3 Stop Clock State 


Internal Powerdown 
CLK Stopped 
icc approximately 100uUA 
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Figure 9-11. Write-Back Enhanced IntelDX2™ Processor Stop Clock State 
Machine (Enhanced Bus Configuration) 
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9.6.5.2 Stop Grant State 


For minimum processor power consumption, all oth- 
_ er input pins should be driven to their inactive level 
while the processor is in the Stop Grant state ex- 
cepting data bus, data parity, WB/WT# and INV 
pins. WB/WT # should be driven low and INV should 
be driven high. 


In both the Standard mode and Enhanced mode 
states, the following conditions exist: 


e A RESET, SRESET or de-assertion of STPCLK# 
will bring the processor from the Stop Grant state 
to the Normal state. 


_@ While in the Stop Grant state, the processor will 
not recognize transitions on the interrupt signals 
(SMI#, NMI, and INTR). This means SMI#, NMI, 
INTR are not Stop Break events. The external 
logic should de-assert STPCLK# before issuing 
interrupts or if an interrupt is asserted it should be 
kept asserted for at least 1 clock after STPCLK# 
is removed. (Note that the Write-Back Enhanced 
IntelDX2 processor requires that INTR must be 
held active until the processor issues an interrupt 
acknowledge cycle i in order to guarantee recogni- 
tion). 


e FLUSH# is not a Stop Break event. But if 
FLUSH # is asserted during the Stop Grant state, 


it is latched by the Write-Back Enhanced 


IntelDX2 processor and serviced later when 
STPCLK # is deasserted. 


e The processor will latch and respond to the in- 
puts BOFF#, EADS#, AHOLD, and HOLD. The 
processor will not recognize any other inputs 
while in the Stop Grant state except FLUSH #. 
Other input signals to the processor will not be 
recognized until the CLK following the CLK in 

_ which STPCLK# is de-asserted. (See Figure 9- 
11.) 


e The processor will generate a Stop Grant bus cy- 
~ cle only when entering that state from the Normal 
or the Auto HALT Power Down state. The Stop 
Grant bus cycle is not generated when the proc- 
essor enters the Stop Grant state from the Stop 
_Clock state or the Stop Clock Snoop state. 


e The processor will not enter the Stop Grant state 
until all the pending writes are completed, all 
pending interrupts are serviced and the proce: 
sor is idle. 
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Stop Clock StateStop Clock state is the lowest pow- 
er consumption mode in the processor, because it 
allows removal of the external clock. It also has the 
longest latency for returning to normal state. The 
Stop Clock state is entered from the Stop Grant 
state by stopping the CLK input. In the Stop Clock 
state, total processor power consumption drops to 
100 yA, which is approximately 200-250 times low- 
er than the Stop Grant state. None of the processor 
input signals should change state while the CLK in- 
put is stopped. Any transition on an input signal be- 
fore the processor has returned to the Stop Grant 


9.6.5.3 Stop Clock State 


state will result in unpredictable behavior. If INTR is 


driven active, it must be held active until the proces- 
sor issues an interrupt acknowledge cycle. 


In the Stop Clock state, the processor is dormant. It 
does not respond to any transitions on any of the 
input pins including snoops, flushes and interrupts. It 
is recommended that this mode only be entered if 
the processor cache is coherent with main memory 
and the processor is not processing any interrupts. If 
this mode is entered with a dirty cache, no alternate 
master cycles can be allowed while the processor is 
in the Stop Clock state. 


The processor will return to the Stop Grant state 
after the CLK input has been running at a constant 
frequency for a period of time equal to the PLL start- 
up latency. The CLK input can be restarted to any 
frequency between the minimum and maximum fre- 
quency listed in the AC timing specifications. 


In Enhanced Bus Mode 


If the processor is taken into ‘the Stop Clock state’ 
with a dirty cache, alternate bus master cycles are 
not allowed while the processor remains in the Stop 
Clock state. In order to take the processor into the 
Stop Clock state with a clean cache, the cache must 
be flushed. During the time the cache is being 
flushed, the system must block interrupts to the 
processor. With all interrupts other than STPCLK# 
blocked, the processor does not write into the cache 
during the time from the completion of the flush and 
time it enters the Stop Grant state. This is necessary 
for the cache to be coherent. To ensure this, the 
system should drive KEN# inactive from the time 


- the flush starts until the Stop Grant cycle is issued. 


The system can then put the processor in the Stop 
Clock state by stopping the CLOCK. : 


intel. 


If the processor is already in the Stop Grant state 
and entering the Stop Clock state is desired, the sys- 
tem must de-assert STPCLK# before flushing the 
cache in order to ensure the cache coherency. The 
5-clock de-assertion specification for STPCLK# 
must also be met before the above sequence can 
occur. . 


9.6.5.4 Auto HALT Power Down State 


Upon execution of a HALT instruction, the processor 
will automatically enter a low power state, called the 
Auto HALT Power Down state. The processor will 
issue a normal HALT bus cycle when entering this 
state. Because interrupts are HALT break events, 
the processor will transition to the Normal state on 
the occurrence of INTR, NMI, SMI# or RESET 
(SRESET is also a HALT break event). If there is a 
FLUSH# while the processor is in this state, the 
FLUSH # will be serviced by transitioning to the Stop 
Clock Fiush state. After the FLUSH# is completed, 
the processor returns back to the Auto HALT Power 
Down state. 


The system can generate a STPCLK# while the 
processor is in the Auto HALT Power Down state. 
The processor will then generate a Stop Grant bus 
cycle and enter the Stop Grant state from the Auto 
HALT Power Down state. When the system de-as- 
serts the STPCLK# interrupt, the processor will re- 
turn to the Auto HALT Power Down state. The proc- 
essor will not generate a new HALT bus cycle when 
it re-enters the Auto HALT Power Down state from 
the Stop Grant state. 


9.6.6 STOP CLOCK SNOOP STATE (CACHE 
INVALIDATIONS) 


When the processor is in the Stop Grant state or the 
Auto HALT Power Down state, the processor will 
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recognize HOLD, AHOLD, BOFF #, and EADS # for 
cache invalidation. When the system asserts HOLD, 
AHOLD, or BOFF #, the processor will float the bus 
accordingly. When the system asserts EADS#, the 
processor will transparently enter the Stop Clock 
Snoop state and will power up in order to perform 
the required cache snoop cycle and write-back cy- 
cles. It will then refreeze the CLK to the processor 
core and return to the previous state (i.e., either the 
Stop Grant state or the Auto HALT Power Down 
state). The processor does not generate a bus cycle 
when it returns to the previous state. 


9.6.6.1 Auto HALT Power Down Flush State 
(Cache Flush) for the Write-Back 
Enhanced IntelDX2 


lf the processor is in either Standard or Enhanced 
mode and a FLUSH# event occurs during Auto 
HALT Power Down state, the processor will tran- 
sition to the Auto HALT Power Down Flush state. If 
the on-chip cache is configured as a write-back 
cache, the CLK to the processor core is turned on 
until all the dirty lines are written back, the cache is 
invalidated, and the two flush acknowledge cycles 
are completed. If the on-chip cache is configured as 
a write-through cache, the CLK to the processor 
core is turned on until the cache is invalidated. The 
processor then refreezes the CLK and returns to the 
previous state (i.e., the Auto HALT Power Down 
state). Auto HALT Power Down Flush state is en- 
tered only from the Auto HALT Power Down state 
and not from the Stop Grant state. 


; 


9.6.7 SUPPLY CURRENT MODEL FOR STOP 
CLOCK MODES AND TRANSITIONS 


Figures 9-12 and 9-13 illustrate the effect of different 
Stop Clock state transitions on the supply current of 
the Intel486 processor. 
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STPCLK# asserted, CLK not changed 


a 
’  STPCLK# deasserted, CLK not changed 
Short (10CLK) latency 


CLK started, requires PLL startup 
latency to re-enter Stop Grant state 
eS Sa > 


16 MHz 
* CLK started, requires PLL startup e 
latency to re-enter Stop Grant state 


16MHz 20MHz 25MHz 


STPCLK# asserted, CLK not changed 
enteral 
STPCLK# deasserted, CLK not changed 


Short (10CLK) latency 


CLK started, requires PLL startup 
latency to re-enter Stop Grant state 
& 


CLK started, requires PLL startup 
latency to re-enter Stop Grant state 


8 MHz 25 MHz 33 MHz 
@ Normal State 
gm Stop Grant State 
O Stop Clock State 
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Figure 9-13. Supply Current Model for Stop Clock Modes and Transitions for the IntelDX4™ Processor 
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10.0 BUS OPERATION 


All Intel486 processors operate in Standard Bus 
(write-through) mode. However, when the internal 
cache of the Write-Back Enhanced IntelDX2™ proc- 
essor is configured in write-back mode, the proces- 
sor bus operates in the Enhanced Bus mode, which 
is described in section 10.3. When the internal 
cache of the Write-Back Enhanced IntelDX2 
processor is configured in write-through mode, 
the processor bus operates in Standard Bus 
mode, identical to the other Intel486 processors 
in Standard Bus mode. 


10.1 Data Transfer Mechanism 


All data transfers occur as a result of one or more 
bus cycles. Logical data operands of byte, word and 
doubleword lengths may be transferred without re- 
strictions on physical address alignment. Data may 
be accessed at any byte boundary but two or three 
cycles may be required for unaligned data transfers. 
(See section 10.1.2, “Dynamic Data Bus Sizing,” 
and section 10.1.5, ‘“Operand Alignment.’’) 


The Intel486 processor address signals are split into 
two components. High-order address bits are provid- 
ed by the address lines, A2-A31. The byte enables, 
BEO #-BE3#, form the low-order address and pro- 
vide linear selects for the four bytes of the 32-bit 
address bus. 


The byte enable outputs are asserted when their as- 
sociated data bus bytes are involved with the pres- 


ent bus cycle, as listed in Table 10-1. Byte enable 


patterns that have a negated byte enable 
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separating two or three asserted byte enables will 
never occur (see Table 10-5). All other byte enable 
patterns are possible. | 


Table 10-1. Byte Enables and Associated Data 
and Operand Bytes 


Byte 
Enable 
Signal 

DO-D7 (byte 0-least significant) 
D8-D15 _—(byte 1) 
D16-D23 (byte 2) 


D24-D31 (byte 3-most significant) 


Associated Data Bus Signals 


Address bits AO and A1 of the physical operand’s 
base address can be created when necessary. Use 
of the byte enables to create AO and A1 is shown in 
Table 10-2. The byte enables can also be decoded 
to generate BLE# (byte low enable) and BHE# 
(byte high enable). These signals are needed to ad- 
dress 16-bit memory systems. (See section 10.1.3, 
“Interfacing with 8-, 16-, and 32-Bit Memories.’’) 


10.1.1 MEMORY AND I/O SPACES 


Bus cycles may access physical memory space or |/ 
O space. Peripheral devices in the system may ei- 
ther be memory-mapped, or |/O-mapped, or both. 
Physical memory addresses range from OO0000000H 
to FFFFFFFFH (4 gigabytes). |!/O addresses range 
from OOOQ0000H to OOOOFFFFH (64 Kbytes) for pro- 
grammed I/O. (See Figure 10-1.) 


Table 10-2. Generating AO-A31 from BEO# -BE3# and A2-A31 


Intel486™ Processor Address Signals 


A31 er A2 


Physical Base Address 
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FFFFFFFFH .. 


Physical | 
| Memory 


4 Gbyte | 


Accessible 
Programmed 
1/0 Space 


Physical Memory 1/0 Space 
Space 
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Figure 10-1. Physical Memory and I/O Spaces 


10.1.1.1 Memory and |/O Space Organization 


The Intel486 processor datapath to memory and in- 
put/output (I/O) spaces can be 32-, 16- or 8-bits 
wide. The byte enable signals, BEO#-BE3#, allow 


byte granularity when addressing any memory or I/O 


structure whether 8, 16 or 32 bits wide. 


The Intel486 processor includes bus control pins, 
BS16# and BS8#, which allow direct connection to 
16- and 8-bit memories and I/O devices. Cycles to 
32-, 16- and 8-bit may occur in any sequence, since 
the BS8# and BS16# signals are une during 
each bus cycle. 


32-bit wide memory and |/O spaces are organized 
as arrays of physical 4-byte words. Each memory or 
1/O 4-byte word has four individually addressable 
bytes at consecutive byte addresses (see Figure 10- 
2). The lowest addressed byte is associated with 
data signals DO-D7; the highest-addressed byte 
with D24-—D31. Physical 4-byte words begin at ad- 
dresses divisible by four. 
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32-Bit Wide Organization 


. mT | FFFFFFFCH 
00000003H 00000000H 


ey Se ee ee | 
BE3#¢ BE2# BE1#. BEO# 
-16-Bit Wide Organization 
FFFFFFFFH FFFFFFFEH 
00000001H 00000000H 
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Figure 10-2. Physical Memory and 
|/O Space Organization 


16-bit memories are organized as arrays of physical 
2-byte words. Physical 2-byte words begin at ad- 
dresses divisible by two. The byte enables BEO # — 
BE3#, must be decoded to A1, BLE# and BHE# to 
address 16-bit memories. (See section 10.1.3, ‘“In- 
terfacing with 8-, 16- and 32-Bit Memories.”’) 


To address 8-bit memories, the two low order ad- 
dress bits AO and A1, must be decoded from BEO # — 
BE3#. The same logic can be used for 8- and 16-bit 
memories, because the decoding logic for: BLE # 
and AO are the same. (See section 10.1.3, “Interfac- 
ing with 8-, 16-, and 32-Bit Memories.’’) 


10.1.2 DYNAMIC DATA BUS SIZING | 


Dynamic data bus sizing is a feature allowing proc- 
essor connection to 32-, 16- or 8-bit buses for mem- 


- ory or I/O. The Intel486 processor may connect to 


all three bus sizes. Transfers to or from 32-, 16- or 8- 
bit devices are supported by dynamically determin- 
ing the bus width during each bus cycle. Address 
decoding circuitry may assert BS16# for 16-bit de-. 
vices, or BS8# for 8-bit devices during each bus 
cycle. BS8# and BS16# must be negated when ad- 
dressing 32-bit devices. An 8-bit bus width is select- 
ed if both BS16# and BS8# are asserted. 


intel. 


BS16# and BS8# force the Intel486 processor to 
run additional bus cycles to complete requests larg- 


er than 16- or 8 bits. A 32-bit transfer will be convert- — 


ed into two 16-bit transfers (or 3 transfers if the data 
is misaligned) when BS16# is asserted. Asserting 
BS8# will convert a 32-bit transfer into four 8-bit 
transfers. 


Extra cycles forced by BS16# or BS8# should be 
viewed as independent bus cycles. BS16# or BS8# 
must be driven active during each of the extra cycles 
unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. 


The Intel486 processor will drive the byte enables 
appropriately during extra cycles forced by BS8# 
and BS16#. A2-A31 will not change if accesses are 
to a 32-bit aligned area. Table 10-3 shows the set of 
byte enables that will be generated on the next cycle 
for each of the valid possibilities of the byte enables 
on the current cycle. 
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The dynamic bus sizing feature of the Intel486 proc- 
essor is significantly different than that of the In- 
tel386 processor. Unlike the Intel386 processor, the 
Intel486 processor requires that data bytes be driv- 
en on the addressed data pins. The simplest exam- 
ple of this function is a 32-bit aligned, BS16# read. 
When the Intel486 processor reads the two high or- 
der bytes, they must be driven on the data bus pins 
D16-—D31. The Intel486 processor expects the two 
low order bytes on DO-D15. The Intel386 processor 
expects both the high and low order bytes on DO- 
D15. The Intel886 processor always reads or writes 
data on the lower 16 bits of the data bus when 
BS16# is asserted. | 


The external system must contain buffers to enable 
the Intel486 processor to read and write data on the 
appropriate data bus pins. Table 10-4 shows the 
data bus lines to which the Intel486 processor ex- 
pects data to be returned for each valid combination 
of byte enables and bus sizing options. 


Table 10-3. Next Byte Enable Values for BSn # Cycles 


Next with BS8 # Next with BS 16 # 
BE3# BE24 BEi# BEO# | BE3# BE2#4 BE1# =BEO# | BE3# BE2# BEi# BEO# 
n n n 
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n 
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‘‘n’”? means that another bus cycle will not be required to satisfy the request. 


Table 10-4. Data Pins Read with Different Bus Sizes 
BE1# 


BE2+# 


BEo# | w/oBss+/Bsic# | wese+ | wBS16# 


D7-DO 
D15-D0 
D23-D0 
D31-DO0O 
D15-D8 
D23-D8 
D31-D8 
D23-D16 
D31-D16 
D31-D24 


— 
o>) 


D7-DO 
D7-DO0O 
D7-DO 
D7-D0 
D15—D8 
D15-—D8 
D15-D8 
D23-D16 
D23-D16 
D31-D24 


D7-D0 
D15-—D0 
D15-—D0 
D15-—D0 
D15-—D8 
D15-D8 
D15-D8 
D23-D16 
D31-D16 
D31-D24 
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Valid data will only be driven onto data bus pins cor- 


responding to active byte enables during write cy- | 


_ Cles. Other pins in the data bus will be driven but 
they will not contain valid data. Unlike the Intel386 
processor, the Intel486 processor will not duplicate 
write data onto parts of the data bus for which the 
corresponding byte enable is negated. 


10.1.3 INTERFACING WITH 8-, 16- AND 32-BIT 
MEMORIES : 


In 32-bit physical memories, such as the one shown 
in Figure 10-3, each 4-byte word begins at a byte 
address that is a multiple of four. A2—A31 are used 
as a 4-byte word select. BEO # -BE3# select individ- 
ual bytes within the 4-byte word. BS8# and BS16# 
- are negated for all bus cycles involving the 32-bit 
~ array. 


16- and 8-bit memories require external byte swap- 
ping logic for routing data to the appropriate data 
lines and logic for generating BHE#, BLE# and A1. 
In systems where mixed memory widths are used, 
extra address decoding logic is necessary to assert 
BS16# or BS8#. 


Intel486™ 
Processor 

ass# = Bs16# 
Address 
Decode 


Byte 
Select Logic 


2-176 


Address Bus (A31—A2 BE0#-BE3#) 


Address Bus 
(BEO#-BE3#, A2-A31 


"HIGH" “HIGH" 
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Figure 10-3. Intel486™ Processor 
with 32-Bit Memory | 


Figure 10-4 shows the Intel486 processor address 
bus interface to 32-, 16- and 8-bit memories. To ad- 
dress 16-bit memories the byte enables must be de- 
coded to produce A1, BHE# and BLE# (AO). For 8- » 
bit wide memories the byte enables must be decod- 
ed to produce AO and A1. The same byte select 
logic can be used in 16- and 8-bit systems, because 
BLE # is exactly the same as AO. (See Table 10-5.) 


BEO#-BE3# can be decoded as shown in Table 
10-5 to generate A1, BHE# and BLE#. The byte 
select logic necessary to generate BHE # and BLE # 


is shown in Figure 10-5. 
32-Bit Memory 


BHE#, BLE#, A1 16-Bit Memory 


AO (BLE#), At 


8-Bit Memory 
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Figure 10-4. Addressing 16- and 8-Bit Memories 
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Table 10-5. Generating A1, BHE# and BLE # for Addressing 16-Bit Devices 


Intel486™ Processor 8-, 16-Bit Bus Signals 
Peese | Be2e | Bete | BEO* | At | BHEY | BLE® (AO) 
H* H* H* H* x—no active bytes 


x-—not contiguous bytes 


x—not contiguous bytes 
x—not contiguous bytes 
x—not contiguous bytes 


x-not contiguous bytes 


CO a ee Te ee a oe 
m— mK mx «K KX Om Ooo K Lore xX 
mm Tx« mK KK Det Kerrier x 


BLE # asserted when DO-D7 of 16-bit bus is active. 
BHE # asserted when D8-D15 of 16-bit bus is active. 
A1 low for all even words; A1 high for all odd words. 


= don’t care H = high voltage level L = low voltage level 
* = anon-occurring pattern of Byte Enables; either none are asserted or the pattern has Byte Enables 
asserted for non-contiguous bytes 


Combinations of BEO#-BE3# that never occur are 
those in which two or three asserted byte enables 
are separated by one or more negated byte enables. 
These combinations are ‘“‘don’t care” conditions in 
the decoder. A decoder can use the non-occurring 
geiae BEO # -BE3# combinations to its best advantage. 


BHE# 
se | jo—->o2* Figure 10-6 shows an Intel486 processor data bus 


242902-M?2 interface to 16- and 8-bit wide memories. External 
byte swapping logic is needed on the data lines so 
that data is supplied to and received from the In- 
tel486 processor on the correct data pins (see Table 
10-4). 


Al 


242202-M1 


BLE# (OR AO) 


242202-89 


Figure 10-5. Logic to Generate A1, BHE # 
and BLE # for 16-Bit Buses 
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Figure 10-6. Data Bus Interface to 16- and 8-Bit Memories 


10.1.4 DYNAMIC BUS SIZING DURING CACHE 
LINE FILLS 


BS8# and BS16# can be driven during cache line 
fills. The Intel486 processor will generate enough 8- 
or 16-bit cycles to fill the cache line. This can be up 
to sixteen 8-bit cycles. 


The external system should assume that all byte en- 
ables are active for the first cycle of a cache line fill. 
The Intel486 processor will generate proper byte en- 
ables for subsequent cycles in the line fill. Table 10- 
6 shows the appropriate AO (BLE#), A1 and BHE # 
for the various combinations of the Intel486 proces- 
sor byte enables on both the first and subsequent 
cycles of the cache line fill. The ‘**”’ marks all combi- 
nations of byte enables that will be generated by the 
Intel486 processor during a cache line fill. 


10.1.5 OPERAND ALIGNMENT 


Physical 4-byte words begin at addresses that are 
multiples of four. It is possible to transfer a logical 
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operand that spans more than one physical 4-byte 
word of memory or I/O at the expense of extra cy- 
cles. Examples are 4-byte operands beginning at ad- 
dresses that are not evenly divisible by 4, or 2-byte 
words split between two physical 4-byte words. 
These are referred to as unaligned transfers. 


Operand alignment and data bus size dictate when 
multiple bus cycles are required. Table 10-7 de- 
scribes the transfer cycles generated for all combi- 
nations of logical operand lengths, alignment, and 


_ data bus sizing. When multiple cycles are required to 


transfer a multibyte logical operand, the highest-or- 
der bytes are transferred first. For example, when 
the processor does a 4-byte unaligned read begin- 
ning at location x11 in the 4-byte aligned space, the 
three high order bytes are read in the first bus cycle. 
The low byte is read in a subsequent bus cycle. 
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Table 10-6. Generating AO, A1 and BHE# from the Intel486™ Processor Byte Enables 
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Table 10-7. Transfer Bus Cycles for Bytes, Words and Dwords 


Physical Byte Address in 
Memory (Low Order Bits) 


Transfer Cycles over 16-Bit 
Bus 
(jt = BS #16 asserted) 


Transfer Cycies over 8-Bit 
Bus 
(t = BS8# Asserted) 


KEY: 

b = byte transfer h = high-order portion 
w = 2-byte transfer | = low-order portion 
3 = 3-byte transfer m = mid-order portion 
d = 4-byte transfer 


The function of unaligned transfers with dynamic 
bus sizing is not obvious. When the external systems 
asserts BS16# or BS8# forcing extra cycles, low- 
order bytes or words are transferred first (opposite 
to the example above). When the Intel486 processor 
requests a 4-byte read and the external system as- 
serts BS16#, the lower 2 bytes are read first fol- 
lowed by the upper 2 bytes. 


In the unaligned transfer described above, the proc- 
essor requested three bytes on the first cycle. If the 
external system asserted BS16# during this 3-byte 
transfer, the lower word is_ transferred first 


Byte-Length of Logical Operand 


hw 
lw 


T 
byte with byte with 
lowest address highest 


address 


followed by the upper byte. In the final cycle the low- 
er byte of the 4-byte operand is transferred as in the 
32-bit example above. 


10.2 Bus Functional Description 


The Intel486 processor supports a wide variety of 
bus transfers to meet the needs of high performance 
systems. Bus transfers can be single cycle or multi- 
ple cycle, burst or non-burst, cacheable or non- 
cacheable, 8-, 16- or 32-bit, and pseudo-locked. To 
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support multiprocessing systems there are cache in- 
validation cycles and locked cycles. 


This section begins with basic non-cacheable non- 
burst single cycle transfers. It moves on to multiple 
cycle transfers and introduces the burst mode. 
Cacheability is introduced in section 10.2.3, ‘““Cache- 
able Cycles.” The remaining sections describe 
locked, pseudo-locked, invalidate, bus hold and in- 
terrupt cycles. 


Bus cycles and data cycles are discussed in this 
section. A bus cycle. is at least two clocks long and 
begins with ADS # active in the first clock and ready 
active in the last clock. Data is transferred to or from 
the Intel486 processor during a data cycle. A bus 
cycle contains one or more data cycles. 


Refer to section 10.2.13, “Bus States,” for a de- 
scription of the bus states shown in the timing dia- 
grams. 


ADS# 
A2-A31 


BEO—3# 
WI/R# 


* To Processor 
** From Processor 


; a 
intel. 
10.2.1 NON-CACHEABLE NON-BURST SINGLE 
CYCLE 


10.2.1.1 No Wait States 


The fastest non-burst bus cycle that the Intel486 
processor supports is two clocks long. These cycles 
are called 2-2 cycles because reads and writes take 
two cycles each. The first ‘2’ refers to reads and 
the second to writes. 


For example, if a wait state needs to be added to the 
write, the cycle would be called 2-3. 


Basic two clock read and write cycles are shown in 
Figure 10-7. The Intel486 processor initiates a cycle 
by asserting the address status signal (ADS #) at the 
rising edge of the first clock. The ADS# output indi- 
cates that a valid bus cycle definition and address is 
available on the cycle definition lines and address 
bus. 


D/C# , 


NNN «ANY ANANN 
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Figure 10-7. Basic 2-2 Bus Cycle 
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The non-burst ready input (RDY #) is returned by the 
external system in the second clock. RDY# indi- 
cates that the external system has presented valid 
data on the data pins in response to a read or the 
external system has accepted data in response to a 
write. 


The Intel486 processor samples RDY# at the end 
of the second clock. The cycle is complete if RDY # 
is active (LOW) when sampled. Note that RDY# is 
ignored at the end of the first clock of the bus cycle. 


The burst last signal (BLAST #) is asserted (LOW) 
by the Intel486 processor during the second clock of 
the first cycle in all bus transfers illustrated in Figure 
10-7. This indicates that each transfer is complete 
after a single cycle. The Intel486 processor asserts 
BLAST # in the last cycle of a bus transfer. 


CLK 


Intel486™ PROCESSOR FAMILY 


The timing of the parity check output (PCHK#) is 
shown in Figure 10-7. The Intel486 processor drives 
the PCHK# output one clock after ready terminates 
a read cycle. PCHK# indicates the parity status for 
the data sampled at the end of the previous clock. 
The PCHK # signal can be used by the external sys- 
tem. The Intel486 processor does nothing in re- 
sponse to the PCHK# output. 


10.2.1.2 Inserting Wait States 


The external system can insert wait states into the 
basic 2-2 cycle by driving RDY # inactive at the end 
of the second clock. RDY # must be driven inactive 
to insert a wait state. Figure 10-8 illustrates a simple 
non-burst, non-cacheable signal with one wait state 
added. Any number of wait states can be added to 
an Intel486 processor bus cycle by maintaining 
RDY # inactive. . 
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Figure 10-8. Basic 3-3 Bus Cycle 
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The burst ready input (BRDY #) must be driven inac- 
tive on all clock edges where RDY # is driven inac- 
tive for proper operation of these simple non-burst 
cycles. 


10.2.2 MULTIPLE AND BURST CYCLE BUS 
TRANSFERS 


Multiple cycle bus transfers can be caused by inter- 
nal requests from the Intel486 processor or by the 


external memory system. An internal request for a_ 


128-bit pre-fetch must take more than one cycle. In- 
ternal requests for unaligned data may also require 
multiple bus cycles. A cache line fill requires multiple 
cycles to complete. | 


The external system can cause a multiple cycle 
transfer when it can only supply 8- or 16-bits per 
cycle. 


Only multiple cycle transfers caused by internal re- 
quests are considered in this section. Cacheable cy- 
cles and 8- and 16-bit transfers are covered in sec- 
tion 10.2.3, ‘““Cacheable Cycles” and section 10.2.5, 
“8- and 16-Bit Cycles.” 


Internal Requests from Intel486 DX, IntelDX2, 
and IntelDX4 Processors 


An internal request by an Intel486 DX, IntelDX2, or 
IntelDX4 processor for a 64-bit floating point load 
must take more than one internal cycle. 


10.2.2.1 Burst Cycles 


The Intel486 processor can accept burst cycles for 
any bus requests that require more than a single 
data cycle. During burst cycles, a new data item is 
strobed into the Intel486 processor every clock rath- 
er than every other clock as in non-burst cycles. The 
fastest burst cycle requires 2 clocks for the first data 
item with subsequent data items returned every 
clock. 


The Intel486 processor is capable of bursting a max- 
imum of 32 bits during a write. Burst writes can only 
occur if BS8# or BS16¥# is asserted. For example, 
the Intel486 processor can burst write four 8-bit op- 
erands or two 16-bit operands in a single burst cycle. 
But the Intel486 processor cannot burst multiple 32- 
bit writes in a single burst cycle. 7 


Burst cycles begin with the Intel486 processor driv- 


ing out an address and asserting ADS # in the same 
manner as non-burst cycles. The Intel486 processor 
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indicates that it is willing to perform a burst cycle by 
holding the burst last signal (BLAST #) inactive in 
the second clock of the cycle. The external system 


indicates its willingness to do a burst cycle by return- 
ing the burst ready signal (BRDY #) active. 


The addresses of the data items in a burst cycle will 
all fall within the same 16-byte aligned area (corre- 
sponding to an internal Intel486 processor cache 
line). A 16-byte aligned area begins at location 
XXXXXXXO0 and ends at location XXXXXXXF. During 
a burst cycle, only BEO-3#, Ao, and Ag may 
change. A4y-A31, M/IO#, D/C#, and W/R# will re- 


main stable throughout a burst. Given the first ad- 


dress in a burst, external hardware can easily calcu- 
late the address of subsequent transfers in advance. 
An external memory system can be designed to | 
quickly fill the Intel486 processor internal cache 
lines. 


Burst cycles are not limited to cache line fills. Any 


multiple cycle read request by the Intel486 proces- 


sor can be converted into a burst cycle. The Intel486 
processor will only burst the number of bytes need- 
ed to complete a transfer. 


For example, the Intel486 DX, IntelDX2, Write-Back 
Enhanced IntelDX2 or IntelDX4 processor will burst 
eight bytes for a 64-bit floating point non-cacheable 
read. 


The external system converts a multiple cycle re- 
quest into a burst cycle by returning BRDY# active 
rather than RDY # (non-burst ready) in the first cycle 
of a transfer. For cycles that cannot be burst, such 
as interrupt acknowledge and halt, BRDY # has the 
same effect as RDY#. BRDY# is ignored if both 
BRDY # and RDY# are returned in the same clock. 
Memory areas and peripheral devices that cannot 
perform bursting must terminate cycles with RDY#. 


10.2.2.2 Terminating Multipie and Burst Cycle | 
Transfers 


The Intel486 processor drives BLAST # inactive for 
all but the last cycle in a multiple cycle transfer. 
BLAST # is driven inactive in the first cycle to inform 
the external system that the transfer could take ad- 
ditional cycles. BLAST # is driven active in the last 
cycle of the transfer indicating that the next time 
BRDY# or RDY# is returned the transfer is com- 
plete. 


BLAST # is not valid in the first clock of a bus cycle. 
It should be sampled only in the second and subse- 
quent clocks when RDY# or BRDY # is returned. 


| 
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The number of cycles in a transfer is a function of 
several factors including the number of bytes the 
Intel486 processor needs to complete an internal re- 
quest (1, 2, 4, 8, or 16), the state of the bus size 
inputs (BS8# and BS16#), the state of the cache 


enable input (KEN#) and alignment of the data to 
be transferred. 


When the Intel486 processor initiates a request it 
knows how many bytes will be transferred and if the 
data is aligned. The external system must indicate 
whether the data is cacheable (if the transfer is a 
read) and the width of the bus by returning the state 


of the KEN#, BS8# and BS16# inputs one clock - 


before RDY# or BRDY# is returned. The Intel486 
processor determines how many cycies a transfer 
will take based on its internal information and inputs 
from the external system. 


BLAST # is not valid in the first clock of a bus cycle 
because the Intel486 processor cannot determine 
the number of cycles a transfer will take until the 
external system returns KEN#, BS8# and BS16#. 
BLAST # should only be sampled in the second and 
subsequent clocks of a cycle when the external sys- 
tem returns RDY # or BRDY#. 


The system may terminate a burst cycle by returning 
RDY # instead of BRDY#. BLAST # will remain de- 
asserted until the last transfer. However, any trans- 
fers required to complete a cache line fill will follow 
the burst order, e.g., if burst order was 4, 0, C, 8 and 
RDY # was returned at after 0, the next transfers will 
be from C and 8. 


10.2.2.3 Non-Cacheable, Non-Burst, Multiple 
Cycle Transfers 


Figure 10-9 illustrates a 2 cycle non-burst, non- 
cacheable multiple cycle read. This transfer is simply 
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a sequence of two single cycle transfers. The 
Intel486 processor indicates to the external system 
that this is a multiple cycle transfer by driving 
BLAST # inactive during the second clock of the first 


' cycle. The external system returns RDY # active in- 


dicating that it will not burst the data. The external 
system also indicates that the data is not cacheable 
by returning KEN# inactive one clock before it re- 
turns RDY# active. When the Intel486 processor 
samples RDY # active it ignores BRDY #. 


Each cycle in the transfer begins when ADS# is 
driven active and the cycle is complete when the 
external system returns RDY # active. 


The Intel486 processor indicates the last cycle of 
the transfer by driving BLAST# active. The next 
RDY # returned by the external system terminates 
the transfer. 


10.2.2.4 Non-Cacheable Burst Cycles 


The external system converts a multiple cycle re- 
quest into a burst cycle by returning BRDY# active 
rather than RDY # in the first cycle of the transfer. 
This is illustrated in Figure 10-10. 


There are several features to note in the burst read. 
ADS # is only driven active during the first cycle of 
the transfer. RDY# must be driven inactive when 
BRDY # is returned active. 


BLAST # behaves exactly as it does in the non-burst 
read. BLAST # is driven inactive in the second clock 
of the first cycle of the transfer indicating more cy- 
cles to follow. In the last cycle, BLAST # is driven 
active telling the external memory system to end the 
burst after returning the next BRDY #. 
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_ Figure 10-9. Non-Cacheable, Non-Burst, Multiple-Cycle Transfers 
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Figure 10-10. Non-Cacheable Burst Cycle 
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10.2.3 CACHEABLE CYCLES 


Any memory read can become a cache fill operation. 
The external memory system can allow a read re- 
quest to fill a cache line by returning KEN# active 
one clock before RDY# or BRDY# during the first 
cycle of the transfer on the external bus. Once 
KEN # is asserted and the remaining three require- 
ments described below are met, the Intel486 proc- 
essor will fetch an entire cache line regardless of the 
state of KEN#. KEN# must be returned active in 
the last cycle of the transfer for the data to be writ- 
ten into the internal cache. The Intel486 processor 
will only convert memory reads or prefetches into a 
cache fill. 


KEN # is ignored during write or |/O cycles. Memory 
writes will only be stored in the on-chip cache if 
there is a cache hit. |1/O space is never cached in 
the internal cache. 


To transform a read or a prefetch into a cache line 
fill the following conditions must be met: 


1. The KEN# pin must be asserted one clock prior 
to RDY # or BRDY# being returned for the first 
data cycle. 


2. The cycle must be of the type that can be internal- 
ly cached. (Locked reads, I/O reads, and interrupt 
acknowledge cycles are never cached). 


3. The page table entry must have the page cache 
disable bit (PCD) set to 0. To cache a page table 
entry, the page directory must have PCD=0O. To 
cache reads or prefetches when paging is dis- 
abled, or to cache the page directory entry, con- 
trol register 3 (CR3) must have PCD=0. 


4. The cache disable (CD) bit in control register 0 


(CRO) must be clear. 
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External hardware can determine when the Intel486 
processor has transformed a read or prefetch into a 
cache fill by examining the KEN#, M/IO#, D/C#, 
W/R#, LOCK#, and PCD pins. These pins convey 
to the system the outcome of conditions 1-3 in the 
above list. In addition, the Intel486 processor drives 
PCD high whenever the CD bit in CRO is set, so that 
external hardware can evaluate condition 4. 


Cacheable cycles can be burst or non-burst. 


10.2.3.1 Byte Enables during a Cache Line Fill 


For the first cycle in the line fill, the state of the byte 
enables should be ignored. In a non-cacheable 
memory read, the byte enables indicate the bytes 
actually required by the memory or code fetch. 


The Intel486 processor expects to receive valid data 
on its entire bus (32 bits) in the first cycle of a cache 
line fill. Data should be returned with the assumption 
that all the byte enable pins are driven active. How- 
ever if BS8# is asserted only one byte need be re- 
turned on data lines DO-D7. Similarly if BS16# is 
asserted two bytes should be returned on DO-D15. 


The Intel486 processor will generate the addresses 
and byte enables for all subsequent cycles in the 
line fill. The order in which data is read during a line 
fill depends on the address of the first item read. 
Byte ordering is discussed in section 10.2.4, “Burst 
Mode Details.” 
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10.2.3.2 Non-Burst Cacheable Cycles | 


Figure 10-11 shows a non-burst cacheable cycle. 
The cycle becomes a cache fill when the Intel486 
processor samples KEN# active at the end of the 
first clock. The Intel486 processor drives BLAST # 
inactive in the second clock in response to KEN #. 
BLAST # is driven inactive because a cache fill re- 
quires 3 additional cycles to complete. BLAST # re- 


mains inactive until the last transfer in the cache line 


fill. KEN# must be returned active in the last cycle 
of the transfer for the data to be written into the 
internal cache. 7 


Note that this cycle would be a single bus cycle if 
KEN # was not sampled active at the end of the first 


= , 
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clock. The subsequent three reads would not have 
happened since a cache fill was not requested. 


The BLAST # output is invalid in the first clock of a 
cycle. BLAST # may be active during the first clock 
due to earlier inputs. Ignore BLAST # until the sec- 
ond clock. 


During the first cycle of the cache line fill the exter- 
nal system should treat the byte enables as if they 
are all active. In subsequent cycles in the burst, the 
Intel486 processor drives the address lines and byte 
enables. (See section 10.2.4.2, ‘Burst and Cache 
Line Fill Order‘) . 
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Figure 10-11. Non-Burst, Cacheable Cycles 
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10.2.3.3 Burst Cacheable Cycles 


Figure 10-12 illustrates a burst mode cache fill. As in 
Figure 10-11, the transfer becomes a cache line fill 
when the external system returns KEN# active at 
the end of the first clock in the cycle. 
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The external system informs the Intel486 processor 
that it will burst the line in by driving BRDY # active 
at the end of the first cycle in the transfer. 


Note that during a burst cycle, ADS # is only driven 
with the first address. 
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Figure 10-12. Burst Cacheable Cycle 
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10.2.3.4 Effect of Changing KEN# duringa — 
Cache Line Fill 


KEN# can change multiple times as long as it ar- 
rives at its final value in the clock before RDY # or 
BRDY# is returned. This is illustrated in Figure 
10-13. Note that the timing of BLAST # follows that 
of KEN# by one clock. The Intel486 processor sam- 
ples KEN# every clock and uses the value returned 
in the clock before ready to determine if a bus cycle 
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would be a cache line fill. Similarly, it uses the value 
of KEN # in the last cycle before early RDY # to load 
the line just retrieved from memory into the cache. 


KEN# is sampled every clock and it must satis 
setup and hold time. — 


KEN # can also change multiple times before a burst 
cycle, as long as it arrives at its final value one clock 
before ready is returned active. 
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Figure 10-13. Effect of Changing KEN # 
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10.2.4 BURST MODE DETAILS chip when either RDY # or BRDY # are active. Driv- 
ing BRDY # and RDY # inactive adds a wait state to 
the transfer. A burst cycle where two clocks are re- 


10.2.4.1 Adding Wait States to Burst Cycles quired for every burst item is shown in Figure 10-14. 


Burst cycles need not return data on every clock. 
The Intel486 processor will only strobe data into the 
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Figure 10-14. Slow Burst Cycle 
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10.2.4.2 Burst and Cache Line Fill Order __ Table 10-8. Burst Order 
(Both Read and Write cdalbcha 


Second Third -Fourth | 
Addr. Addr. Addr. 
C 


The burst order used by the Intel486 processor is 
shown in Table. 10-8. This burst order is followed by 
any burst cycle (cache or not), cache line fill anuset 
or not) or code prefetch. 


The Intel486 processor presents each request for 
data in an order determined by the first address in 
the transfer. For example, if the first address was 
104 the next three addresses in the burst will be 
100, 10C and 108. An example of burst address se- 
quencing is shown in Figure 10-15. 
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Figure 10-15. Burst Cycle Showing Order of Addresses 
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The sequences shown in Table 10-8 accommodate 
systems with 64-bit buses as well as systems with 
32-bit data buses. The sequence applies to all 
bursts, regardless of whether the purpose of the 
burst is to fill a cache line, do a 64-bit read, or do a 
pre-fetch. If either BS8# or BS16# is returned ac- 
tive, the Intel486 processor completes the transfer 
of the current 32-bit word before progressing to the 
next 32-bit word. For example, a BS16# burst to 
address 4 has the following order: 4-6-0-2-C-E-8-A. 


10.2.4.3 Interrupted Burst Cycles 


Some memory systems may not be able to respond 
with burst cycles in the order defined in Table 10-8. 
To support these systems the Intel486 processor al- 
lows a burst cycle to be interrupted at any time. The 
Intel486 processor will automatically generate 
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another normal bus cycle after being interrupted to 
complete the data transfer. This is called an inter- 
rupted burst cycle. The external system can respond 
to an interrupted burst cycle with another burst cy- 
cle. 


The external system can interrupt a burst cycle by 
returning RDY# instead of BRDY#. RDY# can be 
returned after any number of data cycles terminated 
with BRDY #. | 


An example of an interrupted burst cycle is shown in 
Figure 10-16. The Intel486 processor immediately 
drives ADS # active to initiate a new bus cycle after 
RDY# is returned active. BLAST# driven inactive 
one clock after ADS# begins the second bus cycle 
indicating that the transfer is not complete. | 
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Figure 10-16. Interrupted Burst Cycle 
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KEN# need not be returned active in the first data 
cycle of the second part of the transfer in Figure 
10-16. The cycle had been converted to a cache fill 
in the first part of the transfer and the Intel486 proc- 
essor expects the cache fill to be completed. Note 
that the first half and second half of the transfer in 
Figure 10-16 are each two cycle burst transfers. 


The order in which the Intel486 processor requests 
operands during an interrupted burst transfer is de- 
termined by Table 10-7. Mixing RDY# and BRDY# 
does not change the order in which operand ad- 
dresses are requested by the Intel486 processor. 


An example of the order in which the Intel486 proc- 
essor requests operands during a cycle in which 
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Figure 10-17. Interrupted Burst Cycle with Unobvious Order of Addresses 
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the external system mixes RDY# and BRDY# is 
shown in Figure 10-17. The Intel486 processor ini- 
tially requests a transfer beginning at location 104. 
The transfer becomes a cache line fill when the ex- . 
ternal system returns KEN # active. The first cycle of 
the cache fill transfers the contents of location 104 
and is terminated with RDY#. The Intel486 proces- 
sor drives out a new request (by asserting ADS #) to 
address 100. If the external system terminates the 
second cycle with BRDY#, the Intel486 processor 
will next request/expect address 10C. The correct 
order is determined by the first cycle in the transfer, 
which may not be the first cycle in the burst if the 
system mixes RDY# with BRDY #. 
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10.2.5 8- AND 16-BIT CYCLES 


The Intel486 processor supports both 16- and 8-bit 
external buses through the BS16# and BS8# in- 
puts. BS16# and BS8# allow the external system to 
specify, on a cycle by cycle basis, whether the ad- 
dressed component can supply 8, 16 or 32 bits. 
BS16# and BS8# can be used in burst cycles as 
well as non-burst cycles. If both BS16# and BS8# 
are returned active for any bus cycle, the Intel486 
processor will respond as if only BS8# were active. 


The timing of BS16# and BS8# is the same as that 
of KEN#. BS16# and BS8# must be driven active 
before the first RDY# or BRDY# is driven active. 
Driving the BS16# and BS8# active can force the 
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Intel486 processor to run additional cycles to com- 
plete what would have been only a single 32-bit cy- 
cle. BS8# and BS16# may change the state of 
BLAST # when they force subsequent cycles from 
the transfer. 


Figure 10-18. shows an example in which BS8# 
forces the Intel486 processor to run two extra cycles 
to complete a transfer. The |Intel486 processor is- 
sues a request for 24 bits of information. The exter- 
nal system drives BS8# active indicating that only 
eight bits of data can be supplied per cycle. The 
Intel486 processor issues two extra cycles to com- 
plete the transfer. 
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Figure 10-18. 8-Bit Bus Size Cycle 
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Extra cycles forced by the BS16# and BS8# should 
be viewed as independent bus cycles. BS16# and 
BS8# should be driven active for each additional 
cycle.unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. The Intel486 processor will drive BLAST # 
inactive until the last cycle before the transfer is 
complete. a: 


Refer to section 10.1.2, “Dynamic Data Bus Sizing,” 
for the sequencing of addresses while BS8# or 
BS16# are active. / . 


; | | 
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BS8# and BS16# operate during burst cycles in ex- 
actly the same manner as non-burst cycles. For ex- 
ample, a single non-cacheable read could be trans- 
ferred by the Intel486 processor as four 8-bit burst 
data cycles. Similarly, a single 32-bit write could be 
written as four 8-bit burst data cycles. An example of 


a burst write is shown in Figure 10-19. Burst writes 
can only occur if BS8# or BS16# is asserted. 
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Figure 10-19. Burst Write as a Result of BS8# or BS16# 
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10.2.6 LOCKED CYCLES 


Locked cycles are generated in software for any in- 
struction that performs a read-modify-write opera- 
tion. During a read-modify-write operation the In- 
tel486 processor can read and modify a variable in 
external memory and be assured that the variable is 
not accessed between the read and write. 


Locked cycles are automatically generated during 
certain bus transfers. The xchg (exchange) instruc- 
tion generates a locked cycle when one of its oper- 
ands is memory based. Locked cycles are generat- 
ed when a segment or page table entry is updated 
and during interrupt acknowledge cycles. Locked cy- 
cles are also generated when the LOCK instruction 
prefix is used with selected instructions. 


Locked cycles are implemented in hardware with the 
LOCK# pin. When LOCK# is active, the Intel486 
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processor is performing a read-modify-write opera- 
tion and the external bus should not be relinquished 
until the cycle is complete. Multiple reads or writes 
can be locked. A locked cycle is shown in Figure 
10-20. LOCK # goes active with the address and bus 
definition pins at the beginning of the first read cycle 
and remains active until RDY# is returned for the 
last write cycle. For unaligned 32 bits read-modify- 
write operation, the LOCK # remains active for the 
entire duration of the multipie cycle. It will go inactive 
when RDY # is returned for the last write cycle. 


When LOCK # is active, the Intel486 processor will 
recognize address hold and backoff but will not rec- 
ognize bus hold. It is left to the external system to 
properly arbitrate a central bus when the Intel486 
processor generates LOCK #. 
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Figure 10-20. Locked Bus Cycle 


2-195 


Intel486™ PROCESSOR FAMILY 


10.2.7 PSEUDO-LOCKED CYCLES 


Pseudo-locked cycles assure that no other master 
will be given control of the bus during operand trans- 
fers which take more than one bus cycle. 


For the Intel486 processor, examples include 64-bit 
description loads and cache line fills. 


Pseudo-locked transfers are indicated by the 
PLOCK# pin. The memory operands must be 
aligned for correct operation of a pseudo-locked cy- 
cle. 


PLOCK# need not be examined during burst reads. 
A 64-bit aligned operand can be retrieved in one 
burst (note: this is only valid in systems that do not 
interrupt bursts). 


The system must examine PLOCK# during 64-bit 
writes since the Intel486 processor cannot burst 
write more than 32 bits. However, burst can be used 
within each 32-bit write cycle if BS8# or BS16# is 
asserted. BLAST will be de-asserted in response to 
BS8# or BS16#. A 64-bit write will be driven out as 
two non-burst bus cycles. BLAST # is asserted dur- 
ing both writes since a burst is not possible. 
PLOCK # is asserted during the first write to indicate 
that another write follows. This behavior is shown in 
Figure 10-21. | 
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The first cycle of a 64-bit floating point write is the 
only case in which both PLOCK# and BLAST# are 


asserted. Normally PLOCK# and BLAST#¥ are the 
inverse of each other. 


During all of the cycles where PLOCK # is asserted, 
HOLD is not acknowledged until the cycle com- 
pletes. This results in a large HOLD latency, espe- 
cially when BS8# or BS16# is asserted. To reduce 
the HOLD latency during these cycles, windows are 
available between transfers to allow HOLD to be ac- 
knowledged during non-cacheable code prefetches. 
PLOCK# will be asserted since BLAST # is negat- 
ed, but it is ignored and HOLD is recognized during 
the prefetch. : 


PLOCK# can change several times during a cycle 


settling to its final value in the clock ready is re- 
turned. 


_10.2.7.1 Floating Point Read and Write Cycles 


For Intel486 DX, IntelDX2, Write-Back Enhanced In- 
telDX2, and IntelDX4 processors, 64-bit floating 
point read and write cycles are also examples of op- 
erand transfers that take more than one bus cycle. 


WIR# 3 : 
a bo 


| | 
PLOCK# | 


BLAST# 


** From Processor 


| ee 
ROV# TYIVIIVINORRRLARRLARALARALAL || /OXQOOOOOROOOUR | OO 
| | 


| | | 
| | l 
| 
| 


242202-A5 


Figure 10-21. Pseudo Lock Timing 
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10.2.8 INVALIDATE CYCLES 


Invalidate cycles are needed to keep the Intel486 
processor internal cache contents consistent with 
external memory. The Intel486 processor contains a 
mechanism for listening to writes by other devices to 
external memory. When the Intel486 processor finds 
a write to a section of external memory contained in 
its internal cache, the Intel486 processor’s internal 
copy is invalidated. 


Invalidations use two pins, address hold request 
(AHOLD) and valid external address (EADS#). 
There are two steps in an invalidation cycle. First, 
the external system asserts the AHOLD input forcing 
the Intel486 processor to immediately relinquish its 
address bus. Next, the external system asserts 
EADS # indicating that a valid address is on the In- 
tel486 processor address bus. Figure 10-22 shows 
the fastest possible invalidation cycle. The Intel486 
processor recognizes AHOLD on one CLK edge and 
floats the address bus in response. To allow the ad- 
dress bus to float and avoid contention, EADS # and 
the invalidation address should not be driven until 
the following CLK edge. The Intel486 processor 
reads the address over its address lines. If the In- 
tel486 processor finds this address in its internal 
cache, the cache entry is invalidated. Note that the 
Intel486 processor address bus is input/output, un- 
like the Intel386 processor's bus, which is output 
only. 
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The Intel486 processor immediately relinquishes its 
address bus in the next clock upon assertion of 
AHOLD. For example, the bus could be 3 wait states 
into a read cycle. If AHOLD is activated, the Intel486 
processor will immediately float its address bus be- 
fore ready is returned terminating the bus cycle. 


When AHOLD is asserted only the address bus is 
floated, the data bus can remain active. Data can be 
returned for a previously specified bus cycle during 
address hold. (See Figure 10-22 and Figure 10-23.) 


EADS # is normally asserted when an external mas- 
ter drives an address onto the bus. AHOLD need not 
be driven for EADS# to generate an internal invali- 
date. If EADS# alone is asserted while the Intel486 
processor is driving the address bus, it is possible 
that the invalidation address will come from the 
Intel486 processor itself. 


Note that it is also possible to run an invalidation 
cycle by asserting EADS # when BOFF # is asserted 
or after HLDA has been returned, following the as- 
sertion of HOLD. 


Running an invalidate cycle prevents the Intel486 
processor cache from satisfying other internal re- 
quests, so invalidations should be run only when 
necessary. The fastest possible invalidate cycle is 
shown in Figure 10-22, while a more realistic invali- 
dation cycle is shown in Figure 10-23. Both of the 
examples take one clock of cache access from the 
Intel486 processor. 
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Figure 10-23. Typical Internal Cache Invalidation Cycle 
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10.2.8.1 Rate of Invalidate Cycles 


The Intel486 processor can accept one invalidate 
per clock except in the last clock of a line fill. One 
invalidate per clock is possible as long as EADS # is 
negated in ONE or BOTH of the following cases: 


1. In the clock RDY # or BRDY # is returned for the 
last time. 


2. In the clock following RDY # or BRDY # being re- 
turned for the last time. 


This definition allows two system designs. Simple 
designs can restrict invalidates to one every other 
clock. The simple design need not track bus activity. 
Alternatively, systems can request one invalidate 
per clock provided that the bus is monitored. 


10.2.8.2 Running Invalidate Cycles Concurrently 
with Line Fills 


Precautions are necessary to avoid caching stale 
data in the Intel486 processor cache in a system 
with a second level cache. An example of a system 
with a second level cache is shown in Figure 10-24. 


An external device can be writing to main memory 
over the system bus while the Intel486 processor is 
retrieving data from the second level cache. The 
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Figure 10-24. System with Second Level Cache 
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_ Intel486 processor will need to invalidate a line in its 


internal cache if the external device is writing to a 
main memory address also contained in the Intel486 
processor cache. . 


A potential problem exists if the external device is 
writing to an address in external memory, and at the 
same time the Intel486 processor is reading data 
from the same address in the second level cache. 
The system must force an invalidation cycle to invali- 
date the data that the Intel486 processor has re- 
quested during the line fill. 


If the system asserts EADS # before the first data in 
the line fill is returned to the Intel486 processor, the 
system must return data consistent with the new 
data in the external memory upon resumption of the | 
line fill after the invalidation cycle. This is illustrated 
by the asserted EADS# signal labeled 1 in Figure 
10-25. 


If the system asserts EADS# at the same time or 
after the first data in the line fill is returned (in the 
same clock that the first RDY# or BRDY# is re- 
turned or any subsequent clock in the line fill) the 
data will be read into the Intel486 processor input 
buffers but it will not be stored in the on-chip cache. 
This is illustrated by asserted EADS# signal labeled 
2 in Figure 10-25. The stale data will be used to 
satisfy the request that initiated the cache fill cycle. 


10.2.9 BUS HOLD 


The Intel486 processor provides a bus hold, hold 
acknowledge protocol using the bus hold request 
(HOLD) and bus hold acknowledge (HLDA) pins. As- 
serting the HOLD input indicates that another bus 
master desires control of the Intel486 processor 
bus. The Intel486 processor will respond by floating 


. its bus and driving HLDA active when the current 


bus cycle, or sequence of locked cycles is complete. 


- An example of a HOLD/HLDA transaction is shown 


in Figure 10-26. Unlike the Intel886 processor, the 
Intel486 processor can respond to HOLD by floating 
its bus and asserting HLDA while RESET is assert- 
ed. 


Note that HOLD will be recognized during un-aligned 
writes (less than or equal to 32-bits) with BLAST # 
being active for each write. For greater than 32-bit or 
un-aligned write, HOLD# recognition is prevented 
by PLOCK# getting asserted. However, HOLD is 
recognized during non-cacheable, non-burstable 
code prefetches even though PLOCK # is active. 
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1. Data returned must be consistent if its address equals the invalidation address in this clock 
| 2. Data returned will not be cached if its address equals the invalidation address in this clock 


Figure 10-25. Cache invalidation Cycle Concurrent with Line Fill 


For cacheable and non-bursted or bursted cycles, 
HOLD is acknowledged during backoff only if HOLD 
and BOFF # are asserted during an active bus cycle 
(after ADS # asserted) and before the first RDY # or 
_ BRDY# has been returned (see Figure 10-27). The 
order in which HOLD and BOFF # go active is unim- 
portant (so long as both are active prior to the 


first RDY#/BRDY# returned by the system). 
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Figure 10-27 shows the case where HOLD is assert- 
ed first; HOLD could be asserted simultaneously or 
after BOFF# and still be acknowledged. 


The pins floated during bus hold are: BEO# -BE3#, 
PCD, PWT, W/R#, D/C#, M/IO#, LOCK#, 
PLOCK#, ADS#, BLAST#, DO-D31, A2-A31, 
DPO-DP3. | 
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Figure 10-27. HOLD Request Acknowledged during BOFF # 
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10.2.10 INTERRUPT ACKNOWLEDGE 


The Intel486 processor generates interrupt acknowl- 
edge cycles in response to maskable interrupt re- 
quests generated on the interrupt request input 
(INTR) pin. Interrupt acknowledge cycles have a 


unique cycle type generated on the cycle type pins. — 


An example of an interrupt acknowledge transaction 
is shown in Figure 10-28. Interrupt acknowledge cy- 
cles are generated in locked pairs. Data returned 
during the first cycle is ignored. The interrupt vector 
is returned during the second cycle on the lower 8 
bits of the data bus. The Intel486 processor has 256 
possible interrupt vectors. 


intel. 


The state of A2 distinguishes the first and second 
interrupt acknowledge cycles. The byte address 
driven during the first interrupt acknowledge cycle is 
4 (A31-A3 low, A2 high, BE3#-BE1# high, and 
BEO# low). The address driven during the second 
interrupt acknowledge cycle is 0 (A31-—A2 low, 
BE3 #-BE1# high, BEO# low). . 


Each of the interrupt acknowledge cycles are termi- 
nated when the external system returns RDY# or 
BRDY #. Wait states can be added by withholding 
RDY # or BRDY #. The Intel486 processor automati- 


_ cally generates four idle clocks between the first and 


second cycles to allow for 8259A recovery time. 
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Figure 10-28. Interrupt Acknowledge Cycles 
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10.2.11 SPECIAL BUS CYCLES 


The Intel486 processor provides special bus cycles 
to indicate that certain instructions have been exe- 
cuted, or certain conditions have occurred internally. 
The special bus cycles in Table 10-9 are defined 
when the bus cycle definition pins are in the follow- 
ing state: M/IO# =0, D/C # =0 and W/R# = 1. 


During these cycles the address bus is driven low 
while the data bus is undefined. 
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Two of the special cycles indicate halt or shutdown. 
Another special cycle is generated when the 
Intel486 processor executes an INVD (invalidate 
data cache) instruction and could be used to flush 
an external cache. The Write Back cycle is generat- 
ed when the Intel486 processor executes the 
WBINVD (write-back invalidate data cache) instruc- 
tion and could be used to synchronize an external - 
write-back cache. : 


The external hardware must acknowledge these 
special bus cycles by returning RDY # or BRDY#. 


Table 10-9. Special Bus Cycle Encoding 


Write-Back(1) 
First Flush Ack Cycle() 
Flush(1) 


Second Flush Ack Cycle(1) 
Shutdown 

HALT 

Stop Grant Ack Cycle(2) 


NOTES: 


1. These cycles are specific to the Write-Back Enhanced IntelDX2 processor. (See section 7.4.1, “Snoop Cycles and Write- 
Back Invalidation.”)) The FLUSH # cycle is applicable to all Intel486 processors. See appropriate sections. 


2. See section 9.6.1, “Stop Grant Bus Cycle,” for details. 


ADS# 


A2-A31 
M/lO# 
D/Cc# 
BEO-34 


BLAST# ' 


* To Processor 


Figure 10-29. Restarted Read Cycle 
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10.2.11.1 HALT Indication Cycle 


The Intel486 processor halts as a result of executing 
a HALT instruction. Signaling its entrance into the 
HALT state, a HALT indication cycle is performed. 
The HALT indication cycle is identified by the bus 
definition signals in special bus cycle state and a 
byte address of 2. BEO# and BE2# are the only 
signals distinguishing HALT indication from shut- 
down indication, which drives an address of 0. Dur- 
ing the HALT cycle, undefined data is driven on 
DO-D31. The HALT indication cycle must be ac- 
knowledged by RDY# asserted. 


A halted Intel486 processor resumes execution 


when INTR (if interrupts are enabled) or NMI or RE- 
SET is asserted. 


10.2.11.2 Shutdown Indication Cycle 


The Intel486 processor shuts down as a result of a 
protection fault while attempting to process a double 
fault. Signaling its entrance into the shutdown state, 
_ a shutdown indication cycle is performed. The shut- 
down indication cycle is identified by the bus 


ADS# 


ADDR 
SPEC 


B 

intel. 
definition signals in special bus cycle state and a 
byte address of 0. 


10.2.11.3 Stop Grant Indication Cycle 


A special Stop Grant bus cycle will be driven to the 
bus after the processor recognizes the STPCLK# 
interrupt. The definition of this bus cycle is the same 


_as the HALT cycle definition for the Intel486 proces- 


sor, with the exception that the Stop Grant bus cycle 
drives the value 0000 0010H on the address pins. 
The system hardware must acknowledge this cycle 


by returning RDY# or BRDY#. The processor will 


not enter the Stop Grant state until either RDY # or 
BRDY # has been returned. (See Figure 10-31.) 


The Stop Grant Bus Cycle is defined as follows: 


M/IO# = 0, D/C# = 0, W/R# = 1, Address Bus 
= 0000 0010H (A, = 1), BE3#-BEO# = 1011, 
Data bus = undefined. 


The latency between a STPCLK# request and the 
Stop Grant bus cycle is dependent on the current 
instruction, the amount of data in the processor write . 
buffers, and the system memory performance. 


RDOY# [RRUAVAUARARASADARARARARALASALALAL RARER ERARAREREEEREEA APOE ERLAER ERASER EAERSARASRR ARERR RRR, 


BRDY# 


** From Processor 
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Figure 10-30. Restarted Write Cycle 
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Figure 10-31. Stop Grant Bus Cycle 


10.2.12 BUS CYCLE RESTART 


In a multi-master system another bus master may 
require the use of the bus to enable the Intel486 
processor to complete its current bus request. In this 
situation the Intel486 processor will need to restart 
its bus cycle after the other bus master has complet- 
ed its bus transaction. 


A bus cycle may be restarted if the external system 
asserts the backoff (BOFF#) input. The Intel486 
processor samples the BOFF # pin every clock. The 
Intel486 processor will immediately (in the next 
clock) float its address, data and status pins when 
BOFF # is asserted (see Figures 10-29 and 10-34). 
Any bus cycle in progress when BOFF # is asserted 
is aborted and any data returned to the processor is 
ignored. The same pins are floated in response to 
BOFF# as are floated in response to HOLD. HLDA 
is not generated in response to BOFF#. BOFF# 
has higher priority than RDY # or BRDY #. If either 
RDY # or BRDY # are returned in the same clock as 
BOFF#, BOFF # takes effect. 


The device asserting BOFF # is free to run any cy- 
cles it wants while the Intel486 processor bus is in 
its high impedance state. If backoff is requested af- 
ter the Intel486 processor has started a cycle, the 
new master should wait for memory to return RDY # 


or BRDY # before assuming control of the bus. Wait- 
ing for ready provides a handshake to insure that the 
memory system is ready to accept a new cycle. If 
the bus is idle when BOFF# is asserted, the new 
master can start its cycle two clocks after issuing 
BOFF #. | 


The external memory can view BOFF # in the same 
manner as BLAST #. Asserting BOFF # tells the ex- 
ternal memory system that the current cycle is the 
last cycle in a transfer. 


The bus remains in the high impedance state until 
BOFF# is negated. Upon negation, the Intel486 
processor restarts its bus cycle by driving out the 
address and status and asserting ADS#. The bus 
cycle then continues as usual. 


Asserting BOFF # during a burst, BS8# or BS16# 
cycle will force the Intel486 processor to ignore data 
returned for that cycle only. Data from previous cy- 
cles will still be valid. For example, if BOFF # is as- 
serted on the third BRDY# of a burst, the Intel486 
processor assumes the data returned with the first 
and second BRDY # is correct and restarts the burst 
beginning with the third item. The same rule applies 
to transfers broken into multiple cycle by BS8# or 
BS16#. : 
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Asserting BOFF # in the same clock as ADS#¥ will 
cause the Intel486 processor to float its bus in the 
next clock and leave ADS# floating low. Because 
ADS # is floating low, a peripheral may think that a 
new bus cycle has begun even-though the cycle was 
aborted. There are two possible solutions to this 
problem. The first is to have all devices recognize 
_ this condition and ignore ADS# until ready comes 


intel. 


guarantees that ADS# will not be floating low. This | 
is only necessary in systems where BOFF # may be 
asserted in the same clock as ADS#. . 


10.2.13 BUS STATES 


A bus state diagram is shown in Figure 10-32. A de- . 
scription of the signals used in the diagram is given 


back. The second approach is to use a “two clock’ 
backoff: in the first clock AHOLD is asserted, and in 
the second clock BOFF# is asserted. This 


in Table 10-10. 


(RDY# ASSERTED + (BRDY# e BLAST#)ASSERTED) e 
(HOLD + AHOLD + NO REQUEST) e 
BOFF# NEGATED 


REQUEST PENDING e« 
(RDY# ASSERTED + (BRDY# e BLAST#)ASSERTED) e 
HOLD NEGATED 
AHOLD NEGATED e 
BOFF# NEGATED °¢ 


REQUEST PENDING « 
HOLD NEGATED e 
AHOLD NEGATED e 
BOFF# NEGATED 


BOFF # 
ASSERTED 


. 9 
< 
BOFF# ASSERTED 


AHOLD NEGATED e 
BOFF# NEGATED ¢ 
(HOLD NEGATED *) 


BOF F# 
NEGATED 


*HOLD is only factored into this state transition if T, was entered while a 
non-cacheable, non-bursted, code prefetch was in progress. . 
Otherwise, ignore HOLD. 
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Figure 10-32. Bus State Diagram 


Table 10-10. Bus State Description 


Bus is idle. Address and status signals may be driven to undefined values, or the bus may be floated 


Ti 
to a high impedance state. 


First clock cycle of a bus cycle. Valid address and status are driven and ADS # is asserted. | | 


Second and subsequent clock cycles of a bus cycle. Data is driven if the cycle is a write, or data is 


expected if the cycle is a read. RDY # and BRDY # are sampled. 


First clock cycle of a restarted bus cycle. Valid address and status are driven and ADS # is asserted. 


Second and subsequent clock cycles of an aborted bus cycle. 
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10.2.14 FLOATING POINT ERROR HANDLING 
FOR THE INTEL486 DX, INTELDX2, 
AND INTELDX4 PROCESSORS 


The Intel486 DX, IntelDX2, and IntelDX4 processors 
provide two options for reporting floating point er- 
rors. The simplest method is to raise interrupt 16 
whenever an unmasked floating point error occurs. 
This option may be enabled by setting the NE bit in 
control register 0 (CRO). 


The Intel486 DX, IntelDX2, and IntelDX4 processors 
also provide the option of allowing external hard- 
ware to determine how floating point errors are re- 
ported. This option is necessary for compatibility 
with the error reporting scheme used in DOS based 
systems. The NE bit must be cleared in CRO to en- 
able user-defined error reporting. User-defined error 
reporting is the default condition because the NE bit 
is cleared on reset. 


Two pins, floating point error (FERR#) and ignore 
numeric error (IGNNE#), are provided to direct the 
actions of hardware if user-defined error reporting is 
used. The Intel486 DX, IntelDX2, and IntelDX4 proc- 
essors assert the FERR# output to indicate that a 
floating point error has occurred. FERR# corre- 
sponds to the ERROR # pin on the Intel387™ math 
coprocessor. However, there is a difference in the 
behavior of the two. 


In some cases FERR# is asserted when the next 
floating point instruction is encountered, and in other 
cases it is asserted before the next floating point 
instruction is encountered depending upon the exe- 
cution state of the instruction causing the exception. 


The following class of floating point exceptions drive 
FERR# at the time the exception occurs (i.e., before 
encountering the next floating point instruction). 


1. The stack fault, invalid operation, and denormal 
exceptions on all transcendental instructions, in- 
teger arithmetic instructions, FSQRT, FSEALE, 
FPREM(1), FXTRACT, FBLD, and FBSTP. 


2. Any exceptions on store instructions (including in- 
teger store instructions). 
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The following class of floating point exceptions drive 
FERR# only after encountering the next floating 
point instruction. 


1. Exceptions other than on all transcendental in- 
structions, integer arithmetic instructions, FSQRT, 
FSCALE, FPREM(1), FXTRACT, FBLD, and 
FBSTP. 


2. Any exception on all basic arithmetic, load, com- 
pare, and control instructions (i.e., all other in- 
structions). 


For both sets of exceptions above, the Intel387 
math coprocessor asserts ERROR# when the error 
occurs and does not wait for the next floating point 
instruction to be encountered. 


IGNNE# is an input to the Intel486 DX, IntelDX2, 
and IntelDX4 processors. When the NE bit in CRO is 
cleared, and IGNNE # is asserted, the Intel486 DX, 
IntelDX2, and IntelDX4 processors will ignore a user 
floating point error and continue executing floating 
point instructions. When IGNNE# is negated, the 
IGNNE# is an input to these processors that will 
freeze on floating point instructions which get errors 
(except for the control instructions FNCLEX, FNINIT, 
FNSAVE, FNSTENV, FNSTCW, FNSTSW, FNSTSW 
AX, FNENI, FNDIS!I and FNSETPM). IGNNE# may 
be asynchronous to the Intel486 DX, IntelDX2, and 
IntelDX4 processor clock. 


In systems with user-defined error reporting, the 
FERR # pin is connected to the interrupt controller. 
When an unmasked floating point error occurs, an 
interrupt is raised. If IGNNE# is high at the time of 
this interrupt, the Intel486 DX, IntelDX2, and In- 
telDX4 processors will freeze (disallowing execution 
of a subsequent floating point instruction) until the 
interrupt handler is invoked. By driving the IGNNE # 
pin low (when clearing the interrupt request), the in- 
terrupt handler can allow execution of a floating 
point instruction, within the interrupt handler, before 
the error condition is cleared (by FNCLEX, FNINIT, 
FNSAVE or FNSTENV). If execution of a non-control 
floating point instruction, within the floating point in- 
terrupt handler, is not needed, the IGNNE# pin can 
be tied HIGH. 
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10.2.15 INTEL486 DX, INTELDX2, AND 
INTELDX4 PROCESSORS FLOATING 
POINT ERROR HANDLING IN 
AT-COMPATIBLE SYSTEMS 


The Intel486 DX, IntelDX2, and IntelDX4 processors 
provide special features to allow the implementation 
of an AT-compatible numerics error reporting 
scheme. These features DO NOT replace the exter- 
nal circuit. Logic is still required that decodes the 
OUT FO instruction and latches the FERR# signal. 
What follows is a description of the use of these 
Intel Processor features. 


The features provided by the Intel486 DX, IntelDxX2, 
and IntelDX4 processors are the NE bit in the Ma- 
chine Status Register, the IGNNE# pin, and the 
FERR # pin. 


The NE bit determines the action taken by the 
-Intel486 DX, IntelDX2, and IntelDX4 processors 
when a numerics error is detected. When set this bit 
signals that non-DOS compatible error handling will 
be implemented. In this mode the Intel486 DX, 
IntelDX2, and IntelDX4 processors take a software 
exception (16) if a numerics error is detected. 


If the NE bit is reset, the Intel486 DX, IntelDX2, and 
IntelDX4 processors use the IGNNE# pin to allow 
an external circuit to control the time at which non- 
control numerics instructions are allowed to execute. 
Note that floating point control instructions such as 
FNINIT and FNSAVE can be executed during a 
floating point error condition regardless of the state 
of IGNNE#. | 
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To process a floating point error in the DOS environ- 
ment the following sequence must take place: 


1. The error is detected by the Intel486 Dx, 
IntelDX2, and IntelDX4 processor that activates | 
the FERR# pin. 


2. FERR # is latched so that it can be cleared by the 
OUT FO instruction. 


3. The latched FERR# signal activates an interrupt 
at the interrupt controller. This interrupt is usually 
handled on IRQ13. | 


4. The Interrupt Service Routine (ISR) handles the 
error and then clears the interrupt by executing an 
OUT instruction to port FO. The address FO is de- 
coded externally to clear the FERR# latch. The 
IGNNE# signal is also activated by the decoder 
output. | ; 


5. Usually the ISR then executes an FNINIT instruc- 
tion or other control instruction before restarting 
the program. FNINIT clears the FERR# output. 


Figure 10-33 illustrates a sample circuit that will per- 
form the function described above. Note that this 
circuit has not been tested and is included as an 
example of required error handling logic. 


Note that the IGNNE# input allows non-control in- 
structions to be executed prior to the time the 
FERR# signal is reset by the Intel486 DX, IntelDX2, 
and IntelDX4 processors. This function is imple- 
mented to allow exact compatibility with the AT im- 
plementation. Most programs reinitialize the floating 
point unit before continuing after an error is detect- 
ed. The floating point unit can be reinitialized using 
one of the following four instructions: FCLEX, FINIT, 
FSAVE and FSTENV. | . 


Intel486™ PROCESSOR FAMILY 


1/0 PORT OFOH 
Address decoder 


Processor Bus 


Intel486™ 
Processor 


8259A 
Programmable 
Interrupt IGNNE# 


Controller 
INTR 


242202-B7 


Figure 10-33. DOS-Compatible Numerics Error Circuit 
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10.3 Enhanced Bus Mode Operation 


(Write-Back Mode) for the Write- — 


Back Enhanced IntelIDX2 
Processor 


This section describes how the processor bus oper- 
ation changes for the Enhanced Bus mode when the 
internal cache is configured in write-back mode. 


10.3.1 SUMMARY OF BUS DIFFERENCES 


The following is a list of the differences between the 
Enhanced and Standard Bus modes: 


1. Burst write capability is extended to four double- 
word burst cycles (for write-back cycles only) 


2. Four new signals: INV, WB/WT#, HITM#, and 
CACHE #, have been added to support the write- 
back operation of the internal cache. These sig- 
nals function the same as the equivalent signals 
on the Pentium™ OverDrive™ Processor pins. 


3. The SRESET signal has been modified so that it 
neither writes back, invalidates, nor disables the 
cache. Special test modes are also not initiated 
through SRESET. 


| | ; 
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4. The FLUSH# signal behaves the same as the 
WBINVD instruction. Upon assertion, FLUSH# 


writes back all modified lines, invalidates the 
cache, and issues two special bus cycles. 


5. The PLOCK# signal remains inactive in the En- 
hanced Bus mode. 


10.3.2 BURST CYCLES 


Figure 10-34 shows a basic burst read cycle of the 
Write-Back Enhanced IntelDX2 processor. In. the En- 
hanced Bus mode, both PCD and CACHE # are as- 
serted if the cycle is internally cacheable. The Write- 


- Back Enhanced IntelDX2 processor samples KEN # 


in the clock before the first BRDY #. If KEN# is re- 
turned active by the system, this cycle is trans- 
formed into a multiple-transfer cycle. With each data 
item returned from external memory, the data is 
‘cached only if KEN # is returned active again in the 
clock before the last BRDY# signal. Data is sam- 
pled only in the clock in which BRDY # is returned. If 
the data is not sent to the processor every clock, it 
causes a “‘slow burst’ cycle. 
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Figure 10-34. Basic Burst Read Cycle 
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10.3.2.1 Non-Cacheabie Burst Operation 


If CACHE # is asserted on a read cycle, it indicates 
that the processor will follow with BLAST # high if 
KEN # is returned active. However, the converse is 
not true. The Write-Back Enhanced IntelDX2 proc- 
essor may elect to read-burst data, which are identi- 
fied as non-cacheable by either CACHE # or KEN#. 
In this case, BLAST # is also high in the same cycle 
as the first BRDY# (in clock four). To improve per- 
formance, the memory controller should try to com- 
plete the cycle as a burst cycle. 


The assertion of CACHE # on a write cycle signifies 
a replacement or snoop write-back cycle. These cy- 
cles consist of four doubleword transfers (either 
bursts or non-burst). The signals KEN# and 
WB/WT # are not sampled during write-back cycles 
because the processor does not attempt to redefine 
the cacheability of the line. 


10.3.2.2 Burst Cycle Signal Protocol 


The signals from ADS # through BLAST #, which are 
shown in Figure 10-34, have the same function and 
timing in both Standard and Enhanced Bus modes. 
Burst cycles can be up to 16-bytes long (four aligned 
doublewords) and can start with any one of the four 
doublewords. The sequence of the addresses are 
determined by the first address and the sequence 
follows the order shown previously in Table 10-8. 
The burst order for reads is the same as the burst 
order for writes. (See section 10.2.4.2, “Burst and 
Cache Line Fills.’’) 


An attempted line fill, which is caused by a read 
miss, is indicated by the assertion of CACHE# and 
W/R# to low. For a line fill to occur, the system 
must assert KEN # twice: one clock prior to the first 
BRDY # and one clock prior to last BRDY #. It takes 
only one assertion of KEN # to mark the line as non- 
cacheable. 
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A write-back cycle of a cache line, due to replace- 
ment or snoop, is indicated by the assertion of 
CACHE # low and W/R# high. KEN# has no effect 
during write-back cycles. CACHE # is valid from the 
assertion of ADS# through the clock in which the 
first RDY # or BRDY # is returned. CACHE # is inac- 
tive at all other times. PCD behaves the same in 
Enhanced Bus mode as in Standard Bus mode, ex- 
cept that it is low during write-back cycles. 


The Write-Back Enhanced IntelDX2 processor sam- 
ples WB/WT # once, in the same clock as the first 
BRDY #. This sampled value of WB/WT# is com- 
bined with PWT to bring the line into the internal 
cache, either as a write-back line or write-through 
line. 


10.3.3 CACHE CONSISTENCY CYCLES 


The system performs snooping to maintain cache 
consistency. Snoop cycles can be performed under 
AHOLD, BOFF#, or HOLD, described in Table 
10-11. 


The snoop cycle begins by checking whether a par- 
ticular cache line has been ‘‘cached” and invali- 
dates the line based on the state of the INV pin. If 
the Write-Back Enhanced IntelDX2 processor is 
configured in Enhanced Bus mode, the system must 
drive INV high to invalidate a particular cache line. 
The Write-Back Enhanced IntelDX2 processor does 
not have an output pin to indicate a snoop hit to an 
S-state line or an E-state line. However, the Write- 
Back Enhanced IntelDX2 processor will invalidate 
the line if the system snoop hits an S-state, E-state, 
or M-state line, provided INV was driven high during 
snooping. If INV is driven low during a snoop, a mod- 


- ified line will be written back to memory and will re- 


main in the cache as a write-back line; a write- 
through line also will remain in the cache as a write- 
through line. 
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Table 10-11. Snoop Cycles under AHOLD, BOFF#, or HOLD 


After asserting AHOLD or BOFF #, the external bus 
master driving the snoop cycle must wait for two 
clocks before driving the snoop address and assert- 
ing EADS#. If snooping is done under HOLD, the 
master performing the snoop must wait for at least 
one clock cycle before driving the snoop addresses 
and asserting EADS#. INV should be driven low 
during read operations to minimize invalidations, 
and INV should be driven high to invalidate a 
cache line during write operations. The Write- 
Back Enhanced IntelDX2 processor asserts HITM# 
if the cycle hits a modified line in the cache. This 
output signal becomes valid two clock periods after 
EADS # is valid on the bus. HITM# remains assert- 
ed until the modified line is written back and will re- 
‘main asserted until the RDY# or BRDY# of the 
snoop cycle is returned. Snoop operations could in- 
terrupt an ongoing bus operation in both the Stan- 
dard Bus and Enhanced Bus modes. The Write- 
Back Enhanced IntelDX2 processor can accept 
EADS# in every clock period while in Standard 
Bus mode. In Enhanced Bus mode, the Write- 
Back Enhanced IntelDX2 processor can accept 
EADS# every other clock period or until a snoop 
hits an M-state line. The Write-Back Enhanced 
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‘AHOLD | Floats the address bus. ADS # is asserted under AHOLD onlly to initiate a-snoop write-back cycle. 
An ongoing burst cycle is completed under AHOLD. For non-burst cycles, a specific non-burst 
transfer (ADS #-RDY # transfer) is completed under AHOLD and fractured before the next 
assertion of ADS #. A snoop write-back cycle is reordered ahead of a fractured non-burst cycle 
and the non-burst cycle is completed only after the snoop write-back cycle is completed, provided 
there are no other snoop write-back cycles scheduled. 


Overrides AHOLD and takes effect in the next clock. On-going bus cycles will stop in the clock 
following the assertion of BOFF # and resume when BOFF # is de-asserted. A snoop is the only 
bus cycle the Write-Back Enhanced IntelDX2™ processor responds to under BOFF #. Snoop 
write-back will be reordered ahead of the backed-off cycle. The snoop write-back cycle begins 
after BOFF # is de-asserted followed by the backed-off cycle. 


HOLD is acknowledged only between bus cycles, except for a non-cacheable, non-bursted code 
prefetch cycle. In a non-cacheable, non-bursted code prefetch cycle, HOLD is acknowledged 

after the system returns RDY #. Once HOLD is active, the processor blocks all bus activities until 
the system releases the bus (by de-asserting HOLD). 


IntelDX2 processor will not accept any further snoop 
cycles input until the previous snoop write-back op- 
eration is completed. 


All write-back cycles adhere to the burst address se-. 
quence of 0-4-8-C. The CACHE#, PWT, and PCD 
output pins are asserted and the KEN# and 
WB/WT # input pins are ignored. Write-back cycles 
can be either bursted or non-bursted. All write-back 
operations write 16 bytes of data to memory corre- 
sponding to the modified line that hit during the 
snoop. Note that the Write-Back Enhanced In- 
telDX2 processor will accept BS8# and BS16# 
line-fill cycles, but not on replacement or snoop- 
forced write-back cycles. 


- 10.3.3.1 Snoop Collision with a Current Cache 


Line Operation 


The system can also perform snooping concurrent 
with a cache access and may collide with a current 
cache bus cycle. Table 10-12 lists some scenarios. 
and the results of a snoop operation colliding with an 
on-going cache fill or replacement cycle. : 


@ 
intel : Intel486T™ PROCESSOR FAMILY 


Table 10-12. Various Scenarios of a Snoop Write-Back Cycle 
Colliding with an On-Going Cache Fill or Replacement Cycle 


Arbi- 
tration 
Control 


Snoop to a Different 
Line from the Line 
Being Filled 


Complete fill if the 
cycle is bursted. Start 
snoop write-back. 


Snoop to the Line 
That Is Being 
Replaced 


Snoop to a Different 
Line from the Line 
Being Replaced 


Snoop to the Line 
That Is Being Filled 


Complete replacement 
write-back if it is a burst 
cycle. Initiate snoop 
write-back. 


If the replacement 
write-back is a non- — 
burst cycle, the snoop 
write-back cycle is re- 
ordered in front of the 
replacement cycle. 
After the snoop write- 
back, the replacement 
write-back is continued 
from the interrupt point. 


Read all line fill data 
into cache line buffer. 


Update cache only if 
snoop occurred with 
INV = “0” 


No write-back cycle 
because the line has 
not been modified yet 


Complete replacement 
write-back if the cycle 
is bursted. Processor 
does not initiate a 
snoop write-back, but 
asserts HITM # until 
the replacement write- 
back is completed. 


If the cycle is non- 
bursted, the snoop 
write-back will be 
reordered ahead of 
the line fill. 


After the snoop write- 
back cycle is 
completed, continue 
with line fill 


If the replacement 
cycle is non-bursted, 
the snoop write-back is 
re-ordered ahead of the 
replacement write-back 
cycle. The processor 
does not continue with 
the replacement write- 
back cycle. 


Stop fil 


Wait for BOFF # to go 
inactive 


Stop reading line fill 
data 


Wait for BOFF # to go 
inactive. Continue 
read from backed off 
point 


Stop replacement 
write-back 


Wait for BOFF # to be 
de-asserted 


Stop replacement 
write-back 


Wait for BOFF # to go 
inactive 


Initiate snoop write- 
back 


Processor does not 
continue replacement 
write-back 


Initiate snoop write- 
back 


Continue replacement 
write-back from point of 
interrupt 


Update cache only if 
snoop occurred with 
INV = “0” 


Do snoop write-back 


Continue fill from 
interrupt point 


HOLD is not acknowledged until the current bus cycle (i.e., the line operation) is completed, except 
for a non-cacheable, non-bursted code prefetch cycle. Consequently there can be no collision with 
the snoop cycles using HOLD, except as mentioned earlier. In this case the snoop write-back is re- 
ordered ahead of an on-going non-burst, non-cached code prefetch cycle. After the write-back 

cycle is completed, the code prefetch cycle continues from the point of interrupt. 
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10.3.3.2 Snoop under AHOLD 


Snooping under AHOLD begins by asserting AHOLD 
‘to force the Write-Back Enhanced IntelDX2 proces- 
sor to float its address bus, as shown in Figure 10- 
35. The ADS # for the write-back cycle is guaran- 


- teed to occur no sooner than the second clock 


following the assertion of HITM# (i.e., there is a 
dead clock between the assertion of HITM# and 
the first ADS# of the snoop write-back cycle.) 


When a line is written back, KEN#, WB/WT#, 
BS8#, and BS16# are ignored, and PWT and PCD 
are always low during write-back cycles. 


The driven next ADS# for a new cycle can occur 
immediately after the last RDY# or BRDY# of the 
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write-back cycle. The Write-Back Enhanced _In- 
telDX2 processor does not guarantee a dead clock 
between cycles unless the second cycle is a 
snoop-forced write-back cycle. This allows snoop- 
forced write-backs to. be backed off (BOFF#) when 
snooping under AHOLD. | 


HITM# is guaranteed to remain asserted until the 
RDY # or BRDY # corresponding to the last double- 
word of the write-back cycle is returned. HITM# will 
be de-asserted from the clock edge in which the last 
BRDY# or RDY# for the snoop write-back cycle is 
returned. The write-back cycle could be a bursted or 
non-bursted. In either case, 16 bytes of data corre- 
sponding to the modified line that has a snoop hit is 
written back. 
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Figure 10-35. Snoop Cycle Invalidating a Modified Line 
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Snoop under AHOLD Overlaying a Line-Fill Cycle 


The assertion of AHOLD during a line fill is allowed 
on the Write-Back Enhanced IntelDX2 processor. In 
this case, when a snoop cycle is overlaid by an on- 
going line-fill cycle, the chipset must generate the 
burst addresses internally for the line fill to com- 
plete, because the address bus will have the valid 
snoop address. The write-back mode is more com- 
plex compared to the write-through mode because 
of the possibility of a line being written back. Figure 
10-36 shows a snoop cycle overlaying a line-fill cy- 
cle, when the snooped line is not the same as the 
line being filled. 


In Figure 10-36, the snoop to an M-state line causes 
a snoop write-back cycle. The Write-Back Enhanced 
IntelDX2 processor will assert HITM# two clocks af- 
ter the EADS #, but will delay the snoop write-back 
cycle until the line fill is completed, because the line 
fill shown in Figure 10-36 is a burst cycle. In this 
figure, AHOLD is asserted one clock after ADS#. In 


* To Processor 
** Write-back from Processor 


intel486™ PROCESSOR FAMILY 


the clock after AHOLD is asserted, the Write-Back 
Enhanced IntelDX2 processor will float the address 
bus (not the Byte Enables). Hence, the memory con- 
troller must determine burst addresses in this period. 
The chipset must comprehend the special ordering 
required by all burst sequences of the Write-Back 
Enhanced IntelDX2 processor. HITM# is guaran- 
teed to remain active until the write-back cycle com- 
pletes. 


lf AHOLD continues to be asserted over the forced 
write-back cycle, the memory controller also must 
supply the write-back addresses to the memory. The 
Write-Back Enhanced IntelDX2 processor always 
runs the write-back with an address sequence of 
0-4-8-C. 


In general, if the snoop cycle overlays any burst cy- 
cle (not necessarily a line-fill cycle) the snoop write- 
back will be delayed because of the on-going burst 
cycle. First, the burst cycle goes to completion and 
only then does the snoop write-back cycle start. 
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Figure 10-36. Snoop Cycle Overlaying a Line-Fill Cycle 
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AHOLD Snoop Overlaying a Non-Burst Cycle 


When AHOLD overlays a non-burst cycle, snooping 
is based on the completion of the current non-burst- 
ed transfer (ADS #-RDY# transfer). Figure 10-37 
shows a snoop cycle under AHOLD overlaying a 
non-burst line-fill cycle. HITM# is asserted two 
clocks after EADS #, and the non-burst cycle is frac- 
tured after the RDY # for a specific single transfer is 
returned. The snoop write-back cycle is re-ordered 
ahead of an on-going non-burst cycle. After the 
write-back cycle is completed, the fractured non- 


- burst cycle will continue. The snoop write-back 


ALWAYS precedes the completion of a fractured cy- 
cle, regardless of the point at which AHOLD is de- 
asserted, and AHOLD must be de-asserted before 
the fractured non-burst cycle can compiete. 
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AHOLD Snoop to the Same Line That Is Being 
Filled | 


A system snoop will not cause a write-back cycle to 
occur if the snoop hits a line while the line is being 
filed. The processor does not allow a line to be 
modified until the fill is completed (and a snoop will 
only produce a write-back cycle for a modified line). 
Although a snoop to a line that is being filled will not 
produce a write-back cycle, the snoop still has an 
effect based on the following rules: 


1. The processor always snoops the line being filled. 


2. In all cases, the processor uses the operand that 
triggered the line fill. 


3. If the snoop occurs when INV = ‘‘1”,, the proces- 
sor never updates the cache with the fill data. 


4. lf the snoop occurs when INV = “Q”, the proces- 
sor loads the line into the internal cache. | 
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Figure 10-37. Snoop Cycle Overlaying a Non-Burst Cycle 
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Snoop during Replacement Write-Back 


If the cache contains valid data during a line fill, one 
of the cache lines may be replaced as determined 
by the LRU algorithm. If the line being replaced is 
modified, this line will be written back to maintain 
cache coherency. When a replacement write-back 
cycle is in progress, it might be necessary to snoop 
the line that is being written back. (See Figure 
10-38.) 


If the replacement write-back cycle is bursted and 
there is a snoop hit to the same line as the line that 
is being replaced, the on-going replacement cycle 
runs to completion. HITM# is asserted until the line 
is written back and the snoop write-back will not be 
initiated. In this case, the replacement write-back is 
converted to the snoop write-back, and HITM# is 
asserted and de-asserted without a specific ADS # 
to initiate the write-back cycle. 


If there is a snoop hit to a different line from the line 
being replaced, and if the replacement write-back 
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cycle is bursted, the replacement cycle goes to com- 
pletion. Only then is the snoop write-back cycle ini- 
tiated. | 


If the replacement write-back cycle is a non-burst 


_ cycle, and if there is a snoop hit to the same line as 


the line being replaced, it will fracture the replace- 
ment write-back cycle after the RDY# for the cur- 
rent non-burst transfer is returned. The snoop write- 
back cycle will be reordered in front of the fractured 
replacement write-back cycle and will be completed 
under HITM#. However, after AHOLD is de-assert- 
ed the replacement write-back cycle is not complet- 
ed. 


If there is a snoop hit to the line that is different from 
the one being replaced, the non-burst replacement 
write-back cycle will be fractured, and the snoop 
write-back cycle will be reordered ahead of the re- 
placement write-back cycle. After the snoop write- 
back is completed, the replacement write-back cycle 
will continue. 
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Figure 10-38. Snoop to the Line That is Being Replaced 
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10.3.3.3 Snoop under BOFF # 


BOFF # is capable of fracturing any transfer, burst or 
non-burst. The output pins (see Table 3-8 and Table 
3-9) of the Write-Back Enhanced IntelDX2 processor 


will be floated in the clock period following the as- | 


_sertion of BOFF #. If the system snoop hits a modi- 
fied line using BOFF#, the snoop write-back cycle 
will be reordered ahead of the current cycle. BOFF # 
must be de-asserted for the processor to perform a 
snoop write-back cycle and resume the fractured cy- 
cle. The fractured cycle resumes with a new ADS # 
and begins with the first uncompleted transfer. 
Snoops are permitted under BOFF #, but write-back 
cycles will not be started until BOFF # is de-assert- 
ed. Consequently, multiple snoop cycles can occur 
under a continuously asserted BOFF #, but only up 
to the first asserted HITM#. 


Snoop under BOFF# during Cache Line Fill 


As shown in Figure 10-39, BOFF# fractured the 
second transfer of a non-burst cache line-fill cycle. 


| intel ® 
The system begins snooping by driving EADS # and 
INV in clock six. The assertion of HITM# in clock 
eight indicates that the snoop cycle hit a modified 
line and the cache line will be written back to memo- 
ry. The assertion of HITM# in. clock eight and 
CACHE# and ADS# in clock ten identifies the be- 
ginning of the snoop write-back cycle. ADS# is 
guaranteed to be asserted no sooner than two clock 
periods after the assertion of HITM#. Write-back cy- 
cles always use the four-doubleword address se- 
quence of 0-4-8-C (burst or non-burst). The snoop 
write-back cycle begins upon the de-assertion of 
BOFF# with HITM# asserted throughout the dura- 
tion of the snoop write-back cycle. 


If the snoop cycle hits a line that is different from the 
line being filled, the cache line fill will resume after _ 
the snoop write-back cycle completes, as shown in . 
Figure 10-39. 7 | 
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Figure 10-39. Snoop under BOFF # during a Cache Line-Fill Cycle 
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An ADS# is always issued when a cycle resumes 
after being fractured by BOFF #. The address of the 
fractured data transfer is reissued under this ADS#, 
and CACHE # is not issued unless the fractured op- 
eration resumes from the first transfer (e.g., first 
doubleword). If the system asserts BOFF# and 
RDY# simultaneously, as shown in clock four on 
Figure 10-39, BOFF# dominates and RDY¥# is ig- 
nored. Consequently, the Write-Back Enhanced 
IntelDX2 processor accepts only up to the x4h 
doubleword, and the line fill resumes with the xOh 
doubleword. ADS # initiates the resumption of the 
line-fill operation in clock period 15. HITM # is de-as- 
serted in the clock period following the clock period 
in which the last RDY # or BRDY # of the write-back 
cycle is returned. Hence, HITM# is guaranteed to 
be de-asserted before the ADS# of the next cycle. 


Figure 10-39 also shows the system returning 
RDY # to indicate a non-burst line-fill cycle. Bursted 
cache line-fill cycles behave similar to non-bursted 
cache line-fill cycles when snooping using BOFF #. 
If the system snoop hits the same line as the line 
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being filled (burst or non-burst), the Write-Back En- 
hanced IntelDX2 processor will not assert HITM# 
and will not issue a snoop write-back cycle, because 
it did not modify the line, and the line fill resumes 
upon the de-assertion of BOFF #. However, the line 
fill will be cached only if INV is driven low during the 
snoop cycle. 


Snoop under BOFF # during Replacement Write- 
Back a 


If the system snoop under BOFF # hits the line that 
is currently being replaced (burst or non-burst), the 
entire line is written back as a snoop write-back line, 
and the replacement write-back cycle is not contin- 
ued. However, if the system snoop hits to a different 
line than the one currently being replaced, the re- 
placement write-back cycle will continue after the 
snoop write-back cycle has been completed. Figure 
10-40 shows a system snoop hit to the same line as 
the one being replaced (non-burst). 
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Figure 10-40. Snoop under BOFF # to the Line that is Being Replaced 
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10.3.3.4 Snoop under HOLD 


HOLD can only fracture a non-cacheable, non-burst- 
ed code prefetch cycle. For all other cycles, the 
Write-Back Enhanced IntelDX2 processor will not 
assert HLDA until the entire current cycle is complet- 
ed. If the system snoop hits a modified line under 
HLDA during a non-cacheable, non-burstable code 
prefetch, the snoop write-back cycle will be reor- 
dered ahead of the fractured cycle. The fractured 
~non-cacheable, non-bursted code prefetch resumes 
with an ADS # and begins with the first uncompleted 
transfer. Snoops are permitted under HLDA, but 
write-back cycles will not occur until HOLD is de-as- 
serted. Consequently, multiple snoop cycles are per- 
mitted under a continuously asserted HLDA only up 
to the first asserted HITM#. 
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Figure 10-41. Snoop under HOLD during Line Fill 
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Snoop under HOLD durirg Cache Line Fill 


As shown in Figure 10-41, HOLD (asserted in clock 
two) does not fracture the Dursted cache line-fill cy- 
cle until the line fill is completed (in clock five). Upon 
completing the line fill in c ock five, the Write-Back 
Enhanced IntelDX2 processor asserts HLDA and 
the system begins snooping by driving EADS# and 
INV in the following clock period. The assertion of 
HITM# in clock nine indicates that the snoop cycle 
has hit a modified line and the cache line is written 
back to memory. The assertion of HITM# in clock 
nine and CACHE # and AD5# in clock 11 identifies 
the beginning of the snoop write-back cycle. The 
snoop write-back cycle bejins upon the de-asser- 
tion of HOLD, and HITM# is asserted throughout 
‘the duration of the snoop write-back cycle. 
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lf HOLD is asserted during a non-cacheable, non- If the system snoop hits a modified line, the snoop 
bursted code prefetch cycle, as shown in Figure write-back cycle will begin after HOLD is released. 
10-42, the Write-Back Enhanced IntelDX2 processor After the snoop write-back cycle is completed, an 
will issue HLDA in clock seven (which is the clock ADS# is issued and the code prefetch cycle re- 
period in which the next RDY# is returned). sumes. 
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Figure 10-42. Snoop using HOLD during a Non-Cacheable, Non-Burstable Code Prefetch 
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Snoop under HOLD during Replacement 
Write-Back 


Collision of snoop cycles under a HOLD during the 
replacement write-back cycle can never occur, be- 
cause HLDA is asserted only after the replacement 
write-back cycle (bursted or non-bursted) is complet- 
ed. 


10.3.4 LOCKED CYCLES 


In both Standard and Enhanced Bus modes, the 
Write-Back Enhanced IntelDX2 processor architec- 
ture supports atomic memory access. A programmer 
can modify the contents of a memory variable and 
be assured that the variable will not be accessed by 
another bus master between the read of the variable 
and the update of that variable. This function is pro- 
vided for instructions that contain a LOCK prefix, 
and also for instructions that implicitly perform 
locked read modify write cycles. In hardware, the 
LOCK function is implemented through the LOCK# 
pin, which indicates to the system that the processor 
is performing this sequence of cycles, and that the 
processor should be allowed atomic access for the 
location accessed during the first locked cycle. 


A locked operation is a combination of one or more 


read cycles followed by one or more write cycles 
with the LOCK # pin asserted. Before a locked read 
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cycle is run, the processor first determines if the cor- 
responding line is in the cache. If the line is present 
in the cache, and is in an E or S state, it is invalidat- 
ed. If the line is in the M state, the processor does a 
write-back and then invalidates the line. A locked 
cycle to an M, S, or E state line is always forced out 
to the bus. If the operand is misaligned across cache 
lines, the processor could potentially run two write 
back cycles before starting the first locked read. In 
this case the sequence of bus cycles is: write back, 
write back, locked read, locked read, locked write 
and the final locked write. Note that although a total 
of six cycles are generated, the LOCK# pin will be 
active only during the last four cycles, as shown in 
Figure 10-43. 


LOCK # will not be de-asserted if AHOLD is assert- 
ed in the middle of a locked cycle. LOCK# will re- 
main asserted even if there is a snoop write-back 
during a locked cycle. LOCK# will be floated if 
BOFF-# is asserted in the middle of a locked cycle. 
However, it will be driven LOW again when the cycle 
restarts after BOFF #. Locked read cycles are never 
transformed into line fills, even if KEN# is returned 
active. If there are back to back locked cycles, | 
the Write-Back Enhanced IntelDX2 processor 
does not insert a dead clock between these two 
cycles. HOLD is recognized if there are two back to 
back locked cycles, and LOCK# will float when 
HLDA is asserted. 
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Figure 10-43. Locked Cycles (Back to Back) 
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10.3.4.1 Snoop/Lock Collision is completed the locked cycle will continue. But dur- 
ing all this time (including the write-back cycle), the 
If there is a snoop cycle overlaying a locked cycle, LOCK# signal remains asserted. 


the snoop write-back cycle will fracture the locked 

cycle. As shown in Figure 10-44, after the read por- Because HOLD is not acknowledged if LOCK# is 
tion of the locked cycle is completed, the snoop asserted, snoop-lock collisions are restricted to 
write-back starts under HITM#. After the write-back AHOLD and BOFF # snooping. 
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Figure 10-44. Snoop Cycle Overlaying a Locked Cycle 
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10.3.5 FLUSH OPERATION 


The Write-Back Enhanced IntelDX2 processor exe- 
cutes a flush operation when the FLUSH# pin is 
active, and no outstanding bus cycles, such as a line 
fill or write back, are being processed. In the En- 
hanced Bus mode, the processor first writes back all 
the modified lines to external memory. After . the 
write-back is completed, two special cycles are gen- 
~ erated, indicating to the external system that the 
write-back is done. All lines in the internal cache are 
invalidated after all the write back cycles are done. 
Depending on the number of modified lines in the 
cache, the flush could take a minimum of 1280 bus 
clocks (2560 processor clocks) and up to a maxi- 
mum of 5000+ bus clocks to scan the cache, per- 


form the write backs, invalidate the cache, and 


intel. 


| run the flush acknowledge cycles. FLUSH # is imple- 


mented as an interrupt in the Enhanced Bus mode, 
and will be recognized only on an instruction bound- — 
ary. Write-back system designs should look for the 
flush acknowledge cycles to recognize the end of 
the flush operation. Figure 10-45 shows the flush ~ 
operation of the Write-Back Enhanced IntelDX2 
processor, when configured in the Enhanced Bus © 
mode. 


If the processor is in Standard Bus mode, the proc- 
essor will not issue special acknowledge cycles in 
response to the FLUSH# input, although the inter- 
nal cache is invalidated. The invalidation of the 
cache in this case, takes only two bus clocks. 
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Figure 10-45. Flush Cycle 
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10.3.6 PSEUDO LOCKED CYCLES 


In Enhanced Bus mode, PLOCK# is always driven 
inactive for both burst and non-burst cycles. Hence, 
it is possible for other bus masters to gain control of 
the bus during operand transfers that take more 
than one bus cycle. A 64-bit aligned operand can be 
read in one burst cycle or two non-burst cycles if 
BS8# and BS16# are not asserted. Figure 10-46 
shows a 64-bit floating point operand or Segment 
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Descriptor read cycle, which is burst by the system 
returning BRDY #. 


10.3.6.1 Snoop under AHOLD during Pseudo- 
Locked Cycles 


AHOLD can fracture a 64-bit transfer if it is a non- 
burst cycle. If the 64-bit cycle is burst, as shown in 
Figure 10-46, the entire transfer goes to completion 
and only then does the snoop write-back cycle start. 


242202-D0 


Figure 10-46. Snoop under AHOLD Overlaying Pseudo-Locked Cycle 
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10.3.6.2 Snoop under Hold during Pseudo- | Write-Back Enhanced IntelDX2 processor writes 
: Locked Cycles back the modified line to memory (if snoop hits to 
modified line). If the 64-bit transfer is non-bursted, 
As shown in Figure 10-47, HOLD will not fracture the the Write-Back Enhanced IntelDX2 processor can 
64-bit burst transfer. The Write-Back Enhanced issue HLDA in between bus cycles for a 64-bit trans- 
IntelDX2 processor will not issue HLDA until clock fer. 
four. After the 64-bit transfer is completed, the 
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Figure 10-47. Snoop under HOLD Overlaying Pseudo-Locked Cycle 


2-226 | 


intel : Intel486™ PROCESSOR FAMILY 


10.3.6.3 Snoop under BOFF # Overlaying a snoop hit under BOFF #, the snoop write-back oper- 
Pseudo-Locked Cycle ation will begin after BOFF# is de-asserted. The 
64-bit write cycle resumes after the snoop write- 
BOFF # is capable of fracturing any bus operation. back operation completes. 
As shown in Figure 10-48, BOFF # fractured a cur- © 
rent 64-bit read cycle in clock four. If there is a 
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Figure 10-48. Snoop under BOFF # Overlaying a Pseudo-Locked Cycle 
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11.0 TESTABILITY 


Testing in the Intel486 processor can be divided into 

two categories: Built-in Self Test (BIST) and external 
~ testing. The BIST tests the non-random logic, con- 
trol ROM (CROM), translation lookaside buffer (TLB) 
and on-chip cache memory. External tests can be 
run on the TLB and the on-chip cache. The Intel486 
processor also has a test mode in which all outputs 
are tri-stated. 


11.1 Built-in Self Test (BIST) 


The BIST is initiated by holding the AHOLD (address 
hold) HIGH for 1 CLK after RESET goes from HIGH 
to LOW, as shown in Figure 9.6. No bus cycles will 
be run by the Intel486 processor until the BIST is 
concluded. Note that for the Intel486 processor, the 
RESET must be active for 15 clocks with or without 
BIST enabled for warm resets. SRESET should not 
be driven active (i.e., high) when entering or during 
BIST. See Table 11-1 for approximate clocks and 
maximum completion times for different Intel486 
processors. 


The results of BIST is stored in the EAX register. 
The Intel486 processor has successfully passed the 
BIST if the contents of the EAX register are zero. If 
the results in EAX are not zero, then the BIST has 
detected a flaw in the Intel486 processor. The In- 
tel486 processor performs reset and begins normal 
operation at the completion of the BIST. 


The non-random logic, control ROM, on-chip cache 
and translation lookaside buffer (TLB) are tested 
during the BIST. 


The cache portion of the BIST verifies that the 
cache is functional and that it is possible to read and 
write to the cache. The BIST manipulates test regis- 

ters TR3, TR4 and TR5 while testing the cache. 
_ These test registers are described in section 11.2, 
“On-Chip Cache Testing.” 


Processor 
Type 
Intel486™ SX 


Intel486 DX 
IntelDX2™ 
IntelDX4™ 
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The cache testing algorithm writes a value to each 
cache entry, reads the value back, and checks that 
the correct value was read back. The algorithm may 
be repeated more than once for each of the 512 
cache entries using different constants. The In- 
telDX4 processor has 1024 cache entries. All other 
Intel486 processors have 512 cache entries. 


The TLB portion of the BIST verifies that the TLB is 
functional and that it is possible to read and write to 
the TLB. The BIST manipulates test registers TR6 
and TR7 while testing the TLB. TR6 and TR7 are ~ 
described in section 11.3.2, “TLB Test Registers 
TR6 and TR7.” 


11.2 On-Chip Cache Testing 


The on-chip cache testability hooks are designed to 
be accessible during the BIST and for assembly lan- 
guage testing of the cache. | : 


The Intel486 processor contains a cache fill buffer 
and a cache read buffer. For testability writes, data 
must be written to the cache fill buffer before it can 
be written to a location in the cache. Data must be » 
read from a cache location into the cache read buff-. 
er before the processor can access the data. The 
cache fill and cache read buffer are both 128 bits 
wide.. . 


11.2.1 CACHE TESTING REGISTERS TR3, TR4 
AND TR5 


Figure 11-1 shows the three cache testing registers: 
the Cache Data Test Register (TR3), the Cache 


Status Test Register (TR4) and the Cache Control _ 


Test Register (TR5). External access to these regis- 
ters is provided through MOV reg, TREG and MOV 
TREG, reg instructions. 


Table 11-1. Maximum BIST Completion Time 


Tenet [sean | oem] 
Freq. Clocks Completions 

__intetsxam =| S0MHz |G milion | 24 miliseconds 

[somHiz | 06min | —_24miliseconde 
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Entry 
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Figure 11-1. Cache Test Registers (All Intel486™ Processors Except the IntelDX4T Processor) 
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Figure 11-2. IntelDX4™ Processor Cache Test Registers 
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TR3 
Cache Data 
Test Register 


TR4 
Cache Status 
Test Register 


TR5 
Cache Control 
Test Register 
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TR3 
Cache Data 
Test Register 
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Test Register 
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Cache Data Test Register: TR3 


The cache fill buffer and the cache read buffer can 
only be accessed through TR3. Data to be written to 
the cache fill buffer must first be written to TR3. Data 
read from the cache read buffer must be loaded into 
— TRS. 


TR3 is 32 bits wide while the cache fill and read 
buffers are 128 bits wide. 32 bits of data must be 
written to TR3 four times to fill the cache fill buffer. 
32 bits of data must be read from TR3 four times to 
empty the cache read buffer. The entry select bits in 


TR5 determine which 32 bits of data TR3 will access 


in the buffers. 
Cache Status Test Register: TR4 


TR4 handles tag; LRU and valid bit information dur- 
_ ing cache tests. TR4 must be loaded with a tag and 
a valid bit before a write to the cache. After a read 
from a cache entry, TR4 contains the tag and valid 
bit from that entry, and the LRU bits and four valid 
bits from the accessed set. Note that the IntelDX4 
processor has one less bit in the TR4 TAG field. 
(See Figure 11-1.) 


Cache Control Test Register: TR5 


TR5 specifies which testability operation will be per- 
formed and the set and entry within the set which 
will be accessed. The set select field determines 
which will be accessed. Note that the IntelDX4 proc- 
essor has an 8-bit set select field and 256 sets. All 
_ other Intel486 processors have a /7-bit set select 
field and 128 sets. (See Figure 11-1.) 


The function of the two entry select bits depends on 
the state of the control bits. When the fill or read 
buffers are being accessed, the entry select bits 
point to the 32-bit location in the buffer being ac- 
cessed. When a cache location is specified, the en- 
try select bits point to one of the four entries in a set. 
(Refer to Table 11-2.) 


Five testability functions can be performed on the 
cache. The two control bits in TR5 specify the oper- 
ation to be executed. The five operations are: 


1. Write cache fill buffer 

2. Perform a cache testability write 
3. Perform a cache testability read 
4. Read the cache read buffer 

5. Perform a cache flush 
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Table 11-2 shows the encoding of the two control 
bits in TR5 for the cache testability functions. Table 


_ 11-2 also shows the functionality of the entry and set 


select bits for each control operation. 


_ The cache tests attempt to use as much of the nor- 


mal operating circuitry as possible. Therefore, when 
cache tests are being performed, the cache must be 


_ disabled (the CD and NW bits in control register 0 


(CRO) must be set to 1 to disable the cache. (See 
section 7.0, “On-Chip Cache.’’): 


11.2.2 CACHE TESTING REGISTERS ron THE 
INTELDX4 PROCESSOR 


The cache testing registers for the IntelDX4 proces- 
sor differ slightly from the other Intel486 processors. 
TR3 in the IntelDX4 processor is identical to other 
Intel486 processors. TR4 in the IntelDX4 processor 
uses bits 31 to 12 for the Tag field, and bit 11 is 
unused. TR5 uses bits 11 to 4 for the Set Select 
field. The Test Registers for the IntelDX4 processor 
are shown in Figure 11-2. 


NOTE: 
Software written for the Intel486 processor 
for testing the cache using the Test Register 
will produce failures due to the changes in 
the TAG bits and Set Select bits for the In- 
telDX4 processor. 


. Rewrite the code to take into account the 20 
TAG bits and 8 Set Select bits to address 
the larger cache. 


11.2.3 CACHE TESTABILITY WRITE 


A testability write to the cache is a two step process. 
First the cache fill buffer must be loaded with 128 
bits of data and TR4 loaded with the tag and valid 
bit. Next the contents of the fill buffer are written to a. 
cache location. 


Loading the fill buffer is accomplished by first writing 
to the entry select bits in TR5 and setting the control 
bits in TR5 to 00. The entry select bits identify one of 


- four 32-bit locations in the cache fill buffer to put 32 


bits of data. Following the write to TR5, TR3 is writ- 
ten with 32 bits of data which are immediately 
placed in the cache fill buffer. Writing to TR3 initiates 
the write to the cache fill buffer. The cache fill buffer 
is loaded with 128 bits of data by writing to TR5 and 
TR3 four times using a different entry select location 
each time. 
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Tabie 11-2. Cache Control Bit Encoding and Effect of Control Bits 
on Entry Select and Set Select Functionality 


Entry Select Bits Function Set Select Bits 
Enable: Fill Buffer Write Select 32-bit location in 
Read Buffer Read fill/read buffer 


TR4 must be loaded with the tag and valid bit (bit 10 
in TR4) before the contents of the fill buffer are writ- 
ten to a cache location. The IntelDX4 processor 
has a 20-bit tag in TR4. All other Intel486 proces- 
sors use a 21-bit tag in TR4. 


The contents of the cache fill buffer are written to a 
cache location by writing TR5 with a control field of 
01 along with the set select and entry select fields. 
The set select and entry select field indicate the lo- 
cation in the cache to be written. The normal cache 
LRU update circuitry updates the internal LRU bits 
for the selected set. 


Note that a cache testability write can only be done 
when the cache is disabled for replaces (the CD bit 
is control register 0 is reset to 1). Care must be tak- 
en when directly writing to entries in the cache. If the 
entry is set to overlap an area of memory that is 
being used in external memory, that cache entry 
could inadvertently be used instead of the external 
memory. This is exactly the type of operation that 
one would desire if the cache were to be used as a 
high speed RAM. Also, a memory reference (or any 
external bus cycle) should not occur in between the 
move to TR4 and the move to TR8, in order to avoid 
having the value in TR4 change due to the memory 
reference. 


11.2.4 CACHE TESTABILITY READ 


A cache testability read is a two step process. First 
the contents of the cache location are read into the 
cache read buffer. Next the data is examined by 
reading it out of the read buffer. 


Reading the contents of a cache location into the 
cache read buffer is initiated by writing TR5 with the 
control bits set to 10 and the desired set select and 


Perform Cache Write Select an entry in set Select a set to write to 
Le Perform Cache Read Select an entry in set Select a set to read from 


two-bit entry select. The IntelDX4 processor has a 
seven-bit select field. All other Intel486 proces- 
sors have an eight-bit select field. In response to 
the write to TR5, TR4 is loaded with the 21-bit tag 
field and the single valid bit from the cache entry 
read. TR4 is also loaded with the three LRU bits and 
four valid bits corresponding to the cache set that 
was accessed. The cache read buffer is filled with 
the 128-bit value which was found in the data array 
at the specified location. 


The contents of the read buffer are examined by 
performing four reads of TR3. Before reading TR3 
the entry select bits in TR5 must loaded to indicate 
which of the four 32-bit words in the read buffer to 
transfer into TR3 and the control bits in TR5 must be 
loaded with 00. The register read of TR3 will initiate 
the transfer of the 32-bit value from the read buffer 
to the specified general purpose register. 


Note that it is very important that the entire 128-bit 
quantity from the read buffer and also the informa- 
tion from TR4 be read before any memory refer- 
ences are allowed to occur. If memory operations 
are allowed to happen, the contents of the read buff- 
er will be corrupted. This is because the testability 
operations use hardware that is used in normal 
memory accesses for the Intel486 processor wheth- 
er the cache is enabled or not. 


11.2.5 FLUSH CACHE 


The control bits in TR5 must be written with 11 to 
flush the cache. None of the other bits in TR5 have 
any meaning when 11 is written to the control bits. 
Flushing the cache will reset the LRU bits and the 
valid bits to 0, but will not change the cache tag or 
data arrays. 
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When the cache is flushed by writing to TR5 the 
special bus cycle indicating a cache flush to the ex- 
ternal system is not run. (See section 10.2.11, ‘‘Spe- 
cial Bus Cycles.”) For normal operation, the cache 
should be flushed with the instruction INVD (Invali- 
date Data Cache) instruction or the WBINVD (Write- 
back and Invalidate Data Cache) instruction. 


11.2.6 ADDITIONAL CACHE TESTING 
FEATURES FOR ENHANCED BUS 
(WRITE-BACK) MODE 


When in Enhanced Bus (write-back) mode, the 
Write-Back Enhanced IntelDX2 cache testing is a 
superset of the Standard Bus (write-through) mode. 
The additional cache testing features for the Write- 
Back Enhanced IntelDX2 processor are summarized 
below: 


There are two state bits per cache line (VH and VL) 
instead of one (V). The assignment of VH and VL 
state bits is listed in the Table 11-3. 


Table 11-3. State Bit Assignments for the 
Write-Back Enhanced IntelDX2\ Processor 
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When the Write-Back Enhanced IntelDX2 processor 
is in standard mode, the VH state assignments are 


identical to the V state assignments of the IntelDX2 
processor, which only support S and | states. 


~TR@ is the same as described above for both Stan- 
dard and Enhanced Bus modes. | 
TR4 is the same as described above for the 
IntelDX2 processor in Standard Mode. However, in 
Enhanced Bus mode, the cache line state bits of all 
four lines of the set are no longer available, to avoid 
a conflicting definition of state bits for the selected 
entry. The entry’s state bits are moved to positions 0 
and 1. Bit 10 is reserved for the possible extension 
of the tag. The changes to TR4 for Enhanced Bus 
mode are shown in Figure 11-3. 


_ TR5 is the same as it is for the IntelDX2 processor in 


standard mode. In Enhanced Bus mode, control bit . 
TR5.SLF (bit 13) is added to allow 1,1 of TR5.CTL 
(bits O and 1) to perform two different kinds of cache 
flushes. When SLF=0, CTL=1,1 performs a single- 
clock invalidate of all lines in the cache, which will 
not write back M-state lines. In the M state, if 
SLF=1, the specific line addressed will be written 
back and invalidated. The state of SLF is significant 
only when CTL=1,1. The changes to TR5 for En- 
hanced Bus mode are shown in Figure 11-4. 
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Figure 11-3. TR4 Definition for Standard and Enhanced Bus Modes for the 
Write-Back Enhanced IntelDX2™ Processor 
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Figure 11-4. TR5 Definition for Standard and Enhanced Bus Modes for the 
Write-Back Enhanced IntelDX2™ Processor 


11.3 Translation Lookaside Buffer 
(TLB) Testing 


The Intel486 processor TLB testability hooks are 
similar to those in the Intel386 processor. The testa- 
bility hooks have been enhanced to provide added 
- test features and to include new features in the 
Intel486 processor. The TLB testability hooks are 
designed to be accessible during the BIST and for 
assembly language testing of the TLB. 


11.3.1 TRANSLATION LOOKASIDE BUFFER 
ORGANIZATION 


The Intel486 processor TLB is 4-way set associative 
and has space for 32 entries. The TLB is logically 
split into three blocks shown in Figure 11-5. 


The data block is physically split into four arrays, 
each with space for eight entries. An entry in the 
data block is 22 bits wide containing a 20-bit physi- 
cal address and two bits for the page attributes. The 
page attributes are the PCD (page cache disable) bit 
and the PWT (page write-through) bit. Refer to sec- 
tion 7.6, ‘‘Page Cacheability,” for a discussion of the 
PCD and PWT bits. 


The tag block is also split into four arrays, one for 
each of the data arrays. A tag entry is 21 bits wide 
containing a 17-bit linear address and four protec- 
tion bits. The protection bits are valid (V), user/su- 
pervisor (U/S), read/write (R/W) and dirty (D). 


The third block contains eight three bit quantities 
used in the pseudo least recently used (LRU) re- 
placement algorithm. These bits are called the LRU 
bits. Unlike the on-chip cache, the TLB will replace a 
valid line even when there is an invalid line in a set. 


2-233 


Intel486™ PROCESSOR FAMILY 


Page 
Protection 
_ Bits 


Physical Page 
Address Attributes 


20 Bits 8 Entries 


Entries 


242202-D7 


Figure 11-5. TLB Organization 


11.3.2 TLB TEST REGISTERS TR6 AND TR7 


The two TLB test registers are shown in Figure 11-6. 
TR6 is the command test register and TR7 is the 
data test register. External access to these registers 
is provided through MOV reg, TREG and MOV 
TREG,reg instructions. 


Command Test Register: TR6 
TR6 contains the tag information and control infor- 
mation used in a TLB test. Loading TR6 with tag and 


control information initiates a TLB write or lookup 
test. | 
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TR6 contains three bit fields, a 20-bit linear address 
(bits 12-31), seven bits for the TLB tag protection 
bits (bits 5-11) and one bit (bit 0) to define the type 
of operation to be performed on the TLB. 


The 20-bit linear address forms the tag information 
used in the TLB access. The lower three bits of the 
linear address select which of the eight sets are ac- 
cessed. The upper 17 bits of the linear address form 
the tag stored in the tag array. 
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Figure 11-6. TLB Test Registers 


The seven TLB tag protection bits are described be- miss or hit during a TLB lookup operation. The 
low. forced miss or hit will occur regardless of the state 
V: The valid bit for this TLB entry of the actual bit in the TLB. The meaning of these 


airs of bits is given in Table 11-4. 
D,D#: The dirty bit for/from the TLB entry P g 
U,U#: |The user/supervisor bit for/from the TLB The operation bit in TR6 determines if the TLB test 
entry operation will be a write or a lookup. The function of 
W,W#: The read/write bit for/from the TLB entry the operation bit is given in Table 11-5. 


Two bits are used to represent the D, U/S and R/W 
bits in the TLB tag to permit the option of a forced 


Table 11-4. Meaning of a Pair of TR6 Protection Bits 


TR6 Protection Bit TRS Protection Bit # Meaning on Meaning on 
= (B = TLB Write Operation TLB Lookup Operation 


Undefined Miss any TLB TAG Bit B 
Write 0 to TLB TAG Bit B Match TLB TAG Bit B if 0 
Write 1 to TLB TAG Bit B Match TLB TAG Bit B if 1 
Undefined Match any TLB TAG Bit B 
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Table 11-5. TR6 Operation Bit Encoding | 


TLB Write 
TLB Lookup 


Data Test Register: TR7 


TR7 contains the information stored or read from the 
data block during a TLB test operation. Before a TLB 
test write, TR7 contains the physical address and 
the page attribute bits to be stored in the entry. After 
a TLB test lookup hit, TR7 contains the physical ad- 
dress, page attributes, LRU bits and entry location 
from the access. 


TR7 contains a 20-bit physical address (bits 12-31), 
PLD bit (bit 11), PWT bit (bit 10), and three bits for 
the LRU bits (bits 7-9). The LRU bits in TR7 are only 
used during a TLB lookup test. The functionality of 
TR7 bit 4 differs for TLB writes and lookups. The 
encoding of bit 4 is defined in Table 11-6 and Table 
11-7. Finally, TR7 contains two bits (bits 2-3) to 
specify a TLB replacement pointer or the location of 
a TLB hit. 


Table 11-6. Encoding of Bit 4 of TR7 on Writes 


Replacement Pointer Used on TLB 
Bit 4 Write 


Pseudo-LRU Replacement Pointer 
Data Test Register Bits 3:2 


A replacement pointer is used during a TLB write. 
The pointer indicates which of the four entries in an 
accessed set is to be written. The replacement 
pointer can be specified to be the internal LRU bits 
or bits 2-3 in TR7. The source of the replacement 
pointer is specified by TR7 bit 4. The encoding of bit 
4 during a write is given by Table 11-6. 


Note that both testability writes and lookups affect 
the state of the internal LRU bits regardless of the 
replacement pointer used. All TLB write operations 
(testability or normal operation) cause the written 
entry to become the most recently used. For exam- 
ple, during a testability write with the replacement 
pointer specified by TR7 bits 2-3, the indicated en- 
try is written and that entry becomes the most re- 
cently used as specified by the internal LRU bits. 


There are two TLB testing operations: write entries 
into the TLB, and perform TLB lookups. One major 
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enhancement over TLB testing in the Intel386 proc- 


essor is that paging need not be disabled while exe- 
cuting testability writes or lookups. 


Note that any time one TLB set contains the same 
linear address in more than one of its entries, look- 
ing up that linear address will give unpredictable re- 
sults. Therefore a single linear address should not 
be written to one TLB set more than once. 


Table 11-7. Encoding of Bit 4 of TR7 on Lookups 


is : Meaning after TLB Lookup Operation 
TLB Lookup Resulted in a Miss 
1 


TLB Lookup Resulted in a Hit 
11.3.3 TLB WRITE TEST 


To perform a TLB write TR7 must be loaded fol- 
lowed by a TR6 load. The register operations must 
be performed in this order because the TLB opera- 
tion is triggered by the write to TR6. 


TR7 is loaded with a 20-bit physical address and 
values for PCD and PWT to _be written to the data 
portion of the TLB. In addition, bit 4 of TR7 must be 
loaded to indicate whether to use TR7 bits 3-2 or the 
internal LRU bits as the replacement pointer on the 
TLB write operation. Note that the LRU bits in TR7 
are not used in a write test. 


TR6 must be written to initiate the TLB write opera- 
tion. Bit O in TR6 must be reset to zero to indicate a 
TLB write. The 20-bit linear address and the seven 
page protection bits must also be written in TR6 to 
specify the tag portion of the TLB entry. Note that 
the three least significant bits of the linear address 
specify which of the eight sets in the data block will 
be loaded with the physical address data. Thus only 
17 of the linear address bits are stored in the tag. 
array. 3 


11.3.4 TLB LOOKUP TEST 


To perform a TLB lookup it is only necessary to write. 
the proper tags and control information into TR6. Bit 
0 in TR6 must be set to 1 to indicate a TLB lookup. 
TR6 must be loaded with a 20-bit linear address and 
the seven protection bits. To force misses and 
matches of the individual protection bits on TLB 
lookups, set the seven protection bits as specified in 
Table 11-4. 


intel. 


A TLB lookup operation is initiated by the write to 
TR6. TR7 will indicate the result of the lookup opera- 
tion following the write to TR6. The hit/miss indica- 
tion can be found in TR7 bit 4 (see Table 11-7). 


TR7 will contain the following information if bit 4 indi- 
cated that the lookup test resulted in a hit. Bits 2-3 
will indicate in which set the match occurred. The 22 
most significant bits in TR7 will contain the physical 
address and page attributes contained in the entry. 
Bits 9-7 will contain the LRU bits associated with 
the accessed set. The state of the LRU bits is previ- 
ous to their being updated for the current lookup. 


If bit 4 in TR7 indicated that the lookup test resulted 
in a miss the remaining bits in TR7 are undefined. 


Again it should be noted that a TLB testability lookup 
operation affects the state of the LRU bits. The LRU 
bits will be updated if a hit occurred. The entry which 
was hit will become the most recently used. 


11.4 Tri-State Output Test Mode 


The Intel486 processor provides the ability to float 
all its outputs and bidirectional pins, except for the 
VOLDET pin in the IntelDX4 processor. This in- 
cludes all pins floated during bus hold as well as 
pins which are never floated in normal operation of 
the chip (HLDA, BREQ, FERR# and PCHK#). 
When the Intel486 processor is in the tri-state output 
test mode external testing can be used to test board 
connections. 


The tri-state test mode is invoked if FLUSH# is 
sampled active at the falling edge of RESET. 
FLUSH # is an asynchronous signal. When driven, 
FLUSH # should be asserted for 2 clocks before and 
2 clocks after RESET is de-asserted. If FLUSH# is 
driven synchronously, the tri-state output test mode 
is initiated by driving FLUSH# so that it is sampled 
active in the clock prior to RESET going low and 
ensuring that specified setup and hold times are 
met. The outputs are guaranteed to tri-state no later 
than 10 clocks after RESET goes low (see Figure 
9.6). The Intel486 processor remains in the tri-state 
test mode until the next RESET. 


11.5 Intel486 Processor Boundary 
Scan (JTAG) 


The Intel486 processor provides additional testabili- 
ty features compatible with the IEEE Standard Test 
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Access Port and Boundary Scan Architecture (IEEE 
Std. 1149.1). (Note that the Intel486 SX processor in 
PGA package does not have JTAG capability.) The 


~ test logic provided allows for testing to insure that 


components function correctly, that interconnec- 
tions between various components are correct, and 
that various components interact eomecty: on the 
printed circuit board. 


The boundary scan test logic consists of a boundary 
scan register and support logic that are accessed 
through a test access port (TAP). The TAP provides 
a simple serial interface that makes it possible to 
test all signal traces with only a few probes. 


The TAP can be controlled via a bus master. The 
bus master can be either automatic test equipment 
or a component (PLD) that interfaces to the four-pin 
test bus. 


11.5.1 BOUNDARY SCAN ARCHITECTURE 


The boundary scan test logic contains the following | 
elements: 


e Test access port (TAP), consisting of input pins 
TMS, TCK, and TDI; and output pin TDO. 


e TAP controller, which interprets the inputs on the 
test mode select (TMS) line and performs the 
corresponding operation. The operations per- 
formed by the TAP include controlling the instruc- 
tion and data registers within the component. 


e Instruction register (IR), which accepts instruction 
codes shifted into the test logic on the test data 
input (TDI) pin. The instruction codes are used to 
select the specific test operation to be performed 
or the test data register to be accessed. 


e Test data registers: The Intel486 processor con- 
tains three test data registers: Bypass register 
(BPR), Device Identification register (DID), and 
Boundary Scan register (BSR). 


The instruction and test data registers are separate 
shift-register paths connected in parallel and have a 
common serial data input and a common serial data 
output connected to the TAP signals, TDI and TDO, 
respectively. 


11.5.2 DATA REGISTERS 


The Intel486 processor contains the two required 
test data registers; bypass register and boundary 
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scan register. In addition, they also have a device 
identification register. | 


Each test data register is serially connected to TDI 
and TDO, with TDI connected to the most significant 
bit and TDO connected to the least significant bit of 
the test data register. | 


Data is shifted one stage (bit position within the reg- 
ister) on each rising edge of the test clock (TCK). In 
addition the Intel486 processor contains a runbist 
register to support the RUNBIST boundary scan in- 
struction. | 


11.5.2.1 Bypass Register 


_ The Bypass Register is a one-bit shift register that 
~ provides the minimal length path between TDI and 
TDO. This path can be selected when no test opera- 
tion is being performed by the component to allow 
rapid movement of test data to and from other 


SYSTEM 
LOGIC 
INPUT 


LOGIC 


TCK 


_BOUNDARY SCAN REGISTER 


SYSTEM 
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components on the board. While the bypass register 


is selected data is transferred from TDI to TDO with- 


out inversion. 


11.5.2.2 Boundary Scan Register 


The Boundary Scan Register is a single shift register 
path containing the boundary scan cells that are 
connected to all input and output pins of the Intel486 
processor. Figure 11-7 shows the logical structure of 
the boundary scan register. While output cells deter- 
mine the value of the signal driven on the corre- 
sponding pin, input cells only capture data; they do 
not affect the normal operation of the device. Data is 
transferred without inversion from TDI to TDO 
through the boundary scan register during scanning. 
The boundary scan register can be operated by the 
EXTEST and SAMPLE instructions. The boundary | 
scan register order is described in section 11.5.5 
“Boundary Scan Register Bits and Bit Orders.” 


SYSTEM 
BIDIRECTIONAL 
PIN 


SYSTEM 
3-STATE 
OUTPUT 
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Figure 11-7. Logical Structure of Boundary Scan Register 


2-238 


intel. 


11.5.2.1 Device Identification Register 


The Device Identification Register contains the man- 
ufacturer’s identification code, part number code, 
and version code. Table 11-8 lists the codes corre- 
sponding to the Intel486 processor. 


11.5.2.2 Runbist Register 


The Runbist Register is a one bit register used to 
report the results of the Intel486 processor BIST 
when it is initiated by the RUNBIST instruction. This 
register is loaded with a “1” prior to invoking the 
BIST and is loaded with “0” upon successful com- 
pletion. 


11.5.3 INSTRUCTION REGISTER 


The Instruction Register (IR) allows instructions to 
be serially shifted into the device. The instruction 
selects the particular test to be performed, the test 
data register to be accessed, or both. The instruc- 
tion register is four (4) bits wide. The most significant 
bit is connected to TDI and the least significant bit is 
connected to TDO. There are no parity bits associat- 
ed with the Instruction register. Upon entering the 
Capture-IR TAP controller state, the Instruction reg- 
ister is loaded with the default instruction “0001,” 
SAMPLE/PRELOAD. Instructions are shifted into 
the instruction register on the rising edge of TCK 
while the TAP controller is in the SHIFT-IR state. 
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11.5.3.1 Boundary Scan Instruction Set 


The Intel486 processor supports all three mandatory 
boundary scan instructions (BYPASS, SAMPLE/ 
PRELOAD, and EXTEST) along with two optional in- 
structions (IDCODE and RUNBIST). Table 11-9 lists 
the Intel486 processor boundary scan instruction 
codes. The instructions listed as PRIVATE cause 
TDO to become enabled in the Shift-DR state and 
cause ‘0‘ to be shifted out of TDO on the rising edge 
of TCK. Execution of the PRIVATE instructions will 
not cause hazardous operation of the Intel486 proc- 
essor. 


EXTEST The instruction code is “0000.” The 
EXTEST instruction allows testing of cir- 
cuitry external to the component package, 
typically board interconnects. It does so by 
driving the values loaded into the Intel486 
processor's boundary scan register out on 
the output pins corresponding to each 
boundary scan cell and capturing the val- 
ues on Intel486 processor input pins to be 
loaded into their corresponding boundary 
scan register locations. |/O pins are se- 
lected as input or output, depending on 
the value loaded into their control setting 
locations in the boundary scan register. 
Values shifted into input latches in the 
boundary scan register are never used by 
the internal logic of the Intel486 proces- 
sor. 
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Table 11-8. Boundary Scan Component Identification Codes 


Processor 


1st| Boundary 
“Type Intel=009H | Bit | Scan ID (Hex) 


Intel4867™ SX 000001 0100 | 00010 | waa tok _ 
processor (3.3V) 


Intel486 SX 000001 0100 | 00010 aad lead 
processor 
(3.3V, 2X CLK) 

Intel486 SX 000001 0100 | 00010 iiiaaiaien lad 
- | processor (5V) 


Intel486 SX 000001 0100 ; 00010 | 00000001 mt x028201 ae 

processor 

(5V, 2X CLK) 

_ | IntelSX2T™ eo 0100 | 00101 | 00000001 i lowe 3H 
_ | processor 


Intel486 DX XXXxX * 000001 0100 stab Seimei hentia 
processor (3.3V) 


Intel486 DX 000001 0100 | 00001 00000001001 x8281013H 
processor 
(3. 3V, 2X CLK) 
_|Intel486 DX 000001 0100 | 00001 | 00000001001 iacmiad 
processor (5V) 
| Intel486 DX | 000001 0100 | 00001 | 00000001001 fnssiaal 3H © 
processor | | 
(5V, 2X CLK) ' 
IntelDX2™ 1 000001 0100 | 00101 | 00000001001; 1 x8285013H 
processor (3.3V) ca 
| IntelDX2 saa ade 000001 0100 | 00101 | 00000001001 x0285013H 
(5V) 


Write-Back | 000001 0100 |; 00111 | 00000001001 x8287013H 
Enhanced IntelDX2 

processor (3.3V) 

Write-Back 000001 0100 |; 00111 | 00000001001 x0287013H 
Enhanced IntelDX2 

processor (5V) 

IntelDX4T™ XXXX* 000001 0100 ; 01000 | 00000001001 ieseuailt 
processor (3.3V) 


NOTE: 
*Contact Intel for details 
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Table 11-9. Boundary Scan Instruction Codes 


NOTE: 
After using the EXTEST instruction, the In- 
tel486 processor must be reset before nor- 
mal (non-boundary scan) use. . 


SAMPLE/ The instruction code is “0001.” The 
PRELOAD SAMPLE/PRELOAD has two functions 
that it performs. When the TAP control- 
ler is in the Capture-DR state, the 
SAMPLE/PRELOAD instruction allows 
a “‘snap-shot” of the normal operation 
of the component without interfering 
with that normal operation. The instruc- 
tion causes boundary scan register 
cells associated with outputs to sample 
the value being driven by the Intel486 
processor. it causes the cells associat- 
ed with inputs to sample the value be- 
ing driven into the Intel486 processor. 
On both outputs and inputs the sam- 
pling occurs on the rising edge of TCK. 
When the TAP controller is in the Up- 
date-DR state, the SAMPLE/PRELOAD 
instruction preloads data to the device 


IDODE 


BYPASS 


RUNBIST 


_ Intel486T™™ PROCESSOR FAMILY 


pins to be driven to the board by exe- 
cuting the EXTEST instruction. Data is 
preloaded to the pins from the bounda- 
ry scan register on the falling edge of 
TCK. 


The instruction code is “0010.” The 
IDCODE instruction selects the device 
identification register to be connected 
to TDI and TDO, allowing the device 
identification code to be shifted out of 
the device on TDO. Note that the de- 
vice identification register is not altered 
by data being shifted in on TDI. 


The instruction code is “1111.” The 
BYPASS instruction selects the bypass 
register to be connected to TDI and 
TDO, effectively bypassing the test log- 
ic on the Intel486 processor by reduc- 
ing the shift length of the device to one 
bit. Note that an open circuit fault in the 
board level test data path will cause the 
bypass register to be selected following 
an instruction scan cycle due to the 
pull-up resistor on the TDI input. This 
has been done to prevent any unwant- 
ed interference with the proper opera- 
tion of the system logic. 


The instruction code is “1000.” The 
RUNBIST instruction selects the one 
(1) bit runbist register, loads a value of 
4” into the runbist register, and con- 
nects it to TDO. It also initiates the built- 
in self test (BIST) feature of the 
Intel486 processor, which is able to de- 
tect approximately 60% of the stuck-at 
faults on the Intel486 processor. The 
Intel486 processor ac/dc specifications 
for Vcc and CLK must be met and 
RESET must have been asserted at 
least once prior to executing the 
RUNBIST boundary scan instruction. 
After loading the RUNBIST instruction 
code in the instruction register, the TAP 
controller must be placed in the Run- 
Test/Idle state. BIST begins on the first 
rising edge of TCK after entering the 
Run-Test/Idle state. The TAP controller 
must remain in the Run-Test/Idle state 
until BIST is completed. It requires 1.2 
million clock (CLK) cycles to complete 
BIST and report the result to the runbist 
register. After completing the 1.2 million 
clock (CLK) cycles, the value in the run- 
bist register should be shifted out on 
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TDO during the Shift-DR state. A value 
of “0” being shifted out on TDO indi- 
cates BIST successfully completed. A 
value of “1” indicates a failure oc- 
curred. After executing the RUNBIST 
instruction, the Intel486 processor must 
be reset prior to normal operation. 


11.5.4 TEST ACCESS PORT (TAP) 
CONTROLLER 


The TAP controller is a synchronous, finite state ma- 
chine. It controls the sequence of operations of 


cx Test—Logic—Reset 
Co ( Run-Test-Idle 


Update~DR 


Select-DR-Scan 


Capture-DR 


intel. 


the test logic. The TAP controller changes state only 


-_ in response to the following events: 


1. arising edge of TCK 
2. power-up. 


The value of the test mode state (TMS) input signal 
at a rising edge of TCK controls the sequence of the 
state changes. The state diagram for the TAP con- 
troller is shown in Figure 11-8. Test designers must 
consider the operation of the state machine in order 
to design the correct sequence of values to drive on 
TMS. 


0) 
0 


; 
Update-IR 


ee [: aa 


Figure 11-8. TAP Controller State Diagram 
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11.5.4.1 Test-Logic-Reset State 


In this state, the test logic is disabled so that normal 
operation of the device can continue unhindered. 
This is achieved by initializing the instruction register 
such that the IDCODE instruction is loaded. No mat- 
ter what the original state of the controller, the con- 
troller enters Test-Logic-Reset state when the TMS 
input is held high (1) for at least five rising edges of 
TCK. The controller remains in this state while TMS 
is high. The TAP controller is also forced to enter 
this state at power-up. 


11.5.4.2 Run-Test/Idie State 


A controller state between scan operations. Once 
in this state, the controller remains in this state as 
long as TMS is held low. In devices supporting the 
RUNBIST instruction, the BIST is performed during 
this state and the result is reported in the runbist 
register. For instruction not causing functions to exe- 
cute during this state, no activity occurs in the test 
logic. The instruction register and all test data regis- 
ters retain their previous state. When TMS is high 
and a rising edge is applied to TCK, the controller 
moves to the Select-DR state. 


11.5.4.3 Select-DR-Scan State 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-DR state, and a scan se- 
quence for the selected test data register is initiated. 
If TMS is held high and a rising edge is applied to 
TCK, the controller moves to the Select-IR-Scan 
state. 


The instruction does not change in this state. 


11.5.4.4 Capture-DR State 


In this state, the boundary scan register captures 
input pin data if the current instruction is EXTEST or 
SAMPLE/PRELOAD. The other test data registers, 
which do not have paraiiel input, are not changed. 


The instruction does not change in this state. 


When the TAP controlier is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exiti-DR state if TMS is high or the Shift-DR state if 
TMS is low. 
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11.5.4.5 Shift-DR State 


In this controller state, the test data register con- 
nected between TDI and TDO as a result of the cur- 
rent instruction shifts data one stage toward its serial 
output on each rising edge of TCK. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or remains in the Shift- 
DR state if TMS is low. 


11.5.4.6 Exit1-DR State 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 
nates the scanning process. If TMS is held low anda 
rising edge is applied to TCK, the controller enters 
the Pause-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


11.5.4.7 Pause-DR State . 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the test data 
register in the serial path between TDI and TDO. An 
example of using this state could be to allow a tester 
to reload its pin memory from disk during application 
of a long test sequence. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-DR 
state. 


11.5.4.8 Exit2-DR State 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 
nates the scanning process. If TMS is held low anda 
rising edge is applied to TCK, the controller enters 
the Shift-DR state. | 
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The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


11.5.4.9 Update-DR State 


The boundary scan register is provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in response to 
the EXTEST and SAMPLE/PRELOAD instructions. 
When the TAP controller is in this state and the 
boundary scan register is selected, data is latched 
onto the parallel output of this register from the shift- 
register path on the falling edge of TCK. The data 
held at the latched parallel output does not change 
other than in this state. 


All test data registers selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


11.5.4.10 Select-IR-Scan State 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous value. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-IR state, and a scan se- 
quence for the instruction register is initiated. If TMS 
is held high and a rising edge is applied to TCK, the 
controller moves to the Test-Logic-Reset state. 


’ The instruction does not change in this state. 


11.5.4.11 Capture-IR State 


In this controller state the shift register contained in 
_ the instruction register loads the fixed value “0001” 
on the rising edge of TCK. 


The test data register selected by the current in- 
struction retains its. previous value during this state. 
The instruction does not change in this state. When 
the controller is in this state and a rising edge is 
applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or the Shift-IR state if TMS 
is held low. 


11.5.4.12 Shift-IR State 


In this state the shift register contained in the in- 
struction register is connected between TDI and 
TDO and shifts data one stage towards its serial out- 
put on each rising edge of TCK. 
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The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or remains in the Shift-IR 
state if TMS is held low. 


11.5.4.13 Exit1-IR State 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low anda 
rising edge is applied to TCK, the controller enters 
the Pause-IR state. 


The test data register selected by the current in-. 
struction retains its previous value during this state. 
The instruction does not change in this state. 


11.5.4.14 Pause-IR State 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the instruction 
register. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-IR 
state. 


11.5.4.15 Exit2-IR State 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low anda 
rising edge is applied to TCK, the controller enters 
the Shift-IR state. 


The test data register selected by the current in- 


- Struction retains its previous value during this state. 


The instruction does not change in this state. 
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11.5.4.16 Update-iIR State 


The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 
ter path on the falling edge of TCK. Once the new 
instruction has been latched, it becomes the current 
instruction. 


Test data registers selected by the new current in- 
struction retain the previous value. 


11.5.5 BOUNDARY SCAN REGISTER BITS AND 
BIT ORDERS 


The boundary scan register contains a cell for each 
pin, as well as cells for control of I/O and tri-state 
pins. 


Intel486 SX and IntelSX2 Processor Boundary 
Scan Register Bits 


The following is the bit order of the Intel486 SX and 
IntelSX2 processor boundary scan register (from left 
to right and top to bottom. See notes below): 


TDO <— A2, A3, A4, A5, UP#, A6, A7, A8, AQ, 
A10, A11, A12, A13, A14, A15, A16, A17, A18, 
A19, A20, A21, A22, A23, A24, A25, A26, A27, 
A28, A29, A380, A31, DPO, DO, D1, D2, D3, D4, 
D5, D6, D7, DP1, D8, D9, D10, D11, D12, D13, 
D14, D15, DP2, Di6, D17, D18, D19, D20, 
D21, D22, D23, DP3, D24, D25, D26, D27, 
D28, D29, D30, D381, STPCLK#, Reserved, 
Reserved, SMI#, SMIACT#, SRESET, NMI, 
INTR, FLUSH#, RESET, A20M#, EADS#, 
PCD, PWT, D/C#, M/IO#, BES#, BE2#, 
BE1#, BEO#, BREQ, W/R#, HLDA, CLK, Re- 
served, AHOLD, HOLD, KEN #, RDY#, BS8#, 
BS16#, BOFF#, BRDY#, PCHK#, LOCK#, 
PLOCK#¥, BLAST#, ADS#, MISCCTL, 
BUSCTL, ABUSCTL, WRTL < TDI 
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Intel486 DX and IntelDX2 Processor Boundary 
Scan Register Bits 


The following is the bit order of the Intel486 DX and 
IntelDX2 processor boundary scan register (from left 
to right and top to bottom. See notes below): 


TDO < A2, A3, A4, A5, UP#, A6, A7, A8, AQ, 
A10, A11, A12, A13, A14, A15, A16, A17, A18, 
A19, A20, A21, A22, A23, A24, A25, A26, A27, 
A28, A29, A30, A31, DPO, DO, D1, D2, D3, D4, 
D5, D6, D7, DP1, D8, D9, D10, 011, D12, D13, 
D14, D15, DP2, D16, D17, D18, D19, D20, 
D21, D22, D23, DP3, D24, D25, D26, D27, 
D28, D29, D380, D31, STPCLK#, IGNNE#, 
FERR#, SMI#, SMIACT#, SRESET, NMI, 
INTR, FLUSH#, RESET, A20M#, EADS#, 
PCD, PWT, D/C#, M/lIO#, BE3#, BE2#, 
BE1#, BEO#, BREQ, W/R#, HLDA, CLK, Re- 
served, AHOLD, HOLD, KEN#, RDY#, BS8#, 
BS16#, BOFF#, BRDY#, PCHK#, LOCK#, 
PLOCK#, BLAST#, ADS#, MISCCTL, 
BUSCTL, ABUSCTL, WRTL <— TDI 


50-MHz Intel486 DX Processor Boundary Scan 
Register Bits 


The following is the bit order of the 50-MHz Intel486 
DX processor boundary scan register (from left to 
right and top to bottom. See notes below): 


TDO < A2, A3, A4, A5, UP#, A6, A7, A8, AQ, 
A10, Ai1, A12, A13, A14, A15, A16, A17, A18, 
A19, A20, A21, A22, A23, A24, A25, A26, A27, 
A28, A29, A380, A31, DPO, DO, Di, D2, D3, D4, 
D5, D6, D7, DP1, D8, D9, D10, D11, D12, D13, 
D14, D15, DP2, D16, D17, D18, D19, D20, 
D21, D22, D23, DP3, D24, D25, D26, D27, 
D28, D29, D380, D31, IGNNE#, FERR#, NMI, 
INTR, FLUSH#, RESET, A20M#, EADS#, 
PCD, PWT, D/C#, M/IO#, BES#, BE2#, 
BE1#, BEO#, BREQ, W/R#, HLDA, CLK, Re- 
served, AHOLD, HOLD, KEN#, RDY #, BS8#, 
BS16#, BOFF#, BRDY#, PCHK#, LOCK#, 
PLOCK#, BLAST#, ADS#, MISCCTL, 
BUSCTL, ABUSCTL, WRTL < TDI 
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Write-Back Enhanced intelIDX2 Processor 
Boundary Scan Register Bits 


The following is the bit order of the Write-Back En- 
hanced IntelDX2 processor boundary scan register 
(from left to right and top to bottom. See notes be- 
low): 


TDO <— A2, A3, A4, A5, UP#, A6, A7, A8, AQ, 
A10, A11, A12, A13, A14, A15, A16, A17, A18, 
A19, A20, A21, A22, A23, A24, A25, A26, A27, 
A28, A29, A30, A31, DPO, DO, D1, D2, D3, D4, 
D5, D6, D7, DP1, D8, D9, D10, D11, D12, D13, 
D14, D15, DP2, D16, D17, D18, D19, D20, 
D21, D22, D23, DP3, D24, D25, D26, D27, 
D28, D29, D30, D381, STPCLK#, IGNNE#, 
INV, CACHE#, FERR#, SMI#, WB/WT#, 
HITM#, SMIACT#, SRESET, NMI, INTR, 
FLUSH#, RESET, A20M#, EADS#, PCD, 
PWT, D/C#, M/lO#, BE3#, BE2#, BE1#, 
BEO#, BREQ, W/R#, HLDA, CLK, Reserved, 
AHOLD, HOLD, KEN#, RDY#, BS8#, 
BS164, BOFF#, BRDY#, PCHK#, LOCK#, 
PLOCK#, BLAST¥, ADS#, MISCCTL, 
BUSCTL, ABUSCTL, WRTL <— TDI 


IntelDX4 Processor Boundary Scan Register Bits 


The following is the bit order of the IntelDX4 proces- 
sor boundary scan register (from left to right and top 
to bottom. See notes below): 


TDO < A2, A3, A4, A5, UP#, A6, A7, A8, AQ, 
A10, A11, A12, A13, A14, A15, A16, A17, A18, 
A19, A20, A21, A22, A23, A24, A25, A26, A27, 
A28, A29, A30, A31, DPO, DO, D1, D2, D3, D4, 
D5, D6, D7, DP1, D8, D9, D10, D11, D12, D13, 
D1i4, D15, DP2, D16, D17, D18, D19, D20, 
D21, D22, D23, DP3, D24, D25, D26, D27, 
D28, D29, D380, D31, STPCLK#, IGNNE#, 
FERR#, SMI#, SMIACT#, SRESET, NMI, 
INTR, FLUSH#, RESET, A20M#, EADS#, 
PCD, PWT, D/C#, M/IO#, BES#, BE2#, 
BE1#, BEO#, BREQ, W/R#, HLDA, CLK, 
AHOLD, HOLD, KEN#, RDY#, CLKMUL, 
BS8#, BS16#, BOFF#, BRDY#, PCHK#, 


LOCK#, PLOCK#, BLAST#, ADS#, 
MISCCTL, BUSCTL, ABUSCTL, WRTL <— 
TDL 
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“Reserved” corresponds to no _ connect 
“NC” or “INC” signals on the Intel486 proc- 
essor. 


All the *CTL cells are control cells that are 
used to select the direction of bidirectional 
pins or tri-state output pins. If ‘1‘ is loaded 
into the control cell (*CTL), the associated 
pin(s) are tri-stated or selected as input. The 
following lists the control cells and their cor- 
responding pins. 


1. WRCTL controls the aos DO and DP3- 
DPO pins. 


2. ABUSCTL controls the A31—A2 pins. 


3. BUSCTL controls the ADS#, BLAST#, 

- PLOCK#, LOCK#, WR#, BEO#, BE14, 
BE2#, BE3#, MIO#, DC#, PWT, and 
PCD pins. 


4. MISCCTL controls the PCHK#, HLDA, and 
BREQ pins. 


11.5.6 TAP CONTROLLER INITIALIZATION 


The TAP controller is automatically initialized when a 
device is powered up. In addition, the TAP controller 
can be initialized by applying a high signal level on 
the TMS input for five TCK periods. 


11.5.7 BOUNDARY SCAN DESCRIPTION 
LANGUAGE (BSDL) FILES 


See Appendix D for an example ofa BSDL file for 
Intel486 processors. 
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12.0 DEBUGGING SUPPORT 


The Intel486 processor provides several features 
that simplify the debugging process. The three cate- 
gories of on-chip debugging aids are: 


1. Code execution breakpoint opcode (OCCH), 


2. Single-step capability provided by the TF bit in the 
flag register, and 


3. Code and data breakpoint capability provided by 
the Debug Registers DRO-3, DR6, and DR7. 


12.1 Breakpoint Instruction 


A single-byte-opcode breakpoint instruction is avail- 
able for use by software debuggers. The breakpoint 
opcode is OCCH, and generates an exception 3 trap 
when executed. In typical use, a debugger program 
can “plant” the breakpoint instruction at all desired 
code execution breakpoints. The single-byte break- 
point opcode is an alias for the two-byte general 
software interrupt instruction, INT n, where n=3. 
The only difference between INT 3 (OCCh) and INT n 
is that INT 3 is never IOPL-sensitive, while INT n is 
lOPL-sensitive in Protected Mode and Virtual 8086 
Mode. | 


12.2 Single-Step Trap 


If the single-step flag (TF, bit 8) in the EFLAG regis- 
ter is found to be set at the end of an instruction, a 
single-step exception occurs. The single-step ex- 
ception is auto vectored to exception number 1. Pre- 
cisely, exception 1 occurs as a trap after the instruc- 
tion following the instruction which set TF. In typical 
practice, a debugger sets the TF bit of a flag register 
image on the debugger’s stack. It then typically 
transfers control to the user program and loads the 
flag image with a signal instruction, the IRET instruc- 
tion. The single-step trap occurs after executing one 
instruction of the user program. 


Because exception 1 occurs as a trap (that is, it oc- 
curs after the instruction has already executed), the 
CS:EIP pushed onto the debugger’s stack points to 
the next unexecuted instruction of the program be- 
ing debugged. An exception 1 handler, merely by 
ending with an IRET instruction, can therefore effi- 
ciently support single-stepping through a user pro- 
gram. 
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12.3 Debug Registers 


| The Debug Registers are an advanced debugging 


feature of the Intel486 processor. They allow data 
access breakpoints as well as code execution 
breakpoints. Because the breakpoints are indicated 
by on-chip registers, an instruction execution break- 
point can be placed in ROM code or in code shared 
by several tasks, neither of which can be supported 
by the INT3 breakpoint opcode. 


The Intel486 processor contains six Debug Regis- 
ters, providing the ability to specify up to four distinct 
breakpoints addresses, breakpoint control options, 
and read breakpoint status. Initially after reset, 
breakpoints are in the disabled state. Therefore, no 
breakpoints will occur unless the debug registers are 
programmed. Breakpoints set up in the Debug Reg- 
isters are auto vectored to exception number 1. 


12.3.1 LINEAR ADDRESS BREAKPOINT 
REGISTERS (DRO-DR3) 


Up to four breakpoint addresses can be specified by 
writing into Debug Registers DRO—DR3, shown in 
Figure 12-1. The breakpoint addresses specified are 
32-bit linear addresses. Intel486 processor hard- 
ware continuously compares the linear breakpoint 
addresses in DRO-DR3 with the linear addresses 
generated by executing software (a linear address is 
the result of computing the effective address and 
adding the 32-bit segment base address). Note that 
if paging is not enabled the linear address equals the 
physical address. If paging is enabled, the linear ad- 
dress is translated to a physical 32-bit address by 
the on-chip paging unit. Regardiess of whether pag- 
ing is enabled or not, however, the breakpoint regis- 
ters hold linear addresses. 


12.3.2 DEBUG CONTROL REGISTER (DR7) 


A Debug Control Register, DR7 shown in Figure 
12-1, allows several debug control functions such as 
enabling the breakpoints and setting up other con- 
trol options for the breakpoints. The fields within the 
Debug Control Register, DR7, are as follows: 
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SERS EARNS 


indicates Intel Reserved; Do not define. 


NOTE: 
See section 4.2.7 


16 15 


' 


1615 


3 0 


DRO 
DR1 
DR2 
DR3 
~ — 1 DR4 


BRON 
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Figure 12-1. Debug Registers 


LENi (breakpoint length specification bits) 


A 2-bit LEN field exists for each of the four break- 
points. LEN specifies the length of the associated 
breakpoint field. The choices for data breakpoints 
are: 1 byte, 2 bytes, and 4 bytes. Instruction execu- 
tion breakpoints must have a length of 1 (LENi = 
00). Encoding of the LENi field is as described in 
Table 12-1. 


The LENi field controls the size of breakpoint field i 
by controlling whether all low-order linear address 
bits in the breakpoint address register are used to 
detect the breakpoint event. Therefore, all break- 
point fields are aligned; 2-byte breakpoint fields be- 
gin on Word boundaries, and 4-byte breakpoint 
fields begin on Dword boundaries. 


Figure 12-2 is an example of various size breakpoint 
fields. Assume the breakpoint linear address in DR2 
is OOOOOOO0SH. In that situation, the Figure 12-2 indi- 
cates the region of the breakpoint field for lengths of 
1, 2, or 4 bytes. 


- RWi (memory access qualifier bits) 
A 2-bit RW field exists for each of the four break- 
points. The 2-bit RW field specifies the type of usage 


which must occur in order to activate the associated 
breakpoint. 
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Table 12-1. LENi Encoding 


Usage of Least 
Significant Bits in 
Breakpoint Address 
Register i, (i= 0-3) 


All 32-bits used to - 
specify a single-byte 
breakpoint field. 


|A1-—A31 used to 
specify a two-byte, 
word-aligned 
breakpoint field. AO 
in Breakpoint 
Address Register is 
not used. 


10 |Undefined—do not 
use this encoding 


11 4 bytes A2-A31 used to 
specify a four-byte, 
dword-aligned 
breakpoint field. AO 
and A1 in Breakpoint 
Address Register are 
not used. 


Breakpoint 
Field Width 


DR2=00000005H; LEN2 = 00B 
31 : 0 


00000000H 


DR2=00000005H; LEN2 = 01B 


00000008H 
00000004H 


<— bkpt fid2 > 
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Figure 12-2. Size Breakpoint Fields 
Table 12-2. RW Encoding 


RW 
Usage Causing Breakpoint 


Instruction execution only 
Data writes only 

Undefined-do not use this encoding 
Data reads and writes only 


RW encoding 00 is used to set up an instruction 
execution breakpoint. RW encodings 01 or 11 are 
used to set up write-only or read/write data break- 
points. 
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Note that instruction execution breakpoints are 
taken as faults (j.e., before the instruction exe- 
cutes), but data breakpoints are taken as traps 
(i.e., after the data transfer takes place). 


Using LENi and RWi to Set Data Breakpoint j 


A data breakpoint can be set up by writing the linear 
address into DRi (i = O-3). For data breakpoints, 
RWi can = 01 (write-only) or 11 (write/read). LEN 
can = 00, 01, or 11. 


lf a data access entirely or partly falls within the data 
breakpoint field, the data breakpoint condition has 
occurred, and if the breakpoint is enabled, an excep- 
tion 1 trap will occur. 


Using LENi and RWi to Set Instruction Execution 
Breakpoint i 


An instruction execution breakpoint can be set up by 
writing address of the beginning of the instruction 
(including prefixes if any) into DRi (i = O-3). RWi 
must = 00 and LEN must = 00 for instruction exe- 
cution breakpoints. 


If the instruction beginning at the breakpoint address 
is about to be executed, the instruction execution 
breakpoint condition has occurred, and if the break- 
point is enabled, an exception 1 fault will occur be- 
fore the instruction is executed. 


Note that an instruction execution breakpoint ad- 
dress must be equal to the beginning byte address 
of an instruction (including prefixes) in order for the 
instruction execution breakpoint to occur. 


GD (Global Debug Register access detect) 


The Debug Registers can only be accessed in Real 
Mode or at privilege level 0 in Protected Mode. The 
GD bit, when set, provides extra protection against 
any Debug Register access even in Real Mode or at 
privilege level 0 in Protected Mode. This additional 
protection feature is provided to guarantee that a 
software debugger can have full control over the De- 
bug Register resources when required. The GD bit, 
when set, causes an exception 1 fault if an instruc- 
tion attempts to read or write any Debug Register. 
The GD bit is then automatically cleared when the 
exception 1 handler is invoked, allowing the excep- 
tion 1 handler free access to the debug registers. 
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GE and LE (Exact data breakpoint match, global 
and local) — 


The breakpoint mechanism of the Intel486 proces- 
sor differs from that of the Intel386 processor. The 
Intel486 processor always does exact data break- 
point matching, regardless of GE/LE bit settings. 
Any data breakpoint trap will be reported exactly af- 
ter completion of the instruction that caused the op- 
erand transfer. Exact reporting is provided by forcing 
‘the Intel486 processor execution unit to wait for 
completion of data operand transfers before begin- 
ning execution of the next instruction. 


When the Intel486 processor performs a_ task 
switch, the LE bit is cleared. Thus, the LE bit sup- 
ports fast task switching out of tasks, that have en- 
abled the exact data breakpoint match for their task- 
local breakpoints. The LE bit is cleared by the In- 
tel486 processor during a task switch, to avoid hav- 
ing exact data breakpoint match enabled in the new 
task. Note that exact data breakpoint match must be 
re-enabled under software control. | 


The Intel486 processor GE bit is unaffected during a 
task switch. The GE bit supports exact data break- 
point match that is to remain enabled during all tasks 
~ executing in the system. 


Note that instruction execution breakpoints are al- 


ways reported exactly. 


Gi and Li (breakpoint enable, global and local) 


If either Gi or Li is set then the associated breakpoint — 


(as defined by the linear address in DRi, the length 
in LENi and the usage criteria in RWi) is enabled. If 
either Gi or Li is set, and the Intel486 processor de- 
tects the ith breakpoint condition, then the exception 
1 handler is invoked. 


When the Intel486 processor performs a task switch 
to a new Task State Segment (TSS), all Li bits are 
cleared. Thus, the Li bits support fast task switching 
out of tasks that use some task-local breakpoint reg- 
isters. The Li bits are cleared by the Intel486 proces- 
sor during a task switch, to avoid spurious excep- 
tions in the new task. Note that the breakpoints must 
be re-enabled under software control. 
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All Intel486 processor Gi bits are unaffected during a 


task switch. The Gi bits support breakpoints that are 
active in all tasks executing in the system. 


12.3.3 DEBUG STATUS REGISTER (DR6) . 


A Debug Status Register, DR6 shown in Figure 12-1, 
allows the exception 1 handler to easily determine 
why it was invoked. Note the exception 1 handler 
can be invoked as a result of one of several events: 


. DRO Breakpoint fault/trap. . 
. DR1 Breakpoint fault/trap. 
XDR2 Breakpoint fault/trap. 
. XDR3 Breakpoint fault/trap. 
. XSingle-step (TF) trap. 

. XTask switch trap. 


. XFault due to attempted debug register access 
when GD= 1. 


NOOR OD —= 


The Debug Status Register contains single-bit flags 
for each of the possible events invoking exception 1. 
Note below that some of these events are faults (ex- 
ception taken before the instruction is executed), 
while other events are traps (exception taken after 
the debug events occurred). 


The flags in DR6 are set by the hardware but never 
cleared by hardware. Exception 1 handler software 
should clear DR6 before returning to the user pro- 
gram to avoid future confusion in identifying the 
source of exception 1. 


The fields within the Debug Status Register, DR6, 
are as follows: 


Bi (debug fault/trap due to breakpoint 0-3) 
Four breakpoint indicator flags, BO-B3, correspond 


one-to-one with the breakpoint registers in DRO- 
DR3. A flag Bi is set when the condition described 


by DRi, LENi, and RWi occurs. 


If Gi or Li is set, and if the ith breakpoint is detected, 
the Intel486 processor will invoke the exception 1. 
handler. The exception is handled as a fault if an 
instruction execution breakpoint occurred, or as a 
trap if a data breakpoint occurred. 


= | 
intal. 
IMPORTANT NOTE: 

A flag Bi is set whenever the hardware de- 
tects a match condition on enabled break- 
point i. Whenever a match is detected on at 
least one enabled breakpoint i, the hard- 
ware immediately sets all Bi bits correspond- 
ing to breakpoint conditions matching at that 
instant, whether enabled or not. Therefore, 
the exception 1 handler may see that multi- 
ple Bi bits are set, but only set Bi bits corre- 
sponding to enabled breakpoints (Li or Gi 
set) are true indications of why the excep- 
tion 1 handler was invoked. 


BD (debug fault due to attempted register ac- 
cess when GD bit set) 


This bit is set if the exception 1 handler was invoked 
due to an instruction attempting to read or write to 
the debug registers when GD bit was set. If such an 
event occurs, then the GD bit is automatically 
cleared when the exception 1 handler is invoked, 
allowing handler access to the debug registers. 


BS (debug trap due to single-step) 
This bit is set if the exception 1 handler was invoked 


due to the TF bit in the flag register being set (for 
single-stepping). 
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BT (debug trap due to task switch) 


This bit is set if the exception 1 handler was invoked 


due to a task switch occurring to a task having an 


Intel486 processor TSS with the T bit set. Note the 
task switch into the new task occurs normally, but 
before the first instruction of the task is executed, 
the exception 1 handler is invoked. With respect to 
the task switch operation, the operation is consid- 
ered to be a trap. 


12.3.4 USE OF RESUME FLAG (RF) IN FLAG 
REGISTER 


The Resume Flag (RF) in the flag word can sup- 
press an instruction execution breakpoint when the 
exception 1 handler returns to a user program ata 
user address which is also an instruction execution 
breakpoint. 
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13.0 INSTRUCTION SET SUMMARY 


This section describes the Intel486 processor in- 
struction set. Detailed information on the CPUID in- 
struction can be found in Appendix B: Feature Deter- 
mination. Further details of the instruction encoding 
are then provided in section 13.1, which describes 
the entire encoding structure and the definition of all 
fields occurring within the Intel486 processor in- 
structions. | 


13.1 Instruction Encoding 


13.1.1 OVERVIEW 


All instruction encodings are subsets of the general 
instruction format shown in Figure 13-1. Instructions 
consist of one or-two primary opcode bytes, possibly 
an address specifier consisting of the ‘mod r/m” 
byte and “scaled index” byte, a displacement if re- 
quired, and an immediate data field if required. 


Within the primary opcode or opcodes, smaller en- 
coding fields may be defined. These fields vary ac- 
cording to the class of operation. The fields define 
such information as direction of the operation, size 
of the displacements, register encoding, or sign ex- 
tension. 


| 

intel . 
Almost all instructions referring to an operand in 
memory have an addressing mode byte following 
the primary opcode byte(s). This byte, the mod r/m. 
byte, specifies the address mode to be used. Certain 
encodings of the mod r/m byte indicate a second .- 
addressing byte, the scale-index-base byte, follows 


the mod r/m byte to fully specify the addressing 
mode. 


Addressing modes can include a displacement im- 
mediately following the mod r/m byte, or scaled in- 
dex byte. If a displacement is present, the possible 
sizes are 8, 16 or 32 bits. 


If the instruction specifies an immediate operand, 
the immediate operand follows any displacement 
bytes. The immediate operand, if specified, is always 
the last field of the instruction. 


Figure 13-1 illustrates several of the fields that can 
appear in an instruction, such as the mod field and 
the r/m field, but the figure does not show all fields. 


_ Several smaller fields also appear in certain instruc- 


tions, sometimes within the opcode bytes them- 
selves. Table 13-1 is a complete list of all fields ap- 
pearing in the Intel486 processor instruction set. Fol- 
lowing Table 13-1 are detailed tables for each field. 


TTTTTTTT|TTTTTTTT| mod TTT rm d32| 16| 8| none data32| 16/8 | none 


7 07 
NT Ne ee / 


opcode "mod r/m" 


(one or two bytes) byte 


(T represents an 
opcode bit.) 


0765320765320 
a ai 


ee 


NN oN 


"sei-b" address immediate 
byte displacement data 
(4, 2, 1 bytes (4, 2, 1 bytes 
or none) or none) 


register and address 
mode specifier 
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Figure 13-1. General Instruction Format 
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' Table 13-1. Fields within Intel486™ Processor Instructions 


NOTE: 


Field | Description Number of 
Name P Bits 
Pwo Specifies if Data is Byte or Full Size (Full Size is either 16 or 32 Bits) 


ea Specifies Direction of Data Operation 
bs | Specifies if an Immediate Data Field Must be Sign-Extended 
ad 


General Register Specifier 


Address Mode Specifier (Effective Address can be a General Register) 
3 for r/m 


Scale Factor for Scaled Index Address Mode a ae 
General Register to be used as Index Register f/ 3 | 
| base General Register to be used as Base Register 
Segment Register Specifier for CS, SS, DS, ES ae 


For Conditional Instructions, Specifies a Condition Asserted or a Condition 
Negated 


Segment Register Specifier for CS, SS, DS, ES, FS, GS 


Table 13-15 through Table 13-19 show encoding of individual instructions. 


13.1.2 32-BIT EXTENSIONS OF THE 
INSTRUCTION SET 


With the Intel486 processor, the 8086/80186/80286 
instruction set is extended in two orthogonal direc- 
tions: 32-bit forms of all 16-bit instructions are added 
to support the 32-bit data types, and 32-bit address- 
ing modes are made available for all instructions ref- 
erencing memory. This orthogonal instruction set ex- 
tension is accomplished having a Default (D) bit in 
the code segment descriptor, and by having 2 prefix- 
es to the instruction set. 


Whether the instruction defaults to operations of 16 
bits or 32 bits depends on the setting of the D bit in 
the code segment descriptor, which gives the de- 
fault length (either 32 bits or 16 bits) for both oper- 
ands and effective addresses when executing that 
code segment. In the Real Address Mode or Virtual 
8086 Mode, no code segment descriptors are used, 
but a D value of 0 is assumed internally by the In- 
tel486 processor when operating in those modes 
(for 16-bit default sizes compatible with the 8086/ 
80186/80286). 


Two prefixes, the Operand Size Prefix and the Effec- 
tive Address Size Prefix, allow overriding individually 
the Default selection of operand size and 


effective address size. These prefixes may precede 
any opcode bytes and affect only the instruction 
they precede. If necessary, one or both of the prefix- 
es may be placed before the opcode bytes. The 
presence of the Operand Size Prefix and the Effec- 
tive Address Prefix will toggle the operand size or 
the effective address size, respectively, to the value 
“opposite” from the Default setting. For example, if 
the default operand size is for 32-bit data operations, 
then presence of the Operand Size Prefix toggles 
the instruction to 16-bit data operation. As another 
example, if the default effective address size is 16 
bits, presence of the Effective Address Size prefix 
toggles the instruction to use 32-bit effective ad- 
dress computations. | . 


These 32-bit extensions are available in all Intel486 
processor modes, including the Real Address Mode 
or the Virtual 8086 Mode. In these modes the default 
is always 16 bits, so prefixes are needed to specify 
32-bit operands or addresses. For instructions with 
more than one prefix, the order of prefixes is unim- 
portant. 


Unless specified otherwise, instructions with 8-bit 


and 16-bit operands do not affect the contents of 
the high-order bits of the extended registers. 
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13.1.3 ENCODING OF INTEGER INSTRUCTION 
FIELDS 


Within the instruction are several fields indicating 
register selection, addressing mode and so on. The 


exact encodings of these fields are defined immedi- 


ately ahead. 


13.1.3.1 Encoding of Operand Length (w) Field 


For any given instruction performing a data opera- 
tion, the instruction is executing as a 32-bit operation 
or a 16-bit operation. Within the constraints of the 
operation size, the w field encodes the operand size 

as either one byte or the full operation size, as 
shown in the table below. , 


Table 13-2. Encoding of Operand Length 


_ (w) Field 
Operand Size 
during 16-Bit during 32-Bit 
Data Operations | Data Operations 
en ee 
16 Bits 


Operand Size 


13.1.3.2 Encoding of the General Register (reg). 
Field 


The general register is specified by the reg field, 


which may appear in the primary opcode bytes, or as 
the reg field of the ‘mod r/m” byte, or as the r/m 
field of the “mod r/m’’ byte. 


Table 13-3. Encoding of reg Field when the w 
Field Is Not Present in Instruction 


Register Register 
‘Selected during | Selected during 
16-Bit Data 32-Bit Data 
Operations Operations 


reg Field 
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Table 13-4. Encoding of reg Field when the w 
Field Is Present in Instruction 


Register Specified by reg Field 
during 16-Bit Data Operations: 


: pre Function of w Field 


Register Specified by reg Field 
during 32-Bit Data Operations 


pre Function of w Field 


13.1.3.3 Encoding of the Segment Register 
(sreg) Field | 


The sreg field in certain instructions is a 2-bit field 
allowing one of the four 80286 segment registers to 
be specified. The sreg field in other instructions is a 
3-bit field, allowing the Intel486 processor FS and 
GS segment registers to be specified. 


do not use 
do not use 


13.1.3.4 Encoding of Address Mode 


Except for special instructions, such as PUSH or 
POP, where the addressing mode is pre-determined, 
the addressing mode for the current instruction is 
specified by addressing bytes following the primary 
opcode. The primary addressing byte is the “mod r/ 
m’’ byte, and a second byte of addressing informa- 
tion, the “s-i-b” (scale-index-base) byte, can be 
specified. 
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The s-i-b byte (scale-index-base byte) is specified 
when using 32-bit addressing mode and the “mod r/ 
m” byte has r/m = 100 and mod = 00, 01 or 10. 
When the sib byte is present, the 32-bit addressing 
mode is a function of the mod, ss, index, and base 
fields. 


The primary addressing byte, the “mod r/m’’ byte, 
also contains three bits (shown as TTT in Figure 13- 
1) sometimes used as an extension of the primary 
opcode. The three bits, however, may also be used 
as a register field (reg). 


When calculating an effective address, either 16-bit 
addressing or 32-bit addressing is used. 16-bit ad- 
dressing uses 16-bit address components to calcu- 
late the effective address while 32-bit addressing 
uses 32-bit address components to calculate the ef- | 
fective address. When 16-bit addressing is used, the 
“mod r/m” byte is interpreted as a 16-bit addressing 
mode specifier. When 32-bit addressing is used, the 
“mod r/m’’ byte is interpreted as a 32-bit addressing 
mode specifier. 


Tables 13-7, 13-8, and 13-9 define all encodings of 


all 16-bit addressing modes and 32-bit addressing 
modes. 
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Table 13-7. Encoding of 16-Bit Address Mode with “mod r/m” Byte 


| Effective Address 


DS:[BX+ SI] 
DS:[BX + DI] 
SS:[BP + SI] 
SS:[BP + Dl] 
DS:[SI] 
DS:[DIJ 
DS:d16 
DS:[BX] 
DS:([BX+ S!I+d8] 
DS:[BX+ DI +d8] 
SS:[BP + SI+ d8] 
SS:[BP + DI+ d8] 
DS:[S!+ d8] 


DS:[DI + d8] 


SS:[BP + d8] 
DS:[BX + d8] 


Register Specified by r/m during 
| 16-Bit Data Operations 
Function of w Field 
mod r/m 
i (when w=0) | (whenw=1)_ 
AL AX 
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Effective Address 


DS:[BX + Si+ d16] 
DS:[BX + DI+d16] 
SS:[BP + Si+d16] 
SS:[BP + Di+d16] 
DS:[Si+ d16] 
DS: [DI + d16] 
SS:[BP + d16] 
DS: [BX + d16] 
register—see below 


register—see below 


register—see below 


register—see below 


register—see below 


register—see below 


register—see below 


register—see below 


Register Specified by r/m during 
| 32-Bit Data Operations 


Function of w Field 
AL BAX 


ECX | 


mod r/m 


a | s 
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Table 13-8. Encoding of 32-Bit Address Mode with “mod r/m” Byte (No “s-i-b” Byte Present) 


DS:[EAX] DS:[EAX + d32] 
DS:[ECX] DS: [ECX + d32] 
DS:[EDX] DS:[EDX + d32] 
DS: [EBX] DS: [EBX + d32] 
s-i-b is present s-i-b is present 
DS:d32 SS:[EBP + d32] 
DS:[ESI] DS:[ESI+ d32] 
DS:{EDI] DS:[EDI+ d32] 


DS:([EAX + d8] register—see below 
DS:[ECX + d8] register—see below 
DS:[EDX + d8] register—see below 
DS:[EBX + d8] register—see below 


s-i-b is present register—see below | 


SS:[EBP + d8] register—see below 
DS:{ESI + dg] , register—see below 


DS:[EDI+ d8] register—see below 
' Register Specified by reg or r/m Register Specified by reg or r/m 
during 16-Bit Data Operations: during 32-Bit Data Operations: | 


Function of w Field Function of w Field 
mod r/m mod r/m : | 
(when w=0) | (when w=1) | (when w=0) | 


AL : AL EAX 
CL 
DL 
BL 
AH 
CH 
DH 
BH 
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Table 13-9. Encoding of 32-Bit Address Mode (“mod r/m” Byte and “s-hot Byte Present) 


DS:[EAX + (scaled index)] 
DS:[ECX + (scaled index)] 
DS:[EDX + (scaled index)] 
DS: [EBX + (scaled index)] 
SS:[ESP + (scaled index)] 
DS:[d32 + (scaled index)] 

DS:[ESI+ (scaled index)] 


DS: [EDI + (scaled index)] 


DS:[EAX + (scaled index) + d8] 
DS:[ECX + (scaled index) + d8] 
DS:[EDX + (scaled index) + d8] 
DS:[EBX + (scaled. index) + d8] 
SS:[ESP + (scaled index) + d8] 
SS:[EBP + (scaled index) + d8] 
DS:{ESI+ (scaled index) + d8] 
DS:({EDI + (scaled index) + d8] 


_DS:[EAX + (scaled index) + d32] 
DS:[ECX + (scaled index) + d32] 
DS:[EDX + (scaled index) + d32] 
DS:(EBX + (scaled index) + d32] 

SS:[ESP + (scaled index) + d32] 

SS:[EBP + (scaled index) + d32] 

DS:[ES!I+ (scaled index) + d32] 

DS:[EDI + (scaled index) + d32] 

NOTE: 


Mod field in “mod r/m” byte; ss, index, base fields in 
“*s-i-b” byte. | 
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Scale Factor 7 
ee a a Se 
Index — 


**IMPORTANT NOTE: 

When index field is 100, indicating “no index register,’ then 
ss field MUST equal 00. If index is 100 and ss does not 
equal 00, the effective address is undefined. 


intel. 


13.1.3.5 Encoding of Operation Direction (d) 
Field 


In many two-operand instructions the d field is pres- 
ent to indicate which operand is considered the 
source and which is the destination. 


Table 13-10. Encoding of Operation 
Direction (d) Field 


de Direction of Operation 


Register/Memory <— Register “‘reg”’ Field 
Indicates Source Operand; ‘‘mod r/m” or 

“mod ss index base” Indicates Destination 
Operand 


Register <— Register/Memory ‘‘req’’ Field 
Indicates Destination Operand; ‘mod r/m’” 
or ‘‘mod ss index base”’ Indicates Source 

Operand 


13.1.3.6 Encoding of Sign-Extend (s) Field 


The s field occurs primarily to instructions with im- 
mediate data fields. The s field has an effect only if 
the size of the immediate data is 8 bits and is being 
placed in a 16-bit or 32-bit destination. 


Table 13-11. Encoding of Sign-Extend (s) Field 


Effect on 
immediate immediate 
Data 8 Data 16/32 


[a None 


1 | Sign-Extend Data 8 to Fill 16-bit None 
or 32-bit Destination 


13.1.3.7 Encoding of Conditional Test (tttn) 
Field 


Effect on 


For the conditional instructions (conditional jumps 
and set on condition), tttn is encoded with n indicat- 
ing to use the condition (n= 0) or its negation (n= 1), 
and ttt giving the condition to test. 
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Table 13-12. Encoding of Conditional Test (tttn) 
Field 


Condition 


Overflow 

No Overflow 

Below/Not Above or Equal | 
Not Below/Above or Equal 
Equal/Zero 

Not Equal/Not Zero 

Below or Equal/Not Above ~ 


Not Below or Equal/Above 

Sign 

Not Sign 

Parity/Parity Even 

Not Parity/Parity Odd 

Less Than/Not Greater or Equal 
Not Less Than/Greater or Equal 


13.1.3.8 Encoding of Control or Debug or Test 
Register (eee) Field 


For the loading and storing of the Control, Debug 
and Test registers. | 


Table 13-13. Encoding of Control or Debug or 
Test Register (eee) Field 


When Interpreted as Control Register Field: 


Reg Name 


When Interpreted as Debug Register Field: 


When Interpreted as Test Register Field: 


Do not use any other encoding 
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Table 13-14. Encoding of Floating-Point Instruction Fields 


a fF& ®© NO — 


15-11 10 9 8 7 


13.1.4 ENCODING OF FLOATING POINT 
_ INSTRUCTION FIELDS 


Instructions for the FPU assume one of the five 
forms shown in the following table. In all cases, in- 
structions are at least two bytes long and begin with 
the bit pattern 11011B. 


OP = Instruction opcode, possible split into 
two fields OPA and OPB 


Memory Format 

00-32-bit real 

01-—32-bit integer 

10-64-bit real 

11-16-bit integer 

BS Pop 
O-Do not pop stack 
1-Pop stack after operation 

d= Destination 

| 0-—Destination is ST(0) 
1-Destination is ST(i) 

R XOR d = 0-Destination (op) Source 

R XOR d = 1-Source (op) Destination 


Register stack element / 


000 = Stack top 
001 = Second stack element 
111 = Eighth stack element 


mod (Mode field) and r/m (Register/Memory specifi- 
er) have the same interpretation as the correspond- 
ing fields of the integer instructions. 


s-i-b (Scale Index Base) byte and disp (displace- 
ment) are optionally present in instructions that have 
mod and r/m fields. Their presence depends on the 
values of mod and r/m, as for integer instructions. 


2-260 


6 5 


Second Byte 


OPB 


OPB 


43 2 1 0 


13.2 Clock Count Summary 


To calculate elapsed time for an instruction, multiply 
the instruction clock count, as listed in Table 13-15 
through Table 13-19 by the processor core clock pe- 
riod (e.g., 10 ns for a 100-MHz IntelDX4 processor). 


13.2.1 INSTRUCTION CLOCK COUNT 
ASSUMPTIONS 


The Intel486 processor instruction core clock count 
tables give clock counts assuming data and instruc- 
tion accesses hit in the cache. The combined in- 
struction and data cache hit rate is over 90%. © 


A cache miss will force the Intel486 processor to run 
an external bus cycle. The Intel486 processor 32-bit 
burst bus is defined as r-b-w. 


Where: 


r= The number of bus clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle in a non-burst read. ‘ 


b = The number of bus clocks for the second and 
subsequent cycles in a burst read. 


w = The number of bus clocks for a write. 


The clock counts in the cache miss penalty column 
assume a 2-1-2 bus. For slower buses add r-2 
clocks to the cache miss penalty for the first dword 
accessed. Other factors also affect instruction clock 
counts. 
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Instruction Clock Count Assumptions 


1; 


The external bus is available for reads or writes 
at all times. Else add bus clocks to reads until the 
bus is available. 


. Accesses are aligned. Add three core clocks to 


each misaligned access. 


. Cache fills complete before subsequent access- 


es to the same line. If a read misses the cache 
during a cache fill due to a previous read or pre- 
fetch, the read must wait for the cache fill to com- 
plete. If a read or write accesses a cache line still 
being filled, it must wait for the fill to complete. 


. If an effective address is calculated, the base 


register is not the destination register of the pre- 
ceding instruction. If the base register is the des- 
tination register of the preceding instruction add 
1 to the core clock counts shown. Back-to-back 
PUSH and POP instructions are not affected by 
this rule. 


. An effective address calculation uses one base 


register and does not use an index register. How- 
ever, if the effective address calculation uses an 
index register, 1 core clock may be added to the 
clock count shown. 


. The target of a jump is in the cache. If not, add r 


clocks for accessing the destination instruction of 
a jump. If the destination instruction is not com- 
pletely contained in the first dword read, add a 
maximum of 3b bus clocks. If the destination in- 
struction is not completely contained in the first 
16 byte burst, add a maximum of another r+ 3b 
bus clocks. 


10. 


11. 


12. 
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. If no write buffer delay, w bus clocks are added 


only in the case in which all write buffers are full. 


. Displacement and immediate not used together. 


If displacement and immediate used together, 1 
core clock may be added to the core clock 
count shown. | 


. No invalidate cycles. Add a delay of 1 bus clock 


for each invalidate cycle if the invalidate cycle 
contends for the internal cache/external bus 
when the Intel486 processor needs to use it. 


Page translation hits in TLB. A TLB miss will add 
13, 21 or 28 bus clocks + 1 possible core clock 
to the instruction depending on whether the Ac- 
cessed and/or Dirty bit in neither, one or both of 
the page entries needs to be set in memory. 
This assumes that neither page entry is in the 
data cache and a page fault does not occur on 
the address translation. 


No exceptions are detected during instruction 
execution. Refer to Interrupt core Clock Counts 
Table for extra clocks if an interrupt is detected. 


Instructions that read multiple consecutive data 
items (i.e. task switch, POPA, etc.) and miss the 
cache are assumed to start the first access ona 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+ 3b) 
bus clocks to the cache miss penalty. 
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Table 13-15. Clock Count Summary 


| | Penalty 
Instruction ; . if Cache 


INTEGER OPERATIONS 


MOV = Move: | 
reg1 to reg2 1000 100w : 11 reg1 reg2 


reg2 to reg1 | 1000 101w: 11 reg reg2 

memory to reg 1000 100w : mod reg r/m 

Immediate to reg - 1100 011w : 11000 reg : immediate data 
or . 1011W reg : immediate data 


Immediate to Memory | | 1100 01w: mod 000 r/m: displacement 
immediate 


Memory to Accumulator 1010 000w: full displacement 
Accumulator to Memory 1010 001w: full displacement 


MOVSX/MOVZX = Move with Sign/Zero Extension 
reg2 to reg1 0000 1111: 1011 z11w: 11 regi reg2 
memory to reg | 0000 11114 : 1014 z11w: mod reg r/m 
z__ instruction 
0 MOVZX 
-1 MOVSX 
PUSH = Push | 
reg. 11111111:11 110 reg 
or 01010 reg 
memory 11111111:mod110r/m 
immediate 0110 10s0 : immediate data 


1000 1111: 11000 reg 
01011 reg 
1000 1111: mod 000r/m 


XCHG = Exchange 
reg1 with reg2 1000 011w: 11 reg1 reg2 
Accumulator with reg 10010 reg 
Memory with reg 1000 011w: mod reg r/m 
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Penalty 
Instruction . Format | if Cache 


INTEGER OPERATIONS (Continued) 
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NOP = No Operation 


LEA = Load EA to Register 
no index register 
with index register 

Instruction 

ADD = Add 

ADC = Add with Carry 

AND = Logical AND 

OR = Logical OR 

SUB = Subtract 


SBB = Subtract with Borrow 
XOR = Logical Exclusive OR 


regi to reg2 

reg2 to regi 

memory to register 
register to memory 
immediate to register 
immediate to Accumulator 
immediate to memory 


Instruction 
INC = Increment 
DEC = Decrement 


reg 
or 
memory 


Instruction 
NOT = Logical Complement 
NEG = Negate 


reg 
memory 


1001 0000 
1000 1101 : mod reg r/m 


cnni 
000 
010 
100 
001 
101 
O11 
110 


OOTT TOOw: 11 regi reg2 
OOTT TO1w: 11 reg1 reg2 
OOTT TO1w: mod reg r/m 
OOTT TOOw : mod reg r/m 


1000 00sw : 11 TTT reg : immediate register 


OOTT T10w : immediate data 


1000 00sw : mod TTT r/m: immediate data 


Laas 
000 
001 


1111111w: 11 TTT reg 
O1TTT reg 
1111114wW:mod TTT r/m 


Ut 
010 
011 


1111 011w:11 TTT reg 
1111011w: mod TTT r/m 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
if Cache 
Miss 


Instruction. Format 


INTEGER OPERATIONS (Continued) 


CMP = Compare 
regi with reg2 0011 100w: 11 regi reg2 1 
reg2 with reg1 0011 101w: 11 reg1 reg2 1 
memory with register 0011 100w : mod reg r/m 2 
register with memory 0011 101w: mod reg r/m 2 
immediate with register 1000 O0sw: 11 111 reg : immediate data 1 
immediate with acc. 0011 110w : immediate data 1 
immediate with memory 1000 00sw : mod 111 r/m: immediate data 2 


TEST = Logical Compare 


regi and reg2 1000 010w: 11 regi reg2 
memory and register 1000 010w: mod reg r/m 
immediate and register 1111 011w: 11 000 reg : immediate data 


immediate and acc. 1010100w : immediate data 
1111 011w: 


Nm ——- — NN —- 


immediate and memory mod 000 r/m : immediate data 


MUL = Multiply (unsigned) 


acc. with register 1111 011w: 11 100 reg 
Multiplier-Byte 
Word 
Dword 
acc. with memory 1111011w: mod 100 r/m 
Multiplier-Byte 
Word 


Dword 


IMUL = Integer Multiply (unsigned) 


acc. with register 1111 011w: 11 101 reg 
Multiplier-Byte MN/MX,3 
Word MN/MX,3 
Dword MN/MxX,3 | 


acc. with memory 1111 011w: mod 101 r/m 


Multiplier-Byte MN/MX,3 
Word MN/MX,3 
Dword MN/MX,3 


regi withreg2 — 0000 1111 : 10101111: 11 reg1 reg2 
Multiplier-Byte MN/MX,3 
Word MN/MX,3 
Dword MN/MX,3 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
if Cache 
Miss — 


- Cache 
Hit 


instruction Format 


INTEGER OPERATIONS (Continued) 


IMUL = Integer Multiply (unsigned), (Continued) 
register with memory 0000 1111 : 10101111: mod reg r/m 


Multiplier-Byte . 1 
Word 1 
Dword 1 


0110 10s1 : 11 reg1 reg2 : immediate data 


MN/MX,3 
MN/MX,3 
MN/MX,3 


regi with imm. to reg2 


Multiplier-Byte 
Word 
Dword 


MN/MX,3 
MN/MX,3 
MN/MX,3 


mem. with imm. to reg. 


Multiplier-Byte 
Word 
Dword 


0110 10s1 : mod reg r/m: immediate data 


MN/MX,3 
MN/MX,3 
MN/MX,3 


For the IntelDX4™ Processor Only: 


IMUL = Integer Multiply (signed) 


acc. with register 


Multiplier-Byte 
Word 
Dword 


acc. with memory 


Multiplier-Byte 
Word 
Dword 


reg1 with reg2 
Multiplier-Byte 


Word 
Dword 


register with memory 
Multiplier-Byte 


Word 
Dword 


regi with imm. to reg2 
Multiplier-Byte 


Word 
Dword 


mem. with imm. to reg. 


Multiplier-Byte 
Word 
Dword 


1111 011w: 11101 reg 


1111 011w: 


0000 1111: 


0000 1111 


0110 10s1 


0110 10s1 


mod 101 r/m 


10101111: 11 reg1 reg2 


: 10101111 : mod reg r/m 


: 11 regi reg2 : immediate data 


: mod reg r/m: immediate data 


MN/MX,3 
MN/MX,3 
MN/MxX,3 | 


MN/MX,3 
MN/MX.3 | 
MN/MX.3 


MN/MX,3 
MN/MX,3 
MN/MX,3 


MN/MX,3 
MN/MX,3 
MN/MX,3 


MN/MX,3 
MN/MX,3 
MN/MX,3 


MN/MX,3 
MN/MX,3 
MN/MX,3 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
if Cache 
Miss 


Instruction Format 


INTEGER OPERATIONS (Continued) 


DIV = Divide (unsigned) 
acc. by register 1111 011w: 11110 reg 


Divisor-Byte 
Word 
Dword 


acc. by memory 


1111 011w: mod 110 r/m 


Divisor-Byte 
Word 
Dword 


IDIV = Integer Divide (signed) 
acc. by register 1111 011w: 11111 reg 


Divisor-Byte 
Word 
Dword 


-acc. by memory 1111 011w: mod 111 r/m 


Divisor-Byte 
Word 
Dword 


CBW = Convert Byte to Word 1001 1000 


‘CWD = Convert Word to Dword 1001 1001 ae a 
| Instruction TIT | 
ROL = Rotate Left 000 
_1|ROR = Rotate Right 001 
RCL = Rotate Through Carry Left 010 
RDR = Rotate Through Carry Right 011 
SHL/SAL = Shift Logical/ Arithmetic Left 100 
SHR = Shift Logical Right ; 101 
| SAR = Shift Arithmetic Right 111 
Not Through Carry (ROL, ROR, SAR, SHL, and SHR) 
| regby 1 1101 000w: 11 TTT reg 
memory by 1 1101 000w: mod TTT r/m 
reg by CL 1101 001w: 11 TTT reg 
memory by CL | 1101 001w: mod TTT r/m 
regbyimmediate count > - 1100 000w: 11 TTT reg: imm. 8-bit data 
mem by immediate count 1100 000w : mod TTT r/m: imm. 8-bit data 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
if Cache | 
Miss 


instruction Format 


INTEGER OPERATIONS (Continued) 
Through Carry (RCL and RCR) 


reg by 1 1101 000w: 11 TTT reg 
memory by 1 1101 000w : mod TTT r/m 
reg by CL 1101 001w: 11 TTT reg MN/MX,4 | 
memory by CL 1101 001w: mod TTT r/m MN/MxX,5 
reg by immediate count 1100 000w: 11 TTT reg: imm. 8-bit data MN/MX,4 
mem by immediate count 1100 000w : mod TTT r/m: imm. 8-bit data MN/MX,5 
Instruction TEL 
SHLD = Shift Left Double 100 
SHRD = Shift Right Double 101 
register with immediate 0000 1111: 10TT T1100: 11 reg2 reg 
:imm. 8-bit data 
memory with immediate 0000 1111: 10TT T7100: mod reg r/m 
> imm. 8-bit data 
register by CL 0000 1111: 10TT T101: 11 reg2 reg 
memory by CL 0000 1111: 10TT T101 : mod reg r/m 


XADD = Exchange and Add 
regi, reg2 0000 1111: 1100 000w: 11 reg2 regt 
memory, reg 0000 1111 : 1100 O00w : mod reg r/m 


CMPXCHG = Compare and Exchange 
regi, reg2 0000 1111: 1011 O00w: 11 reg2 reg1 
memory, reg 0000 1111 : 1011 000w: mod reg r/m 


Note: Times are jump taken/not taken 
Jeccc = Jump on cccc 


8-bit displacement 0111 tttn : 8-bit disp. 3/1 T/NT,23 
full displacement. 0000 1111 : 1000 tttn : full displacement 3/1 T/NT,23' 


Note: Times are jump taken/not taken 


SETcccc = Set Byte on cccc (Times are cccc true/false) 
reg 0000 1111: 1001 tttn: 11 000 reg 4/3 
memory ~ 0000 1111: 1001 tttn : mod 0000 r/m 3/4 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
if Cache 


J Instruction —. Format 


Miss 


CONTROL TRANSFER (within segment) (Continued) 


Mnemonic cccc Condition 
Overflow 
No Overflow 
Below/Not Above or Equal 
Not Below/Above or Equal 
Equal Zero 
Not Equal/Not Zero 
Below or Equal/Not Above 
Not Below or Equal/Above 
Sign 
Not Sign 
Parity/Parity Even 
Not Parity/Parity Odd 
Less Than/Not Greater or Equal 
Not Less Than/Greater or Equal 
Less Than or Equal/Greater Than 
Not Less Than or Equal/Greater Than 


LOOP = LOOP CX Times 1110 0010 : 8-bit disp. 7/6 


LOOPZ/LOOPE = Loop with Zero/Equal 
1110 0001 : 8-bit disp. 9/6 


L/NL,23 


L/NL,23 
L/NL,23 


T/NT,23 


LOOPNZ/LOOPNE = Loop While Not Zero 
1110 0000 : 8-bit disp. 9/ 


JCXZ = Jump on CX Zero 1110 0011 : 8-bit disp. 8/5 


JECXZ = Jump on ECX Zero 1110 0011 : 8-bit disp. 8/5 
(Address Size Prefix Differentiates JCXZ for JECXZ) 


JMP = Unconditional Jump (within segment) 


Short 1110 1011 : 8-bit disp. 
Direct 1110 1001 : full displacement 
Register Indirect 11111111:11100reg — 


1111 1111:mod100r/m 


Memory Indirect 


CALL = Call (within segment) 


Direct 1110 1000 : full displacement 7,23 
Register Indirect 1111 1111:11010 reg 7,23 
Memory Indirect 1111 1111: mod 010 reg 7 


RET = Return from CALL (within segment) 
1100 0011 


1100 0010: 


Adding Immediate to SP 16-bit disp. 


ENTER = Enter Procedure 1100 1000 : 16-bit disp., 8-bit level 


14 
17 
17+3L 


LEAVE = Leave Procedure 1100 1001 oe ae nd 


Level = 0 
Level = 1 
Level (L) > 1 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
if Cache 


Instruction Format 


MULTIPLE-SEGMENT INSTRUCTIONS 
MOV = Move 


reg. to segment reg. 1000 1110: 11 sreg3 reg 
memory to segment reg. 1000 1110: mod sreg3 r/m 
segment reg. to reg. 1000 1100: 11 sreg3 reg 


segment reg. to memory 1000 1100 : mod sreg3 r/m 


PUSH = Push 


segment reg. 000sreg 2110 
(ES, CS, SS, or DS) 


segment reg. (FS or GS) 
POP = Pop 


segment reg. 000sreg 2111 
(ES, CS, SS, or DS) 


segment reg. (FS or GS) 


0000 1111: 10 sreg3001 


0000 1111 : 10 sreg3001 


LDS = Load Pointer to DS 1100 0101 : mod reg r/m 6/12 7/10 


LFS = Load Pointer to FS 0000 1111 : 1011 0100: mod reg r/m 
LGS = Load Pointer to GS 0000 1111 : 1011 0101 : mod reg r/m 
LSS = Load Pointer to SS 0000 1111 : 1011 0010 : mod reg r/m 


CALL = Call oo 
Direct intersegment 1001 1010 : unsigned full offset, selector 


to same level 

thru Gate to same level 

to inner level, no parameters 

to inner level, x parameters (d) words 
to TSS 

thru Task Gate 


Indirect intersegment 11111111: mod011r/m 


to same level 
thru Gate to same level 
to inner level, no parameters 


to inner level, x parameters (d) words 
to TSS 
thru Task Gate 


RET = Return from CALL 
intersegment 1100 1010 


to same level 
to outet lever 


intersegment adding 1100 1010 : 16-bit disp. 
imm. to SP 


to same level 
to outer level 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 


Instruction 


Format 


MULTIPLE-SEGMENT INSTRUCTIONS (Continued) 


| JMP = Unconditional Jump 
Direct intersegment 


to same level 

thru Call Gate to same level 
thru TSS 

thru Task Gate 


Indirect intersegment 


to same level 

thru Call Gate to same level 
thru TSS 

thru Task Gate 


BIT MANIPULATION 


1110 1010 : unsigned full offset, selector 


11111111: mod 011 r/m 


BT = Test Bit 
register, immediate 0000 1111: 1011 1010: 11 100 reg: 
imm. 8-bit data 
memory, immediate 0000 1111: 1011 1010: mod 100 r/m: 
imm. 8-bit data 
regi, reg2 * 0000 1111 : 1010 0011 : 11 reg2 reg1 
memory, reg 0000 1111 : 1010 0011 : mod reg r/m 
Instruction | TIT 
BTS = Test Bit and Set 101 
BTR = Test Bit and Reset 110 
BTC = Test Bit and Compliment 111 
register, immediate 0000 1111: 1011 1010: 11 TTT reg 
: imm. 8-bit data 
memory, immediate 0000 1111: 1011 1010: mod TTT r/m 
imm. 8-bit data 
regi, reg2 0000 1111: 10TT T011: 11 reg2 regt 
memory, reg 0000 1111: 10TT T011 : mod reg r/m 


BSF = Scan Bit Forward 


regi, reg2 0000 1111: 1011 1100: 11 reg2 regi 
memory, reg 0000 1111: 1011 1100: mod reg r/m 

BSR = Scan Bit Reverse . 
regi, reg2 0000 1111: 1011 1101: 11 reg2 reg1 
0000 1111: 1011 1101: mod reg r/m 


memory, reg 
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Table 13-15. Clock Count Summary (Continued) 


Penaity 
if Cache 


Instruction Format 


Hi ey 


STRING INSTRUCTIONS 


CMPS = Compare Byte Word 1010 011w pk 
LODS = Load Byte/Word to AL/AX/EAX a 


ok 
o 


1010 111w 5 


MOVS = Move Byte/Word 1010 010w 
SCAS = Scan Byte/Word _ 1010 111w 


STOS = Store Byte/Word from AL/AX/EX 

1010 101w 5 
XLAT = Translate String 11010111 
REPEATED STRING INSTRUCTIONS 
Repeated by Count in CX or ECX (C= Count in CX or ECX) | 


REPE CMPS = Compare String 11110011: 1010011w 
(Find Non-match) 


—_ 
o> 


C=0 
C>0 


16, 17 
7+/7c 


REPNE CMPS = Compare String 11110010: 1010011w 
(Find Match) 


C=0 
C>0 


16,17 
7+7c 
REP LODS = Load String 11110010 : 1010 110w 


C=0 
C>0 


16, 18 
7+4c 


REP MOVS = Move String 11110010: 1010 010w 
C=0 


oh 


16, 19 


C= 1 

C> 1 12+3c 

REPE SCAS = Scan String 11110011: 1010 111W 
(Find Non-AL/AX/EAX) 


C=0 


C>0 7+5c 


Nh 


N —_ : 
oO Oo 


REPNE SCAS = Scan String 11110010: 1010 1411w 
(Find AL/AX/EAX) 


C=0 


C>0 7+5c 


io) 


REP STOS = Store String 11110010: 1010 101w 


C=0 
C>0 


7+4c 
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Table 13-15. Clock Count Summary (Continued) 


Instruction 


FLAG CONTROL — 


CLC = Clear Carry Flag 

STC = Set Carry Flag 

CMC = Complement Carry Flag 

CLD = Clear Direction Flag 

STD = Set Direction Flag 

CLI = Clear Interrupt Enable Flag 
| STI = Set Interrupt Enable Flag 

LAHF = Load AH into Flag 

SAHF = Store AH into Flag 

PUSHF = Push Flags 

POFF = Pop Flags 


DECIMAL ARITHMETIC 


‘AAA = ASCII Adjust to Add 
| AAS = ASCII Adjust for Subtract 
AAM = ASCII Adjust for Multiply 
AAD = ASCII Adjust for Divide 
DAA = Decimal Adjust for Add 


DAS = Decimal Adjust for Subtract 


Format 


1111 1000 
1111 1001 


11110101 


1111 1100 
1111 1101 
1111 1010 
1111 1011 
1001 1111 
1001 1110 
1001 1100 
1001 1101 


0011 0111 
0011 1111 
1101 0100 
1101 0101 
0010 0111 


0010 1111 


PROCESSOR CONTROL INSTRUCTIONS 


HLT = Halt 


11110100 


: 0000 1010 
: 0000 1010 


MOV = Move To and From Control/Debug/Test Registers 


CRO from register 
CR2/CR3 from register 
Reg from CRO-3 
DRO-3 from register 
DR6-7 from register 
Register from DR6-7 
Register from DRO-3 
TR3 from register 
TR4-7 from register 
Register from TR3 
Register from TR4-7 
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0000 1111 
0000 1111 
0000 1111 
0000 1111 
0000 1111 
0000 1111 
0000 1111 
0000 1111 


0000 1111: 


0000 1111 
0000 1111 


- 0010 0010: 
:0010 0010: 
: 0010 0000 : 


: 0010 0011 
:0010 0011 
: 0010 0001 
: 0010 0001 


11 000 reg 
11 eee reg 
11 eee reg 


: 11 eee reg 
: 11 eee reg 
: 11 eee reg 
: 11 eee reg 
:0010 0110: 
0010 0110: 
:0010 0100: 
:0010 0100: 


11011 reg 
11 eee reg 
11011 reg 
11 eee reg 


Penalty 


if Cache Notes 
Miss 


2 

2 

2 

2 

2 

5 

5 

3 

2 
RV/P 
RV/P 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 
Instruction Format Cache | if Cache 
Hit Miss 


Notes 


PROCESSOR CONTROL INSTRUCTIONS (Continued) 


CPUID = CPU Identification 0000 1111 : 1010 0010 


EAX = 1 
EAX = 0, >1 


CLTS = Clear Task Switched Flag 0000 1111: 00000110 


INVD = Invalidate Data Cache 0000 1111 : 0000 1000 


WBINVD = Write-Back and Invalidate Data Cache 
0000 1111 : 0000 1001 
INVLPG = Invalidate TLB Entry 


INVLPG memory 0000 1111 : 0000 0001 : mod 111 r/m 12/11 H/NH 


| 
| 
—_, 


PREFIX BYTES 


Address Size Prefix 01100111 
LOCK = Bus Lock Prefix 1111 0000 
Operand Size Prefix 01100110 


Segment Override Prefix 
CS: 0010 1110 
DS: 0011 1110 
ES: 0010 0110 
FS: 01100100 
GS: 01100101 
SS: 00110110 


PROTECTION CONTROL 


ARPL = Adjust Requested Privilege Level 
From memory 0110 0011: mod regr/m 


= 
co 
—s 
1 
1 
1 
1 
1 
1 
— 
LAR = Load Access Rights 
From register 0000 1111 : 0000 0010: 11 regi reg2 11 
From memory 0000 1111 : 0000 0010 : mod reg r/m 11 


From register 0110 0011: 11 reg! reg2 


LGDT = Load Global Descriptor 
Table register 0000 1111 : 0000 0001 : mod 010 r/m 12 


LIDT = Load Interrupt Descriptor 
Table register 0000 1111 : 0000 0001 : mod 011 r/m 12 


LLDT = Load Local Descriptor 


Table register from reg. 0000 1111 : 0000 0000: 11 010 reg 
Table register from mem. 0000 1111 : 0000 0000 : mod 010 r/m 
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Table 13-15. Clock Count Summary (Continued) 


Instruction Format 


PROTECTION CONTROL (Continuec) 
LMSW = Load Machine Status Word 


_From register 0000 1111: 

' From memory 0000 1111: 
LSL = Load Segment Limit 

From register 0000 1111: 

From memory 0000 1111: 
LTR = Load Task Register 

From register 0000 1111: 

From memory 0000 1111: 


SGDT = Store Global Descriptor Table 
0000 1111 


SIDT = Store Interrupt Descriptor Table 

0000 1111 
SLDT = Store Local Descriptor Table 
0000 1114 
0000 1111 


To register 
To memory 


SMSW = Store Machine Status Word 


INT3 = Interrupt Type 3 
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_ To register 0000 1111 
| To memory 0000 1111 
STR = Store Task Register 
To register 0000 1111 
To memory 0000 1111 
VERR = Verify Read Access 
Register 0000 1111 
Memory 0000 1111 
VERW = Verify Write Access | 
To register 0000 1111 
To memory 0000 1111 
INTERRUPT INSTRUCTIONS 
INTn = Interrupt Type n 1100 1101 
1100 1100 


0000 0001 
0000 0001 


0000 0011 


0000 0000 : 
0000 0000: 


: 0000 0001 


: 0000 0001 


: 0000 0000 : 
: 0000 0001 


: 0000 0001 
: 0000 0001 


: 0000 0000 : 
: 0000 0000 : 


: 0000 0000 : 
: 0000 0000 : 


: 0000 0000 : 
: 0000 0000 : 


: type 


:11110 reg 
:>mod 110 r/m 


: 11 reg1 reg2 
0000 0011: 


mod reg r/m 


11011 reg 
mod 011 r/m 


: mod 000 r/m 


: mod 001 r/m 


11 000 reg 


: mod 000 r/m 


: 11 000 reg 
:mod 100 r/m 


11001 r/m 
mod 001 r/m 


11100 r/m 
mod 100 r/m 


11101 r/m 
mod 101 r/m 


Penalty 
if Cache 
Miss 


20 


ss 


ae 


~~“ 


INT + RV/P, 21 
4/0 
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Table 13-15. Clock Count Summary (Continued) 


Penalty 


Instruction Format if Cache 


INTERRUPT INSTRUCTIONS (Continued) 


INTO = Interrupt 4 if Overflow Flag Set 
. 1100 1110 


Taken 
Not Taken 


BOUND = Interrupt 5 if Detect Value Out Range 
0110 0010: mod reg r/m 


If in range 
if out of range 


IRET = Interrupt Return 4100 1111 


Real Mode/ Virtual Mode 
Protected Mode 
To same level 
To outer level 
To nested task 
(EFLAGS.NT= 1) 


RSM = Exit System Management Mode 
0000 1111: 1010 1010 


SMBASE Relocation 
Auto HALT Restart 
1/O Trap Restart 


External interrupt . 
41 


NMI = Non-Maskable Interrupt INT+3 


Page Fault 


VM86 Exceptions 


CLI 
STI 
INTn 
PUSHF 
POPF 
IRET 
IN 
Fixed Port INT +50 
Variable Port INT +51 
OUT 
Fixed Port INT + 50 
Variable Port INT + 51 
INS INT + 50 
OUTS INT + 50 
REP INS INT +51 


REP OUTS INT+ 51 
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Table 13-16. Task Switch Clock Counts 


Value for INT 


oe 


Real Mode 


Protected Mode 
Interrupt/Trap gate, same level 
Interrupt/Trap gate, different level 
Task Gate 


Virtual Mode 
Interrupt/Trap gate, different eve 
Task Gate 


Abbreviations Definition 


16/32 16/32 bit modes 

U/L unlocked/locked 

MN/MX minimum/maximum 

L/NL loop/no loop 

RV/P real and virtual mode/protected mode 

R real mode 

P protected mode | 
T/NT. taken/not taken | 
H/NH hit/no hit 


NOTES (for Tables 13-17 through 13-79): 


. Assuming that the operand address and stack address fall in different cache sets. 
. Always locked, no cache hit case. 


. Clocks = 10 + max(loga(|m)),n) 

. Clocks = {qoutient(couint/operand length)}*7+9 
= 8 if count < operand length (8/16/32) 

. Clocks {qoutient(couint/operand length)}*7 +9 


Hl 


9 if count < operand length (8/16/32) 


Equal/not equal cases (penalty is the same regardless of lock) 

. Assuming that addresses for memory read (for indirection), stack puch/pop and branch fall in different cache sets. 
. Penalty for cache miss: add 6 clocks for every 16 bytes copied to new stack frame. 

. Add 11 clocks for each unaccessed descriptor load. 

. Refer to task switch clock counts table for value of TS. 

. Add 4 extra clocks to the cache miss penalty for each 16 bytes. 
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For notes 12-13: (6 =0-3, non-zero byte number); 
(i=0-1, non-zero nibble number); 
(n=0-3, non-bit number in nibble); 


12. Clocks = 8 + 4 (b+1) + 3(i+1) + 3(n+1) 
= 6 if second operand = 0 

13. Clocks = 9 + 4 (b+1) + 3(14+1) + 3(n+1) 
= 7 if second operand = 0 

For notes 14-15: (n= bit position 0-31) 

14. Clocks = 7 + 3(32-n) 
= 6 if second operand = 0 

15. Clocks = 8 + 3(32-n) 


7 if second operand = 0 

16. Assuming that the two string addresses fall in different cache sets. 

17. Cache miss penalty: add 6 clocks for every 16 bytes compared. Entire penalty on first compare. 
18. Cache miss penalty: add 2 clocks for every 16 bytes of data. Entire penalty on first load. 

19. Cache miss penalty: add 4 clocks for every 16 bytes moved. (1 clock for the first operation and 3 for the second) 
20. Cache miss penalty: add 4 clocks for every 16 bytes scanned. (2 clocks each for first and second operations) 
21. Refer to interrupt clock counts table for value of INT. 

22. Clock count includes one clock for using both displacement and immediate. 
23. Refer to assumption 6 in the case of a cache miss. 

24. Virtual Mode Extensions are disabled. 

25. Protected Virtual Interrupts are disabled. 


Table 13-18. I/O Instructions Clock Count Summary 


Instruction 


IN = Input from: 
Fixed Port 1110 010w: port number 
Variable Port 1110 110w 


OUT = Output to: 
Fixed Port 1110 011w: port number 
Variable Port 1110 110w 


INS = Input Byte/Word from DX Port 
0110 110w 


OUTS = Output Byte/Word to DX Port 
0110 111w 


NOTES: 

1. Two clock cache miss penalty in all cases. 

2. c = count in CX or ECX. 

3. Cache miss penalty in all modes: Add 2 clocks for every 16 bytes. Entire penalty on second operation. 
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Table 13-19. Floating Point Clock Count Summary 


instruction Format 
DATA TRANSFER 
FLD = Real Load to ST(0) 
32-bit memory 11011 001 : mod 000 r/m: 
64-bit memory 11011101 :mod000 r/m: 
80-bit memory 11011011: mod 101 r/m: 
ST(i) 11011 001 : 11000 ST(i) 
FILD = Integer Load to ST(0) 
16-bit memory 11011 111: mod 000 r/m: 
32-bit memory 11011 014: mod 000 r/m: 
64-bit memory 11011 111 : mod 101 r/m: 
FBLD = BCD Load to ST(0) 
11011 111: mod100r/m: 
FST = Store Real from ST(0) 
32-bit memory 11011 011:mod010r/m: 
64-bitmemory 11011 101:mod010r/m: 
ST(i) 11011 101 : 11001 ST(i) « 
FSTP = Store Real from ST(0) and Pop 
32-bit memory 11011011: mod011r/m: 
64-bit memory 11011 101: mod011 r/m: 
80-bit memory 11011011: mod111r/m: 
ST(i) 11011 101 : 11001 ST(i) 
FIST = Store Integer from ST(0) 
16-bit memory 11011111:mod010r/m: 
|. 32-bit memory 11011 011:mod010r/m: 
FISTP = Store Integer from ST(0) and Pop 
16-bit memory 11011111: mod0111r/m: 
32-bit memory 11011 011:mod011r/m: 
_ 64-bit memory 11011 111 :mod 111 r/m: 
FBSTP = Store BCD from ST(0) and Pop 
11011 111:mod110r/m 


FXCH = Exchange ST(0) and ST(i) 


11011 001 
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: 11001 ST(i) 


s-i-b/disp. 
s-i-b/disp. 
s-i-b/disp. 


s-i-b/disp. 
s-i-b/disp. 
s-i-b/disp. 


s-i-b/disp. 
s-i-b/disp. 


s-i-b/disp. 


s-i-b/disp. 
s-i-b/disp. 
s-i-b/disp. 


s-i-b/disp. 
s-i-b/disp. 


s-i-b/disp. 
s-i-b/disp. 
s-i-b/disp. 


: S-i-b/disp. 


Avg (Lower 
Range... 
Upper 

Range) 


DW ® 
££ 


2 
3 

ie 4) 
| 7.8(2-8) 


7 

8 

Hf | 
33.4(29-34) 

| 33.4(29-34) 


a 


14.5(13-16) 2 
11.5(9-12) 2 
16.8(10-18) 3 


ie) 


7 
8 
6 
3 


33.4(29-34) 
32.4(28-34) 


33.4(29-34) 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Avg (Lower | 
Range... 
Upper 


Instruction Format 


COMPARISON INSTRUCTIONS 


FCOM = Compare ST(0) with Real 
32-bit memory 11011 000 : mod 010 r/m: s-i-b/disp. 
64-bit memory 11011 100 : mod 010 r/m: s-i-b/disp. 
ST(i) 11011 000 : 11010 ST(i) 


FCOMP = Compare ST(0) with Real and Pop 
32-bit memory 11011 000 : mod 011 r/m: s-i-b/disp. 
64-bit memory 11011 100 : mod 011 r/m: s-i-b/disp. 

ST(i) 11011 000: 11011 ST(i) 


FCOMPP = Compare ST(0) with ST(1) and Pop Twice 
11011 110: 1101 1001 


FICOM = Compare ST(0) with Integer 
16-bit memory 11011110: mod010r/m: s-i-b/disp. 
32-bit memory 11011010: mod 010 r/m: s-i-b/disp. 


18(16-20) 
16.5(15-17) 


FICOMP = Compare ST(0) with Integer 
16-bit memory 11011110: mod 011 r/m: s-i-b/disp. 18(16-20) 
32-bit memory 11011010: mod 011 r/m: s-i-b/disp. 16.5(15-17) 


FTST = Compare ST(0) with 0.0 
11011 011: 11100100 
FUCOM = Unordered compare ST(0) with ST(i) 
11011 101 : 11100 ST(i) 
FUCOMP = Unordered compare ST(0) with ST(i) and Pop 
11011 101: 11101 ST(i) 
FUCOMPP = Unordered compare ST(0) with ST(1) and Pop Twice 
11011 101: 11101 1001 


FXAM = Examine ST(0) 
11011 001 : 1110 0101 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Instruction Format 


_|CONSTANTS 


FLDZ = Load + 0.0 Into ST(0) 


11011 001:11101110: 


FLD1 = Load + 1.0 Into ST(0) 


11011 001 : 1110 1000: 


FLDP1 = Load 7 Into ST(0) 


11011 001:11101011: 


FLDL2T = Load logo(10) Into ST(0) 


11011 001: 11101001: 


FLDL2E = Load log2(e) Into ST(0) 


11011 001: 11101010: 


FLDLG2 = Load log,o(2) Into ST(0) 


11011 001: 11101100: 


FLDLN2 = Load loge(2) Into ST(0) 


— 11011 001: 1110 1101: 


ARITHMETIC 


FADD = Add Real with ST(0) 
ST(0) <— ST(0) + 32-bit memory 


11011 000 : mod 000 r/m: s-i-b/disp. 


ST(0) <— ST(0) + 64-bit memory 


11011 100 : mod 000 r/m: s-i-b/disp. 


ST(d) — ST(0) + ST(i) 


11011 d0O : 11000 ST(i) 


FADDP = Add real with-ST(0) and Pop (ST(i) <- ST(0) + ST(i)) 
11011 110: 11000 ST(i): 
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Concurrent 
Execution 
Notes 


Avg (Lower 
Range... 
Upper 

Range) 


10(8-20) 


10(8-20) 7(5-17) 


10(8-20) 7(5-17) 


Lose | [sim | 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Avg (Lower 
Range... 
Upper 

Range) 


Instruction Format 


ARITHMETIC (Continued) 


Concurrent 
..| Execution 
FSUB = Subtract Real from ST(0) 


ST(0) <— ST(0) — 32-bit memory 


11011 000 : mod 100 r/m: s-i-b/disp. 10(8-20) 7(5-17) 
ST(0) <— ST(0) — 64-bit memory 
11011 100 : mod 100 r/m: S-i- -b/disp. 10(8-20) 7(5-17) 
ST(d) <— ST(0) — ST(i) 


11011 d0O : 11001 ST(i) 10(8-20) 7(5-17) 


FSUBP = Subtract real from ST(0) and Pop (ST(i) <— ST(0)—ST(i)) 
11011 110: 11001 ST(i) 


FSUBR = Subtract Real reversed (Subtract ST(0) from Real) 
ST(0) <— 32-bit memory — ST(0) 

11011 000 : mod 101 r/m: s-i-b/disp. 10(8-20) 2 
ST(0) <— 64-bit memory — ST(0) 

11011 100 : mod 101 r/m: s-i-b/disp. 10(8-20) 3 
ST(d) <— ST(i) — ST(0) 

11011 d0O: 11001 ST(i) 10(8-20) 7(5-17) 


FSUBRP = Subtract Real reversed and Pop (ST(i) <— ST(i) —ST(0)) 
11011 110: 11100 ST(i) 10(8- 20) Bae 17) 


FMUL = Multiply Real with ST(0) 
ST(0) <— ST(0) X 32-bit memory 
11011 000 : mod 001 r/m: s-i-b/disp. 
ST(0) <— ST(0) X 64-bit memory 
11011 100 : mod 001 r/m: s-i-b/disp. 
ST(d) <— ST(0) X ST(i) 
11011 d0O : 11001 ST(i) 


7(5-17) 


7(5-17) 


FMULP = Multiply ST(0) with ST(i) and Pop (ST(i) <— ST(0)xST(i)) 
11011 110: 11001 ST(i) 


FDIV = Divide ST(0) by Real 
ST(0) <— ST(0)/ 32-bit memory 

11011 000 : mod 110 r/m: s-i-b/disp. 
ST(0) <— ST(0)/ 64-bit memory 

11011 100 : mod 110 r/m: s-i-b/disp. 
ST(d) <— ST(0)/ ST(i) 


11011 d0O: 11111 ST(i) 


FDIVP = Divide ST(0) by ST(i) and Pop (ST(i) — ST(0)/ST(i)) 
11011 110: 11111 ST(i) 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Instruction Format Avg (Lower 
Range... 
Upper 
Range) 
ARITHMETIC (Continued) 


FDIVR = Divide real reversed (Real/ST(0)) 
ST(0) <— 32-bit memory/ ST(0) 


| 11011 000 : mod 114 r/m: s+ b/disp. 73 
_ |ST(O) <— 64-bit memory/ ST(0) . . 
11011 100: mod 111 r/m: s-i-b/disp. 73 


 IST(d) — ST(i)/ ST(0) 
7 11011 dOO : 11110 ST(i) 73 


FDIVRP = Divide real reversed and Pop (ST(i) <— ST(i)/ ST(0)) 
11011 110: 11110 ST(i) : 73 70 3 


FIADD = Add Integer to ST(0) 

| Sto) <— ST(0) + 16-bit memory 
11011 110: mod 000 r/m: s-i-b/disp. 24(20-35) 7(5-17) 

210) <— ST(0) + 32-bit memory | 
11011010: mod 000 r/m : s-i-b/disp. 22.5(19-32) 7(5-17) 


FISUB = Subtract Integer from ST(0) 
ST(0) <— ST(0) — 16-bit memory | | | 
a 11011 110: mod 100 r/m: s-i-b/disp. 24(20-35) 
ST(0) <— ST(0) — 32-bit memory 
11011 010: mod 100 r/m: s-i-b/disp. 22.5(19-32) 7(5-17) 


7(5-17) 


FISUBR = Integer Subtract Reversed 
ST(0) <— 16-bit memory — ST(0) | : 
11011 110: mod 101 r/m: s-i-b/disp. 
a <— 32-bit memory — ST(0) 
11011010: mod 101 r/m: s-i-b/disp. 


24(20-35) 7(5-17) 


22.5(19-32) 7(5-17) 


FIMUL = Multiply Integer with ST(0) 
ST(0) <— ST(0) X 16-bit memory 

11011 110: mod 101 r/m: s-i-b/disp. 
ST(0) <— ST(0) X 32-bit memory 

11011 010: mod 001 r/m: s-i-b/disp. 


25(23-27) 


23.5(19-32) 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Concurrent 
Execution 


Avg (Lower 
Range... 
Upper 

Range) 


Instruction Format 


ARITHMETIC (Continued) 


FIDIV = Integer Divide 
ST(0) <— ST(0)/ 16-bit memory 

11011 110: mod 110 r/m: s-i-b/disp. 87(85-89) 
ST(0) <— ST(0)/ 32-bit memory 

11011 010: mod 110 r/m: s-i-b/disp. 85.5(84-86) 


FIDVR = Integer Divide Reversed 
ST(0) <— 16-bit memory/ST(0) 

11011 110: mod 1111r/m: s-i-b/disp. 87(85-89) 
ST(0) < 32-bit memory/ST(0) 

11011 010: mod 111 r/m: s-i-b/disp. 85.5(84-86) 


FSQRT = Square Root 
11011 001 : 11111010 85.5(83-87 


FSCALE = Scale ST(0) by ST(1) 
11011 001: 1111 1101 31 (30-32) 


70 


NSN 
oO 


~ 
a 


FXTRACT = Extract Components of ST(0) 


11011 001 : 1111 0100 19(16-20) 4(2-4) 


FPREM = Partial Reminder 
11011 001: 1111 1000 84(70-138) 2(2-8) 


FPREM1 = Partial Reminder (IEEE) | 
11011 001 : 1111 0101 94.5(72-167) 5.5(2-18) . 


7.4(2-8) 


FRNDINT = Round ST(0) to Integer 

11011 001: 1111 1100 29.1(21-30) 
FABS = Absolute value of ST(0) 

11011 001 : 1110 0001 3 
FCHS = Change Sign of ST(0) 

11011 001 : 1110 0000 


TRANSCENDENTAL | 


FCOS = Cosine of ST(0) 
11011 001 : 1111 1111 241(193-279) 
FPTAN = Partial Tangent of ST(0) 
11011 001 : 1111 0010 244(200-273) 70 


FPATAN = Partial Arctangent 
11011 001: 1111 0011 289(218-303) 


add 
~/ 


Rep) 
N“N 


9(2-17) 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Instruction Format 


TRANSCENDENTAL (Continued) 


FSIN = Sine of ST(0) 
11011 001: 11111110 


FSINCOS = Sine and Cosine of ST(0) 
11011 001: 1111 1011 


F2XM1 = 2ST(0) — 1 
11011 001: 1111 0000 


FYL2X = ST(1) x loga(ST(0)) 
11011 001: 1411 0001 


FYL2XP1 = ST(1) x logo(ST(0) + 1.0) 
11011 001 : 1111 1001 


PROCESSOR CONTROL 


FINIT = Initialize FPU 
11011 001: 11100011 


FSTSW AX = Store status word into AX 
11011 111: 11100000 


FSTSW = Store status word into memory 
| 11011 101 : mod 111 r/m 


FLDCW = Load control word 
11011 001 : mod 101 r/m 


{FSTCW = Store control word 


11011 001: mod 111r/m: 


FCLEX = Clear exceptions 
11011011: 11100010 


FSTENV = Store environment 


11011011:mod110r/m: 


Real and Virtual Modes 16-bit address 
Real and Virtual Modes 32-bit address 
Protected Mode 16-bit address 
Protected Mode 32-bit address 


FLDENV = Load Environment 


11011 011: mod 100 r/m: 


Real and Virtual Modes 16-bit address 
Real and Virtual Modes 32-bit address 
Protected Mode 16-bit address 
Protected Mode 32-bit address 
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: S-i-b/disp. 


: S-i-b/disp. 


s-i-b/disp. 


s-i-b/disp. 


s-i-b/disp. 


Cache Hit 


Avg (Lower 
Range... 
Upper 

Range) 


01010) 0) 
OONN 
AAALA 


44 2 
44 9 
34 2 
34 2 
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Table 13-19. Floating Point Clock Count Summary (Continued) 


Avg (Lower 
Range... 
Upper 

Range) 


Instruction Format 


PROCESSOR CONTROL (Continued) 


FSAVE = Save State ; | 
11011 101 : mod 110 r/m: s-i-b/disp. 


Real and Virtual Modes 16-bit address 
Real and Virtual Modes 32-bit address 
Protected Mode 16-bit address 
Protected Mode 32-bit address 


hPAAAH 


FRSTOR = Restore State 
11011 101 : mod 100 r/m: s-i-b/ 
Real and Virtual Modes 16-bit address 
Real and Virtual Modes 32-bit address 


Protected Mode 16-bit address 
Protected Mode 32-bit address 


FINCSTP = Increment Stack Pointer 
11011001 : 11110111 


FDECSTP = Decrement Stack Pointer 
11011 001: 11110110 


FFREE = Free ST(i) 
11011 101 : 11000 ST(i) 


FNOP = No Operations 
11011 101 : 1101 0000 
WAIT = Wait until FPU ready (min/max) 
10011011 


NOTES: 
. If operand is 0 clock counts = 27. 
2. If operand is 0 clock counts = 28. 
3. If CW.PC indicates 24-bit precision then subtract 38 clocks. 
If CW.PC indicates 53-bit precision then subtract 11 clocks. 
. If there is a numeric error pending from a previous instruction add 17 clocks. 
. If there is a numeric error pending from a previous instruction add 18 clocks. 
. The INT pin is polled several times while this function is executing to assure short interrupt latency. 
. lf ABS(operand) is greater than 7/4 then add n clocks, where n= (operand/(7/4)). 


as 


NO OS 
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14.0 DIFFERENCES BETWEEN 
INTEL486 PROCESSORS AND 
INTEL386 PROCESSORS _ 


The differences between Intel486 processors and 
Intel386 processors are due to performance en- 
hancements. The differences are listed below. 


1. Instruction clock counts have been reduced to 
achieve higher performance. (See section 13.0, 
“Instruction Set Summary.’’) 


2. The Intel486 processor bus is significantly faster 
than the Intel3886 processor bus. Differences in- 
clude a 1X clock, parity support, burst cycles, 
cacheable cycles, cache invalidate cycles and 8- 
bit bus support. The Hardware Interface and Bus 
Operation sections (sections {9.0 and 10.0) of the 
data sheet should be carefully read to under- 
stand the Intel486 processor bus functionality. 


3. To support the on-chip cache bits have been 
added to control register 0 (CD and NW) (see 
section 4.2.3.1, ‘Control Registers’), new pins 


have been added to the bus (see section 9.0, . 


“Hardware Interface’) and new bus cycle types 
have been added (see section 10.0, ‘‘Bus Opera- 
tion’). The on-chip cache needs to be enabled 
after reset by clearing the CD and NW bit in CRO. 
4. Eight new instructions have been added: 
_ @ Byte Swap (BSWAP) 
e Exchange-and-Add (XADD) 
© Compare and Exchange (CMPXCHG) 
Invalidate Data Cache (INVD) 


e Write-back and _ Invalidate Data Cache 
(WBINVD) 


e invalidate TLB Entry (INVLPG) 
-@ Processor Identification (CPUID) 
e Resume (RSM) 


5. Two bits defined in control register 3, the page 

_ table entries and page directory entries (PCD and 

PWT). (See section 6.4.2.5, ‘Page Directory/Ta- 
ble Entries.’’) 


6. A page protection feature has been added. This 
feature required a new bit in control register 0 
(WP) (See sections 4.2.3.1 ‘Control Registers” 
and 6.4.3 “Page Level Protection.”) 


7. An Alignment Check feature has been added. 

- This feature required a bit in the flags register 
(AC) (section 4.2.2.3 “Flags Register’) and a bit 
in control register 0 (AM) (section 4.2.3.1 “Con- 
trol Registers’). 
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10. 


11. 
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. The replacement algorithm for the translation 


lookaside buffer has been changed from a ran- 
dom algorithm to a pseudo least recently used 
algorithm like that used by the on-chip cache. 
(See section 7.5 “Cache Replacement” for a 
description of the algorithm.) 


. Three testability registers, TR3, TR4 and TRS, 


have been added for testing the on-chip cache. 
TLB testability has been enhanced. (See sec- 
tion 11.0, ‘‘Testability.”’) 


The prefetch queue has been increased from 16 
bytes to 32 bytes. A jump always needs to exe- 
cute after modifying code to guarantee correct 
execution of the new instruction. 


After reset, the ID in the upper byte of the DX 
register is 04. 


14.1 Differences between the Intel386 


In 


Processor with an Intel387™ 
Math CoProcessor and 
intel486 DX, IntelDX2 and 
intelDX4 Processors 


addition to the previously mentioned enhance- 


ments, the Intel486 DX, IntelDX2 and IntelDX4 proc- 
essors offer the following features: 


1. 


The complete Intel387 math coprocessor instruc- 
tion set and register set have been added. No I/ 
O cycles are performed during Floating Point in- 
structions. The instruction and data pointers are 
set to 0 after FINIT/FSAVE. Interrupt 9 can no 
longer occur, interrupt 13 occurs instead. 


. Support for floating point error reporting modes 


to guarantee DOS compatibility. These modes re- 
quire a bit in control register 0 (NE) (see section 


. 4.2.3.1, “Control Registers”) and pins (FERR# 


and IGNNE#). (See sections 9.2.15, “Numeric 
Error Reporting” and 10.2.14 “Floating Point Er- 


~ ror Handling.”’) 


_ In some cases FERR # is asserted when the next 


floating point instruction is encountered and in 
other cases it is asserted before the next floating 
point instruction is encountered, depending upon 
the execution state the instruction causing ex- 
ception. (See sections 9.2.15, ‘Numeric Error 
Reporting” and 10.2.14, “Floating Point Error 
Handling.”’) For both of these cases, the Intel387 
math coprocessor asserts ERROR# when the 
error occurs and does not wait for the next float- 
ing point instruction to be encountered. 


. The contents of the base registers including the 


floating point registers may be different after — 
reset. 


intel. 

15.0 DIFFERENCES BETWEEN THE 
PGA, SQFP AND PQFP 
VERSIONS OF THE INTEL486 SX 


AND INTEL486 DX 
PROCESSORS 


The section lists: the differences between PGA, 
SQFP, and PQFP packages of the Intel486 SX and 
Intel486 DX processor. It also provides a quick pin 
reference table that is useful for converting a system 
design from one that uses a PGA package to one 
that uses an SQFP or PQFP package. 


NOTE: 
The boundary scan feature is not supported 
in the Intel486 SX processor in 168-pin PGA 
package. See sections 3.0, “Pin Descrip- 
tion,’ and 11.5, “Intel486 Processor Bound- 
ary Scan,” for pinout differences. 


15.1 2X Clock Mode 


The Intel486 processors offer 2X clock mode for 
systems that rely on dynamic frequency scaling for 
processor power management. This product is not 
intended for the desktop computer. This 2X clock 
processor differs from the 1X clock processor in the 
following ways: 


Pin Assignment/Function: The 2X clock product 
has a CLK2 input, rather than the 1X clock product’s 
CLK input. The CLK2 input must be synchronized to 
the system phase using the falling edge of RESET. 
(For reference, the pinout change from the existing 
Low Power Intel486 DX and SX processors is also 
shown. The CLKSEL pin is not used on the Intel486 
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processors, as it is on the existing Low Power In- | 
tel486 DX and SX processors.) 


Clock Control: The CLK2 input can be changed dy- 
namically. The Stop Clock interrupt is handled in a 
different manner. 


AC Specifications: In general, the AC specifica- 
tions for the 2X clock device will have slightly longer 
setups, holds, and maximum valid delays. This is 
consistent with the existing Low Power Intel486 DX 
and Intel486 SX processors. See section 15.1.5, 
“AC Specifications,” for 2X clock mode AC specifi- 
cations. 


Upgrades: There are no end user upgrade products 
planned for the 2X clock mode product. The UP# 
function is still provided for use by system designers 
that offer Intel486 SX to Intel486 DX processor up- 
grade cards. 


This section will explain the differences between the 
processor with the 2X clock mode and the processor 
with the 1X clock mode. 


15.1.1 PIN ASSIGNMENTS 


The Intel486 processor with the 2X clock option is 
available in the 208-lead SQFP and 196-lead PQFP 
packages. The pinout is identical to the Intel486 
processor with the 1X clock option with the excep- 


. tion of the name of the clock input. The 1X clock 


input is called CLK and the 2X clock input is 
called CLK2. 


Table 15-1 shows the changes between the existing 
products and new products. 


Table 15-2 is a list of pin descriptions. 


Table 15-1. Pinout Differences for the 2X Clock Mode (Low Power) Processors 
(196-Lead PQFP Package) 


Low Power 
Intel486T™ SX 
Processor 


Intel486 SX 
Processor 


a AC 
Sc 
eee 

[oer [owen 


Low Power 
intel486 DX 
Processor 


Intel486 
Processor 


SRESET 
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15.1.2 QUICK PIN REFERENCE 


_ Table 15-2. Pin Descriptions 
| Name and Function 


CLK2 provides the fundamental timing for the processor. Both of the internal timing 
phases, phase-1 (ph1) and phase-2 (ph2), are provided by the external CLK2 input. All 
external timing parameters are specified with respect to the phase-1 rising aiedee of 
CLk2. 


For the 2X clock mode the CLK frequency is twice the frequency of the processor. 


The RESET input forces the processor to begin execution at a known state. The 
processor cannot begin execution of instructions until at least 1 ms after Voc and CLK2 
have reached their proper AC and DC specifications. However, for soft resets, RESET 
should remain active for at least 30 CLK2 periods (equal to 15 internal processor CLK). 
The RESET pin should remain active during this time to insure proper processor 
operation. RESET is active HIGH. Reset is asynchronous, but must meet setup and hold 
times too, toga and to3 for recognition in any specific clock. 


RESET sets the SMBASE descriptor to default address of 30000H. If the system uses 
SMBASE relocation, then the SRESET pin should be used for soft resets. 


For the 2X clock mode, the falling edge of RESET synchronizes the processor internal 
clock phase. RESET must be used at power up and anytime the phase of the processor 
clock must be re-synchronized to the system phase. 


The SRESET pin duplicates all the functionality of the RESET pin with the following two 
exceptions: 

1. The SMBASE agiaier will retain its previous value. 

2. If UP# is asserted, SRESET will not have an effect on the host processor. 


For soft resets, SRESET should remain active for at least 30 CLK2 periods (equal to 15 
internal processor CLK). SRESET is active HIGH. SRESET is asynchronous but must 
meet setup and hold times tao and ta, for recognition in any specific clock. 


RESET ; 


SRESET [ 
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15.1.3 CLOCK CONTROL 


- 45.1.3.1 Clock Generation 


The frequency of CLK2 is twice the internal frequen- 


cy of the processor. The internal clock is comprised 
of two phases, ‘“PH1” and ‘“PH2’. Each CLK2 peri- 
od is a phase of the internal clock. Figure 15-1 illus- 
trates the relationship between the CLK2 input and 


INTERNAL PROCESSOR 
CLOCK PERIOD 


CLK2 PERIOD | 


INTERNAL 
CPU CLK 


(half the freq. 
of CLK2) 


INTERNAL 33% 
CPU CLK 


RESET 
15 CLK min 


SRESET —=/ 


This edge has no relationship 
to Internal clock phase 
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the internal phases. All set-up, hold, float-delay and 
valid delay timings are referenced to the rising edge 
of phase 1 of CLK2. Thus it is important to synchro- 
nize the external circuitry with the phase of the CLK2 
input. The internal processor clock phase is deter- 
mined at the falling edge of the RESET input. RE- 
SET must meet the specified setup and hold times 
to correctly synchronize the internal clock phase. 
See Figure 15-2. ? 
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| | | | 
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Figure 15- 2. CLK2 and Internal Processor CLK vs. SRESET and RESET Timings 
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15.1.3.2 Stop Clock 


The processor with the 2X clock option does not rely 
on an internal Phase Lock Loop to generate the in- 
ternal phase clocks. Therefore, the frequency of the 
CLK2 input can be changed dynamically or ‘‘on-the- 
fly.” 


The 2X clock mode, Intel486 processor provides an 
interrupt mechanism, STPCLK#, that places the 
processor into a known state. Although the frequen- 
cy of the CLK2 input can be dynamically changed 
between 0 MHz and the maximum operating fre- 
quency of the processor, operation between 0 MHz 
and 8 MHz is not tested. Stopping the CLK2 input 
with the processor in a known state requires use of 
the STPCLK# mechanism. When the processor, rec- 
ognizes a STPCLK # request, the processor will stop 
execution on the next instruction boundary, stop the 
prefetcher, empty all internal pipelines and the write 
buffers, and then generate a Stop Grant bus cycle. 
At this point the processor is in the Stop Grant state. 
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The rising edge of STPCLK# will tell the processor 


that it can return to program execution at the instruc-. | 
tion following the interrupted instruction. 


Unlike the normal interrupts, INTR and NMI, the 
STPCLK# interrupt does not initiate interrupt ac- 
knowledge cycles or interrupt vector table reads. 


STPCLK# is active LOW and is provided with an 
internal pull-up resistor. STPCLK# is an asynchro- 
nous signal, but must remain active until the proces- 
sor issues the Stop Grant bus cycle. STPCLK# may 
be de-asserted at any time after the processor has 
issued the Stop Grant bus cycle. Note that 
STPCLK# should NOT be de-asserted before the 
processor has issued the Stop Grant bus cycle. 
STPCLK# must be de-asserted for a minimum of 5 
clocks after RDY# is returned active for the Stop 


Grant bus cycle before being asserted again. 


15.1.3.3 Clock Control State Diagram 


The state diagram in Figure 15-3 shows the Stop 
Clock state transitions for the 2X clock mode. 


4 Halt State 


CLK2 is running 


EADS# 


STPCLK# de-asserted 


STPCLK# asserted and 
Stop Grant Bus Cycle generated 


INTR, NMI, SMI#, 
RESET, SRESET 
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1 Normal Execution 


Frequency can be 
changed dynamically 


STPCLKH 


STPCLK# asserted and 
Stop Grant Bus cycle de-asserted or 
generated RESET, SRESET 


2 Stop Grant State 


HOLD/BOFF/AHOLD,EADS# 
respond ICC depends on 
CLK2 frequency 


Start CLK2 input Stop CLK input 


3 Stop Clock State 


CLK2 inputis stopped _ 
CPU will not respond to inputs - 
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Figure 15-3. Stop Clock State Machine 2X Clock Mode 


Normal State 


This is the Normal operating state of the processor. 
During this state, the CLK2 input frequency can be 
changed dynamically or “on-the-fly” for power con- 
sumption control with no clock control latency. This 
capability provides a wide range of performance/ 


power consumption options. Operation of the proc- 
essor is tested between 8 MHz and the maximum 
operating frequency of the processor. Operation be- 
low 8 MHz is guaranteed by design, though is not 
100% tested. Operation at 0 MHz is tested when the 
stop clock protocol (STPCLK#) is used. 
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_ Stop Grant State 


The processor enters the Stop Grant state in re- 
sponse to a STPCLK# interrupt. The processor will 
- generate a Stop Grant bus cycle when it enters this 
state from the Normal state or the HALT state. The 
processor will not generate a Stop Grant bus cycle 
when it enters the Stop Grant state from the Stop 
Clock state. | | 


_While in the Stop Grant state, the pull-up resistors 
on STPCLK# and UP# are disabled internally. The 
system must continue to drive these inputs to the 


state they were in immediately before the processor © 


entered the Stop Grant state. For minimum proces- 
sor power consumption, all other input pins should 
be driven to their inactive level while the processor 
is in the Stop Grant state. 


During the Stop Grant state, the processor will re- 
spond to HOLD, AHOLD and BOFF# normally and 
can perform cache invalidates. An active edge on 
either the SMI # or NMI interrupts will be latched and 
will be serviced after the rising edge of STPCLK#. 
_ An INTR request will be serviced after the processor 
returns to the normal state as long as INTR is held 
active until the processor issues an interrupt ac- 
knowledge bus cycle. 


Stop Clock State | 


The processor enters the Stop Clock state when the 
system stops the CLK2 input. The system can stop 
the CLK2 input on either a logic high or a logic low. 
The CLK2 input must be restarted in the same state 
as when. it was stopped. In other words, any CLK2 
input state can be stretched. (See Figure 15-4 for 
details.) Processor operation at 0 MHz is tested only 
when the processor is in the Stop Clock state. 


. 

intel. 
In the Stop Clock state, the processor does not re- 
spond to any stimulus. The processor must re-enter 
the Stop Grant state (CLK2 input must be restarted) 
in order to perform any bus actions such as HOLD/ 
HLDA cycles, invalidates (AHOLD/EADS# or 
FLUSH# cycles), and BOFF#. It is recommended 
that CLK2 be restarted 2 clocks before and continue 
until 2 clocks after the transition of the HOLD, 
AHOLD, EADS #, FLUSH#, or BOFF # signals. 


The interrupt signals (SMI#, NMI, and INTR) will be 
recognized and serviced correctly if the input is held 
in the active state until the processor returns to the 
Stop Grant state. The Intel486 processor family re- 
quires that INTR be held active until the processor 
issues an interrupt acknowledge cycle in order to 
guarantee recognition. This condition also applies to 
the existing Intel486 processors. 


HALT State 


The processor enters the HALT state from the Nor- 
mal state on a HALT instruction. The system can 
place the processor into the Stop Grant state from 
the HALT state by asserting the STPCLK# input. 
The processor will generate a Stop Grant bus cycle 
when it enters the Stop Grant state. If the processor 
entered the Stop Grant state from the HALT state 
then it will return to the HALT state when the 
STPCLK# interrupt is de-asserted. When the proc- 
essor re-enters the HALT state it will generate a 
HALT bus cycle. 
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Figure 15-4. CLK2 Phase Coherence in CLK2 Stop and Restart 
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15.1.3.3 Supply Current Model for Stop Clock Modes and Transitions 


Figure 15-5 illustrates the effect of different Stop Clock state transitions on the supply current. 


A Change CLK2 Dynamically 


® Normal State 


© Stop Clock State 


B Not Tested ° 


lec max = 1 MA @ 3.3V 


Icc max = 2mA @ 5V 


C Stop and Restart CLK2 within Stop Grant State 
(Fully tested) 


2MHz 8MHz 


* guaranteed by design characterization. not 100% tested. 


16MHz 


33MHz 
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Figure 15-5. Supply Current Model for Stop Clock Modes and Transitions 


15.1.4 DC SPECIFICATIONS FOR 2X CLOCK 
OPTION 


For 2X clock DC specifications, refer to the 1X clock 
DC specifications. (See section 17.3, “DC Specifica- 
tions.’’) 


15.1.5 AC SPECIFICATIONS FOR 2X CLOCK 
OPTION 


The AC specifications given in the tables of this sec- 
tion consist of output delays, input setup require- 
ments and input hold requirements. All AC specifica- 
tions are relative to the rising edge of the phase 1 of 
the input system clock (CLK2), unless otherwise 
specified. (See Figures 15-6 through 15-8.) 
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15.1.15.1 3.3V AC Characteristic 
Table 15-3 is for 25- and 33-MHz Intel486 SX and 33-MHz Intel486 DX processors in 2X Clock Mode. 


Table 15-3. 3.3V AC Characteristics (2X Clock) 


Functional epereing range: Vcc = 3.3V £0.3V; Tcase = O°C to + 85°C; C, = 50 pF, roe otherwise 


specified 
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pereter | win | wx | win [wax [ unt [ _notwe 
— Pfs fe wel 
cL Feary TE fe Pee Pe 
CLR Pet ee se 
CLG igh Tne cf fs [| wa 
CLK Lon Ti A Ba 3 SS 


CLK2 Fall Time 2V to 0.8V, 
. Note 2 

CLK2 Rise Time 0.8V to 2V, 
Note 2 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 17 

D/C#, W/R#, ADS#, LOCK#, BREQ, | 

HLDA, SMIACT#, FERR# Valid Delay 

A2-A31, PWT, PCD, BEO-3#, M/IO#, 28 21 2 
D/C#, W/R#, ADS#, LOCK#, BREQ, 

HLDA Float Delay 


BLASTS, PLOGK# FostDoey | | | | er | wm | 2 


DO-D31, DPO-DP3 Write Data Valid 3 20 3 19 | ns 7 
Delay | 
DO-D31, DPO-DP3 Write Data Float 28 21 2 
Delay 


EADS Sep Ti ret tel [tel 
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Table 15-3. 3.3V AC Characteristics (2X Clock) (Continued) | 
Functional operating range: Vcc = 3.3V +0.3V; Tcase = O0°C to +85°C; C_ = 50 pF, unless otherwise 
specified 


[Symbot | __ Parameter ‘(win | wae 

fie | eEAGSeroTe SSC 

Fie | Kens, este aserSeuptine | @ | 

Fis | Kens, aster, asertostine | «| 

[ie | Ove. BROveSeup Tne | a 

rie | WOLD. AWOLD Sete Tie | ot 
: 


—s 
ah, 


BOFF # Setup Time 
HOLD, AHOLD, BOFF #Hold Time 


t20 FLUSH #, A20M#, NMI, INTR, SMI#, 11 
STPCLK#, SRESET, RESET, 
IGNNE # Setup Time 


—k 


RESET Falling Edge Setup Time | 9 


to4 FLUSH #, A20M #, NMI, INTR, SMI#, 4 


STPCLK #, SRESET, RESET, 
IGNNE #Hold Time 


too DO-—D31, DPO-—DP3, A4—A31 Read 
Setup Time 

to3 DO-D31, DPO—DP3, A4-—A31 Read 4 
Hold Time 


NOTES: 

1. O-MHz operation is tested using the STPCLK# and Stop Grant bus cycle protocol. Operation between 0 MHz < CLK2 
< 8 MHz is guaranteed by design characterization, but is not 100% tested. 7 . 

2. Not 100% tested, guaranteed by design characterization. 

3. FERR# and IGNNE# are present only in Intel486 DX processors. 
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15.1.5.2 5V AC Characteristics | 
Table 15-4 is for 25- and 33-MHz Intel486 SX and 33-MHz Intel486 DX processors in 2X Clock Mode. 


Table 15-4. 5V AC Characteristics (2X Clock) | 


Pincha operating range: Vcc = 5V +5%,; TCASE = 0°C +85°C; C; = 50 pF, unless otherwise specified. 


Symbol 
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— win [ ax | win [ max [ unt | Notes 
— ee 
CLK Freane fe [ee [ie [a 
GLK Pe Tle [ep fel 
cue Hh Ti 2 a AN A 
cLN@Low Tine cfs | [we | atoav 


CLK2 Fall Time 2V to 0.8V, 
. Note 2 

CLK2 Rise Time 0.8V to 2V, 
Note 2 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 17 

D/C#, W/R#, ADS #, LOCK#, BREQ, 

HLDA, SMIACT #, FIERR # Valid Delay 

A2-A31, PWT, PCD, BEO-3#, M/IO#, 21 
D/C#, W/RE, ADS#, LOCK #, BREQ, 

HLDA Float Delay 


PCHK # Valid Delay 


BLAST #, PLOCK#Valid Delay | 3 | 24 | 3) 
BLAST#,PLOCK#Float Delay «|= | 28 | 


DO-D31, DPO-DP3 Write Data Valid 3 3 
Delay 

DO-D31, DPO-DP3 Write Data Float 

Delay | 


EADS # Setup Time po | | 6 | 
EADS #Hold Time 


N 


re co = — (o) 


KEN #, BS16#, BS8#Setup Time 


KEN#, BS16#, BS8#Hold Time pat 
RDY #, BRDY #Setup Time po | | 6 | 


ele 
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Table 15-4. 5V AC Characteristics (2X Clock) (Continued) 
Functional operating range: Vcc = 5V +5%;. Toasge 0°C + 85°C; Cy, = 50 pF, unless otherwise specified. 


cs 
ri | Rove aRove HolgTime it a | 
fin | wo.arouDseuptine | | 
Tien [e0rresenptine | 
Tan | un. anoun corr enti | | 

_ 


FLUSH #, A20M#, NMI, INTR SMI#, 
STPCLK #, SRESET, RESET, IGNNE#, 
Setup Time 


toy FLUSH #, A20M#, NMI, INTR, SMI#, 
STPCLK #, SRESET, RESET, 
IGNNE #Hold Time 


too DO-D31, DPO-DP3, A4-—A31 Read 
Setup Time 
DO-D31, DPO-DP3, A4-A31 Read 
Hold Time 


NOTES: 

1. 0-MHz-operation is tested using the STPCLK# and Stop Grant bus cycle protocol. Operation between 0 MHz < CLK2 
< 8 MHz is guaranteed by design characterization, but is not 100% tested. 

2. Not 100% tested, guaranteed by design characterization. 

3. FERR# and IGNNE# are present only in Intel486 DX processors. 
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Figure 15-6. CLK2 Waveform 
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Output Valid Delay tx = tg, tg, tga, tio 
Output Float Delay ty = t7, to, ty4 


Figure 15-7. Valid and Float Delay Timings 


INPUTS 


242202-F1 
tx = tra, tia, te, te, tao, taoa, tea, t18a 
ty = t13, t15, t17, tra, tor, tog 


Figure 15-8. Setup and Hold Timings 
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16.0 OverDrive™ Processor Socket 


This section contains the specifications for the Over- 
Drive processor socket for systems based on In- 
tel486 processors. All of the specifications de- 
scribed herein are based on the specifications of the 
Intel486 processors. 


One of the most important features of the Intel486 
family architecture, compared with previous Intel ar- 
chitectures, is its upgradability via the OverDrive 
processor socket. Inclusion of the OverDrive proces- 
sor socket in systems based on the Intel486 family 
of microprocessors provides the end user with an 
easy and cost-effective way to increase system per- 
formance. The paradigm of simply installing an addi- 
tional component into an empty OverDrive proces- 
sor socket to achieve enhanced system perform- 
ance is familiar to the millions of end users and deal- 
ers who have purchased Intel math coprocessor up- 
grades to boost system floating point performance. 
The OverDrive processor provides improvement 
over the base system. The OverDrive processor 
takes advantage of Intel’s next generation processor 
technology to provide this performance improve- 
ment. 


The OverDrive processor socket described in this 
chapter is designed for the Pentium OverDrive proc- 
essor. The Pentium OverDrive processor is de- 
signed with Pentium processor core technology. 
This socket is also backwards compatible for the In- 
telSX2 and IntelDX2 OverDrive processors. Support 
for the future 3.3V Pentium OverDrive processor up- 
grade for IntelDX4 processor-based systems is also 
specified. 


The Pentium OverDrive processor implements a su- 
perset of Intel486 processor signals. The new sig- 
nals for the Pentium OverDrive processor socket, in 
addition to Intel486 processor signals, support a 
write-back protocol for the on-chip cache in the 
Write-Back Enhanced IntelDX2 processor. Imple- 
mentation of the cache writeback capability for the 
OverDrive processcr socket is optional, although im- 
plementation of the on-chip writeback protocol en- 
ables maximum performance gain. For more infor- 
mation, please contact Intel. 


As a new system architecture feature, the provision 
of the OverDrive processor socket as a means for 
PC users to take advantage of the ever more rapid 
advances in software and hardware technology 
helps to maintain the competitiveness of Intel PC- 
compatible systems over other architectures. 
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| The majority of upgrade installations which take ad- 


vantage of the OverDrive processor socket will be 
performed by end users and resellers. Therefore, it 
is important that the design minimize the amount of 
training and technical expertise required to install 
the OverDrive processors. Upgrade installation in- 
structions should be clearly described in the system 
user’s manual. In addition, by making installation 
simple and foolproof, PC manufacturers can reduce 
the risk of system damage, warranty claims and 
service calls. Feedback from Intel’s math coproces- 
sor upgrade customers highlights three main charac- 
teristics of end user easy designs: accessible Over- 
Drive processor socket location, clear indication of 
upgrade component orientation, and minimization of 
insertion force. 


OverDrive Socket Location 


The OverDrive processor socket can be located on 
either the motherboard or modular processor card. 
the OverDrive processor socket should be easily 
accessible for installation and readily visible when. 
the PC case is removed. The OverDrive processor 
socket should not be located in a position that re- 
quires removal of any other hardware (such as hard 
disk drives) in order to install the OverDrive proces- 
sor. 


Component Orientation 


The most common mistake made by end users and 
resellers when installing Math Coprocessor up- 
grades is incorrect orientation of the chip. This can 
result in irreversible damage to the chip. To solve 
this problem, Intel has designed OverDrive proces- 
sor sockets and OverDrive processors with keying — 
mechanisms to ensure that upgrade components fit 
in the right socket, with the right orientation. There 
are two OverDrive processor sockets, presented in 
this chapter, that can accept the OverDrive proces- 
sor for Intel486 processor-based systems, designat- 
ed as socket 3 and socket 6. The two sockets and 
the keying mechanism are illustrated in Figure 16-1. 


Socket 3 is a 237-pin socket and accepts both 5V 
and 3.3V OverDrive processors. The keying mecha- 
nism consists of one Key Pin (A1) and four missing 
pins (B1, C1, A2 and A3). To be effective as a keying 
mechanism, the locations in the socket correspond- 
ing to the four missing pins must be plugged. This 
socket is designed to be backwards compatible with 
the 169-pin IntelSX2 and IntelDX2 OverDrive 
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OverDrive Processor 
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Figure 16-1. OverDriveT™ Processor Sockets for Intel486™ Processor-Based Systems 


processors. In order to maintain compatibility, sock- 
et 3 includes the Key Pin at location E5. 


Socket 6 is a 235-pin socket and accepts 3.3V Over- 
Drive processors only. The keying mechanism con- 
sists of one Key Pin (A1) and five missing pins (B1, 
C1, A2, A3 and A19). Being designed for 3.3V Over- 
Drive processors only, socket 6 must not accept the 
169-pin IntelSX2 and IntelDX2 OverDrive proces- 
sors, therefore, the Key Pin E& is missing. To be 
effective as a keying mechanism, the locations in 
the socket corresponding to the six aun pins 
must be plugged. 
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In addition, the location of the pin 1 corner should be 
clearly marked on the motherboard, for example by 
silk screening. 


insertion Force 


The third major concern voiced by end users refers 
to how much pressure should be exerted on the up- 
grade chip and PC board for proper installation with- 
out damage. This becomes even more of a concern 
with the larger components which require up to 200 
pounds of pressure for insertion into a standard 
screw machine socket. This level of pressure can 
easily result in cracked traces and stress to solder 
joints. To minimize the risk of system damage, it is 
recommended that a Zero Insertion Force (ZIF) 
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socket be used for the OverDrive processor socket. 
Designing with a ZIF socket eliminates the need to 
design in additional structural support to prevent 
flexing of the PC board during installation, and re- 
sults in improved end user and reseller product sat- 
isfaction due to easy “drop-in” installation. If a LIF 
socket is used, sufficient support should be provided 
directly under the OverDrive processor socket. This 
_ will minimize the possibility of damage to the moth- 
erboard during insertion of the OverDrive processor. 


16.1 OverDrive Processor Socket 
Overview 


The circuit design requirements for the OverDrive 
processor socket are discussed in section 16.2, 
“OverDrive Processor Circuit Design,’’. In addition to 
the OverDrive processor socket circuits, there are 
layout considerations for the OverDrive processor 
socket and processor spatial requirements. These 
issues are discussed in section 16.3, “Socket Lay- 
out,”. Because the OverDrive processor must func- 
tion in the OverDrive processor socket, the Over- 
Drive processor socket heat dissipation specifica- 
tions must be implemented. Section 16.4, ‘““Thermal 
Design Consideration,” discusses the thermal con- 
siderations in the system design. Because the sys- 
tem must operate correctly with any OverDrive proc- 
essor without a BIOS change, BIOS and software 
restrictions and recommendations are provided in 
section 16.5, “BIOS and Software,”. Section 16.6, 
“Test Requirements,” discusses OverDrive proces- 
sor socket test requirements. Sections and 16.7, 
“OverDrive Processor Socket Pinout,” and 16.8, 
“3.3V Socket Specifications,”. specify the pinout 
specifications for the 5V and 3.3V OverDrive proces- 
sor sockets, respectively. Finally, section 16.9, “DC/ 
AC Specification,” specifies the electrical character- 
istics of the OverDrive processor socket. 


16.2 OverDrive Processor Circuit 
Design 


The Intel486 processors in the 168-pin PGA pack- 
age fit in the three inner rows of the 237-pin Over- 
Drive socket. The corresponding pins of the Intel486 
processor and the OverDrive socket define identical 
signals, with the following exceptions: 


e The pins corresponding to the signals TCK, TDI 
and TDO on Intel486 processors are defined as 
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Internal No Connect (INC) on the OverDrive proc- 
essor socket, and the pin corresponding to TMS 
is defined as UP# (pin C15 of the OverDrive 
processor socket). For compatibility, the system 
should not do boundary scan testing when the 
OverDrive processor is installed. 


e On Intel486 processors, the pin C14 defines 
FERR# and the pin A13 is INC, while on the 
OverDrive processor socket the pin D15 is INC 
and the pin B14 defines FERR#. 


e On Intel486 processors the pin C10 defines SRE- 
SET, while on the OverDrive processor socket 
the corresponding pin D11 is INC. 


In a system with a single socket motherboard de- 
sign, the processor and the OverDrive processor 
share the same socket. The Intel486 processor oc- 
cupies the three innermost rows of pins of the 237- 
pin OverDrive processor socket, with the ‘pin 1” 
(notched corner) oriented towards the “pin 1” cor- 
ner of the socket. The processor will be replaced 
with the OverDrive processor when the system is 
upgraded. In a jumperless, plug-and-play, upgrada- 
ble system, the pins D15 and B14 of the OverDrive 
processor socket should be connected together to 
the FERR# signal line, and the pins F19 (INIT) and 
D11 should be connected together to SRESET. 
When the OverDrive processor is installed, the pin 
C15 (UP#) must be isolated from TMS (see Figure 
16-2). 


DATA ADDR CTRL 


RESERVED 


SRESET 
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Figure 16-2. OverDrive™ Processor Socket 
Circuit Diagram-Single Socket Design 
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16.2.1 BACKWARD COMPATIBILITY 


The Pentium OverDrive processor socket for In- 
tel486 processor-based systems is designed to be 
compatible with the other OverDrive processors. 


The Pentium OverDrive processor socket has a 
fourth row of contacts around the outside of the 169 
contacts defined for the IntelSX2 and IntelDX2 Over- 
Drive processors. The three inner rows of the Penti- 
um OverDrive processor socket, with the inner key 
pin, are 100% compatible with the 169-pin PGA 
OverDrive processors. For backward compatibility, 
the inner row key pin location (E5) must be included 
in the Pentium OverDrive processor socket. 


16.3 Socket Layout 


This section discusses three aspects for the Over- 
Drive processor socket: size, upgradability, and ven- 
dors. | 


16.3.1 MECHANICAL DESIGN 
CONSIDERATIONS 


The Pentium OverDrive processcr for Intel486 proc- 
essor-based systems is designed to fit in a standard 
240-lead (19 x 19) PGA socket with four corner pins 
removed. The Pentium OverDrive processor uses a 
fan/heatsink, and therefore, requires vertical clear- 
ance to allow adequate air circulation. 


16.3.1.1 Fan Heatsink Design 


The maximum and minimum dimensions of the 
- Pentium OverDrive processor package with a 
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fan/heatsink are shown in Table 16-1. The fan/heat- 
sink unit space requirement is divided into the size of 
the actual heatsink, and the required free space | 
above the heatsink. The total height required for the 
Pentium OverDrive processor from the motherboard 
will depend on the height of the PGA socket. The 
total external height given in Table 16-1 is only mea- 
sured from the PGA pin stand-offs. Table 16-1 also 
details the minimum clearance needed around all 
four sides of the PGA package. 


Since the Pentium OverDrive processor dissipates 
more power than the Intel486 processor family 
members, it requires a larger cooling capacity. To 
facilitate the task of cooling the Pentium OverDrive 
processor, Intel will ship the product with a fan/heat- 
sink. No external connections (i.e., power): will be 
required for the fan/heatsink. All the needed con- 
nections will be made through the pins on the outer 


- row of the processor. 


16.3.1.2 Passive Heatsink Design 


The space required for the passive heatsink Over- 
Drive processors is less than the space require- 
ments shown in Table 16-1. The passive heatsink 
OverDrive processors include the IntelSX2 and In- 
telDX2 OverDrive processors. 


16.3.2 DESIGN RECOMMENDATIONS 


PC buyers value easy and safe upgrade installation. 
PC manufacturers can make upgrade component in- 
stallation in the OverDrive processor socket simple 
and foolproof for the end user and reseller by imple- 
menting the suggestions listed in Table 16-2. 


Table 16-1. Pentium™ OverDriveT™ Processor, 236-Pin, 
| PGA Package Dimensions with Fan/Heatsink Attached 


= Minimum 
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Length and Width (inches) 


Height (inches) 


Maximum 


1.975 
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Table 16-2. Socket and Layout Considerations 


Design Implementation 
Considerations e 


Visible OverDrive™ The OverDrive processor socket should be easily visible when the PC’s cover is 
processor socket removed. Label the OverDrive processor socket and the location of pin 1 by silk 
screening this information on the PC board. 


Accessible OverDrive | Make the OverDrive processor socket easily accessible to the end user (i.e., do 
processor socket not place the OverDrive processor socket under a disk drive). Be sure to leave 
enough clearance to open the Zero Insertion Force (ZIF) socket. 


Foolproof Chip This OverDrive processor socket must insure proper orientation of the OverDrive 

Orientation processor. The PGA package of the Pentium OverDrive processor for Intel486 
processor-based systems is oriented by the four corner pins that have been 
removed and the “KEY” pin from the “pin 1” corner. The four contacts (A2, A3, 
B1 and C1) in the socket should be plugged, such that PGA.pins cannot be 
inserted, to assure correct orientation. | 


Zero Insertion Force The high pin count of the OverDrive processor makes the insertion force required 

OverDrive processor | for installation into a screw machine PGA socket excessive. Even most Low 

socket Insertion Force (LIF) sockets often require more than 60 Ibs. of insertion force. A 
Zero Insertion Force (ZIF) socket insures that the chip insertion force does not 
damage the PC board. Be sure to allow enough clearance for the ZIF socket 
handle. Do not use a LIF or screw machine socket. 


Jumper or switch changes should not be needed to electrically configure the 
system for the OverDrive processor. 


Describe the OverDrive processor socket and the OverDrive processor 
installation procedure in the PC’s User’s Manual. 


1.963" 


1.840" 


OVERDRIVE PROCESSOR 
ACTIVE FAN/HEATSINK UNIT 


Adhesive 
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Figure 16-3. Pentium™ OverDrive™ Processor PGA Package with Heatsink Attached 


2-303 


intel486™ PROCESSOR FAMILY 


16.3.3 ZIF SOCKET VENDORS 


Intel has a list of qualified ZIF socket vendors. Con- 
tact Intel for more information. 


16.4 Thermal Design Considerations 


16.4.1 ACTIVE HEATSINK THERMAL DESIGN 


The Pentium OverDrive processor for Intel486 proc- 
essor-based systems will have an active fan/heat- 
sink for thermal dissipation. The maximum allowable 
temperature of the air entering the fan/heatsink can 
not exceed 55°C under the worst case operating 
conditions specified for the system. The fan/heat- 
~ sink reduces the need for high airflow. However, the 
system must provide adequate ventilation to prevent 
localized heating above the specified ambient. 


16.4.2 PASSIVE HEATSINK THERMAL DESIGN 


Passive heatsinks are used on the IntelSX2 and In- 
telDX2 OverDrive processors. The thermal efficiency 
of passive heatsink processors is dependent on sys- 
tem airflow. Please refer to the individual OverDrive 
processor Data Sheet for airflow requirements. 


16.5 BIOS and Software 


The following should be considered when designing 
the OverDrive processor socket for an Intel486 proc- 
essor-based system. 


16.5.1 OverDrive PROCESSOR DETECTION 


The component identifier ancl stepping/revision 
identifier for the:OverDrive processor is readable in 
the DH and DL registers respectively, immediately 
after RESET. See Table 16-3. These values can 
also be obtained using the CPU__ID instruction. 


As with the Intel486 processor specification, it is rec- 
ommended that the BIOS save the contents of the 
_ DX register, immediately after RESET, so that this 
_ information can be used later, if required. 
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- . | Pentium OverDrive 
processor (5V) 
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Table 16-3. DX Register Contents after Reset 


Component | Stepping 


OverDrive™ 
Processor 


Future Pentium™ 
OverDrive processor 
(3.3V) 


IntelDX2T™ OverDrive 
processor 
IntelSX2TM OverDrive Paw 


processor 
16.5.2 TIMING DEPENDENT LOOPS 


The OverDrive processor for Intel486 processor- 
based systems executes instructions at a multiple of 
the frequency of the input clock. This OverDrive 
processor also will use advanced design techniques 
to decrease the number of clocks per instruction 
(cpi) from that of Intel486 processors. Thus soft- 
ware, such as instruction-based timing loops, will ex- 
ecute faster on the OverDrive processor than on the 
Intel486 processor at the same input clock frequen- 


cy. Instructions such as NOP, LOOP, and JMP $+2 


are frequently used by the BIOS to implement timing 
loops that are required, for example, to enforce re- 
covery time between consecutive accesses for I/O 
devices. These instruction-based, timing-loop imple- 
mentations may require modification to be compati- 
ble with this OverDrive processor socket. 


In order to avoid any incompatibilities, timing loops 
can be implemented in hardware rather than in soft- 
ware. This provides transparency and also does not 
require any change in BIOS or I/O device drivers in 
the future when moving to higher processor clock 
speeds. 


As an example, a timing loop may be implemented 
as follows: The software performs a dummy I/O in- 
struction to an unused |/O port. The hardware for 
the bus controller logic recognizes this I/O instruc- 
tion and delays the termination of the I/O cycle by 
keeping RDY # or BRDY # deasserted for the appro- 
priate amount of time. 
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16.6 Test Requirements 


The electrical functionality of the OverDrive proces- 
sor socket can be verified by fully testing the PC with 
a populated OverDrive processor socket. Intel rec- 
ommends that the system be tested with all avail- 
able OverDrive processors that are compatible with 
the OverDrive socket type and power supply volt- 
age, to ensure that there are no BIOS issues. The 
BIOS requirements to maintain compatibility with all 
OverDrive processors are discussed in section 16.5, 
“BIOS and Software,” of this document. All Over- 
Drive processors undergo thorough application soft- 
ware compatibility testing prior to their introduction. 


16.7 OverDrive Processor Socket 
Pinout 


Socket 3 can accept all 5V OverDrive processors 
and the future 3.3V Pentium OverDrive processor for 
the IntelDX4 processor. The socket 3 pinout is 
shown in Figure 16-4 and Figure 16-5. Socket 6 is 
discussed in section 16.3, “Socket Layout,”’. 


16.7.1 PIN DESCRIPTION 


The signal pin descriptions for the OverDrive proces- 
sor are identical to the pin descriptions for the In- 
tel486 processor except for those shown in Table 
16-5. 


16.7.2 RESERVED PIN SPECIFICATION 


Many pins in the OverDrive processor socket are 
defined as reserved (RES). The function of these 
pins is documented separately. Some of these pins 
will be used to implement an on-chip writeback 
cache protocol, and the remaining pins will provide 
other OverDrive specific functions. These pins must 
not be connected unless they are used to implement 
these functions, as documented in the information 
available separately. To insure proper operation, 
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pins marked as NC must be left unconnected as 
well. For more information contact Intel. 


Section 16.7.4, “Shared Write-Back Pins,” discuss- 
es the Pentium OverDrive processor compatibility 
with the Write-Back Enhanced IntelDX2 processor. 


16.7.3 INC “internal No Connect” PIN 
SPECIFICATIONS 


INC Pins are defined as Internal No-Connects. This 
means that the pin is not connected to the proces- 
sor internally. Since the pin is inert and floating, it 
may be used in any manner seen fit, but must meet 
the INC pin specifications. In general, all INC pins 
have an intended use to implement a single proces- 
sor socket system design. The INC pin will never be 
used for any other function. The 6 signals which use 
the INC pins to simplify single-socket board design 
are shown in Table 16-6. 


16.7.4 SHARED WRITE-BACK PINS 


There are several signals that the Pentium Over- 
Drive processor socket has in common with the 
Write-Back Enhanced IntelDX2 processor, but which 
are located on different pins. An example of this 
would be the HITM# signal. On the Pentium Over- 
Drive processor socket, it is located in the outer row 
of pins, while on the Write-Back Enhanced IntelDX2 
processor, it is located on one of the inner rows. 
Single socket designs require that these signals be 
tied together so that the use of a jumper to reroute 
the signal is unnecessary. This is done through the 
use of the INC pin. 


Figure 16-6 shows an example of how the INC pins 
shown in Table 16-6 should be connected together 
to allow single socket compatibility between the 
Write-Back Enhanced IntelDX2 processor and the 
OverDrive processor socket. The figure is provided 
as an example only and is not intended to be guide 
for how the signals should actually be routed on a 
motherboard. : 
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16.7.5 PINOUT 


Oo Oo @) Oo O @) Oo 1 
RES RES Yoo Yoo Vegg RES RES Vgg Veesp Vecsp Veesp Vsg RES RES Veg og PLUG PLUG KEY 
Oo @) O .@) O (> O Oo O Oo O 2) Oo O O Oo Oo oO im 2 
RES A27 A28 A31 DO . D2 Veg Veg Vgg Voces, Vgg—(s«gg—S(C«COP1Vgg «= 09s*sCO9) «ZO PLUG 
Oo oO O Oo Oo O Oo O O O O Oo O Oo Oo oO Oo Oo 3 
Vsg A26 A25 Veg A29 OL Voge DE Veg 05 D3 Voge 08 Vq DIS 018 021 022 PLUG 
O Oo O Oo Oo O Oo O Oo ‘@) Oo O Oo @) O @) O O .@) 4 
Veo A235 Veg «AIT: «ASO OPO AFH SONGOOPZ-«-12_-S-IS «1017 CLK gg INC Ve 
0 Oo = 5 
Vgg VOLDET Veg AIS PLUG KEY Veo Veg 023 Veg 
eo) 6 
Vee 
O 7 
027 
O Oo @) O O Oo Oo O 8 
Veg AIZ «AIS -A22Z D26 Voc Vss_s ‘Vg 
O Oo O O O Oo Oo Oo 9 
Yeo ss. Veg. Ss A220 D268 031 029 Vee 
O oO .@) O O Oo Oo Oo 10 
Vee Yss Yeo A'l6 D350 Yee =O Yss ‘Vee 
Oo oO ‘@) O @) ‘@) ‘@) ©) 11 
Yeo Yss Yee ANS INC SMI® INC Veg 
@) Oo @) Oo oO Oo Oo Oo 12 
Yss_ Yss Yee AI INC =6YeeCOYss—s‘V 5 
Oo Oo Oo O 0 0 O 13 
Vss_Vgg ANT AS SMIACT# INC INC Veg 
@) Oo O Oo a a ©) .@) @) Oo 14 
Vos A10 AB AZ PLUG PLUG NC INC FERR® Veco 
‘@) Oo oO ‘@) a il a a O Oo ‘@) ‘@) 15 
Ysg Ygg Yee = AZ = PLUG PLUG PLUG PLUG INC UP# INC = Veg 
000000 0000:':00 00000 0 0 16 
Vec A6 AS BREQ HLDA LOCK#® D/C# PWT BEO#* BE2# BRDY# STPCLK# KEN# HOLD A20M#FLUSH# NMI IGNNE@ Vec 
'@) Oo Oo O O © O ‘@) O Oo ‘@) @) ©) .@) O Oo O Oo @) 17 
Veg A4--BLAST# PLOCK® Veg M/10® Vee Veg Veg BEI® «Veg Veg =RDY# «Veg BSB# RESET INC INTR Veg 
02 00000000000 00 0 0 0 0 0 Jfi4g 
RES ADS# CLKMUL PCHK® Veg W/R# Veg Veg Vgg PCD. Ss “Vgg_s Vg BE3® §= Vg: BOFF® BS16% EADS* AHOLD RES 
Oo @) @) O Oo ©) O O O O '@) O ‘@) oO Oo .e) re) Oo 19 
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NOTE: 
All NC and RES pins must remain unconnected 


Figure 16-4. OverDrive™ Processor Socket 3 Pinout (Top Side View) 
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All NC and RES pins must remain unconnected 


Figure 16-5. OverDrive™ Processor Socket 3 Pinout (Pin Side View) 
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A20M # 
ADS # 
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BE1# RDY # 
BEQ# RESET 
BE3# SMI# 
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BS8# W/R# 
BS16# 

CLK 

CLKMUL@) 

D/C# 

DPO 

DP1 

DP2 

DP3 

EADS # 

FERR# 

FLUSH# 
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LOCK # 
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NOTES: . 

1. All RES pins are reserved for later use by Intel. To ensure-proper operation of the microprocessor, all RES and N/C pins 
should be left unconnected. Please contact Intel for design information. 

2. These pins are valid only for the future Pentium™ OverDrive processor (3.3V). 
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Table 16-5. OverDrive™ Processor Socket Pin Description 


Symbol Name and Function 


Intel486™ PROCESSOR INTERFACE 


The Upgrade Present pin is used to signal Intel486 processor to float its outputs and stop 
driving the bus in a dual socket system design. It is active low and is never floated. UP # 
is driven low at power-up and remains active for the entire duration of the OverDrive 
processor operation. | - 


OverDrive PROCESSOR INTERFACE 


nal 
- Z 


KEY PIN 


FERR# 


The Vocsp pin supplies power to the OverDrive processor’s fan/heatsink and should be 


connected to +5V +10% regardless of the system design. Failure to connect Vccsp to 
5V will cause the component to overheat. 


The INIT input will force the OverDrive processor to begin execution in a known state. 
The processor state after INIT is the same as the state after RESET, except that the 
internal caches, floating point register and SMM base register retain whatever values they 
had prior to INIT. INIT may not be used in lieu of RESET after power-up. | 


The Key pin Is an electrically non-functional pin which ensures correct orientation for the 
OverDrive processor. Socket plugs are also used to ensure correct orientation. 


Table 16-65. Single Socket Compatibility Signals 


Write-Back Enhanced Inte Pentium™ OverDrive™ Pentium OverDrive 
intelDX2T Processor Socket Processor Socket 
Processor Signal Pin Signal Pin INC Pins 
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Figure 16-6. Sample Routing of INC Pins 


16.8 3.3V Socket Specification 


Socket 6 is a 235-pin socket and accepts 3.3V Over- 


Drive processors only. The keying mechanism con- 
sists of a Key pin (A1) and five missing pins (B1, C1, 
A2, A3, and A19). Since it is designed for 3.3V: Over- 
Drive processors only, socket 6 must not accept the 
169-pin OverDrive processors, therefore, the key 
pin, E5, is missing. To be effective as a keying 
mechanism, the locations in the socket correspond- 
ing to the six missing pins must be plugged. 


In addition, the location of the pin 1 corner should be 
clearly marked on the motherboard. 


Systems designed with the OverDrive processor 
‘socket 3 could use the future Pentium OverDrive 
processor for IntelDX4 processor-based systems, as 
well as the Pentium OverDrive processor. However, 
the OverDrive processor for IntelDX4 processor- 
- based systems requires a 3.3V supply, while the 
Pentium OverDrive processor requires a 5V supply. 
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Therefore, the supply voltage to the socket 3 (Vcc) 
must be 3.3V when it is used with the future Pentium 
OverDrive processor for IntelDX4 processor-based 
systems, and 5V when it is used with the Pentium 
OverDrive processor. ; 


However, the fan/heatsink does require a 5V power 
supply, Vccsp, as specified in Tables 16-4 and 16-5. 
To ensure adequate air circulation, the additional 
clearance specified in Table 16-1 must be provided. 


16.9 DC/AC Specifications 


The electrical specifications in this section represent 
the electrical interface of the OverDrive processor 
for Intel486 processor-based systems. The Over- 
Drive processor will be compatible to the maximum 
ratings and AC Specifications of Intel486 proces- 
sors. Tables 16-7 and 16-8 provide the unique DC 
Operating Conditions for the OverDrive processors. 
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Table 16-7. Pentium™ OverDrive™ Processor Socket (5V) DC Parametric Values 
Functional Sse Range: Vcc = 5V + 5%; a ae = 0°C to <a 


Power Supply Current 
CLK = 25 MHz 
CLK = 33 MHz 


Input Capacitance 
1/O or Output Capacitance 
CLK Capacitance 


Table 16-8. Future Pentium™ OverDrive™ Processor Socket (3.3V) DC Parametric Values 
Functional Operating Range: Vcc = 3.3V +0.3V; Tsinx = 0°C to + 85°C. 


[Parameter |Win [Max 
Tice | PowerSunpy Curent | —~(| 000 | map 
tcc | Reterence Supply urent | —+| 100 -| A | ote) 
TIccse | _Fan/Heatsink Supply Curent | | 200 | ma | tan @ 6V 
[input Capactance | tt or 
P25 | Woo capacenis [fe 


NOTES: 
1. To avoid damaging the OverDrive processor when the 3.3V power supply is accidentally connected to Vss, the system 
must limit this current to less than 55 mA. 
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17.0 ELECTRICAL DATA 


The following sections describe recommended elec- 
trical connections and electrica! specifications for 
the Intel486 processor. 


17.1 Power and Grounding 


17.1.1 POWER CONNECTIONS 


The Intel486 processor is implemented in CHMOS 
_ technology and has modest power requirements. 


However, the high clock frequency output buffers © 


can cause power surges as multiple output buffers 
drive new signal levels simultaneously. For clean on- 
chip power distribution at high frequency, multiple 
Voc and Vgs pins feed the Intel486 processor. 


Power and ground connections must be made to all 
external Vcc and GND pins of the Intel486 proces- 
sor. On the circuit board, all Voc pins must be con- 
nected on a Vcc plane. All Vss pins must be like- 
wise connected on a GND plane. 


17.1.2 INTEL486 PROCESSOR POWER 
DECOUPLING RECOMMENDATIONS 


Liberal decoupling capacitance should be placed 
near the Intel486 processor. The Intel486 processor, 
driving its 32-bit parallel address and data buses at 


high frequencies, can cause transient power surges, . 


particularly when driving large capacitive loads. Low 
inductance capacitors (i.e., surface-mount capaci- 
tors) and interconnects are recommended for the 
best high-frequency electrical performance. Induc- 
tance can be reduced by connecting capacitors di- 
rectly to the Vcc and Vss planes, with minimal trace 
length between the component pads and vias to the 
plane. These capacitors should be evenly distributed 
around each component on the Vcc power plane. 


_ Capacitor values should be chosen to ensure they 
eliminate both low and high frequency noise compo- 
nents. ; 
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The recommendation for the Intel486 processor 


is 9 x 0.01 uF and 9 x 0.1 uF capacitors. 


The power consumption can transition from a low 
level of power to a much higher level (or high to low 
power) very rapidly. A typical example would be en- 
tering or exiting the Stop Grant state. Another exam- 
ple would be executing a HALT instruction, causing 
the Intel486 processor to enter the Auto HALT Pow- 
er Down state, or transitioning from HALT to the 
Normal state. All of these examples may cause ab- 
rupt changes in the power being consumed by the 
Intel486 processor. Bulk storage capacitors with a 
low ESR (Effective Series Resistance) in the 10 to 
100 microfarad range are required to maintain a reg- 
ulated supply voltage during the interval between 
the time the current load changes and the point that 
the regulated power supply output can react to the 
change in load. In order to reduce the ESR, it may 
be necessary to place several bulk storage capaci- 
tors in parallel. These capacitors should be placed 
near the Intel486 processor (on the processor pow- 
er plane) to ensure that the supply voltage stays 
within specified limits during changes in the supply 
current while in operation. 


17.1.3 Vocs AND Vcc POWER SUPPLY ~ 
REQUIREMENTS FOR THE INTELDX4 
PROCESSOR 


In mixed voltage systems that will be driving In- 
telDX4 processor inputs in excess of 3.3V, the Vocs 
pin must be connected to the system 5V supply. In 
order to limit current flow into the Vocs pin, there is a 
limit to the voltage differential between the Vocs pin 
and the other Vcc pins. The voltage differential be- 
tween the Vccs pin of the IntelDX4 processor and its 
3.3V Voc pins should never exceed 2.25V. The 
2.25V limit applies to power up, power down and 
steady state operation. Table 17-1 outlines this re- 
quirement. 


Table 17-1. Dual Power Supply Requirements for the intelIDX4™ Processor 


[Syrmbor| Parameter [ Min | Mx 


VDIFF Vocs-Vcc 2.25 
Difference 
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Vocs input should not exceed Voc by more than 2.25V during 
power-up, power-down or during operation. 


Notes 


S 

intel ; 

Meeting this requirement ensures proper operation 
of the IntelDX4 processor and guarantees that the 
current draw into the Vccs pin will not exceed the 
locs specification (see section 17.3.1, “DC Specifi- 
cations’). If the voltage difference requirement can- 
not be met due to system design limitations, then an 
alternate solution may be employed. A minimum of a 
100 series resistor may be used to limit the current 
into the Vocs pin. This resistor will ensure that cur- 
rent drawn by the Vcocs pin will not exceed the maxi- 
mum rating of 55 mA for this pin (see section 17.2, 
“Maximum Ratings’’). 


5V (+- 0.25V) 


o——_—A /\\———_> Ves 


100 Q (+- 5%, 0.5W) 
, 242202-F8 


Figure 17-1. IntelDX4™ Processor Vcocs 
Current Limiting Resistor 


Note that this resistor is not necessary if the system 
can guarantee that the voltage difference between 
Vocs and Vcc is always limited to 2.25V, even dur- 
ing power up and power down. 


In 3.3V-only systems and systems that will be driving 
all IntelDX4 processor inputs and I/Os from 3.3V 
logic, the Vccs pin should be connected directly to 
the 3.3V Vcc plane. This will guarantee the voitage 
difference specification is met and will eliminate the 
current draw into the Vcocs pin. In a 3.3V-only sys- 
tem, the Vccs may be connected to the 5V supply 
as described previously, as long as the voltage dif- 
ferential in Table 17-1 is met, and assuming the cur- 
rent drawn by the Vccs pin is of little consequence 
.to the system design. 


17.1.4 SYSTEM CLOCK RECOMMENDATIONS 


It is recommended that the CLK input to the Intel486 
processor should not be driven until Vcc has 
reached its normal operating level (either 3.3V or 
5V). The CLK input may be grounded or allowed to 
ramp with Vcc during this period. Once Vcc has 
reached its normal operating level, the Intel486 
processor can handle the clock frequency for which 
it is specified and the oscillator/clock driver should 
have locked onto its desired frequency. 
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17.1.5 OTHER CONNECTION 
RECOMMENDATIONS 


NC pins should always remain unconnected. Con- 
nection of NC pins to Vcc or Vss or to any other 
signal can result in component malfunction or in- 
compatibility with other steppings of the Intel486 
processor family. 


INC (Internal No Connect) pins are not connected to 
any internal pad in Intel486 and OverDrive™ proc- 
essors. However, new signals are defined for the 
location of the INC pins in the Intel486 processor 
proliferations. All INC pins defined by Intel have a 
specific use for jumperless single socket compatibil- 
ity with current and future processors. A system de- 
sign could connect any signal to an INC pin without 
affecting the operation of the processor. However, 
the purpose of a specific INC pin should be under- 
stood before it is used. If not, the system design will 
sacrifice the ability to implement a jumperless (single 


socket) flexible motherboard. 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Active LOW in- 
puts should be connected to Vcc through a pull-up 
resistor. Pull-ups in the range of 20 KN are recom- 
mended. Active HIGH inputs should be connected to 


GND. 


17.2 Maximum Ratings 


Table 17-2 is a stress rating only, and functional op- 
eration at the maximums is not guaranteed. Function 
operating conditions are given in Table 17-3 for 3.3V 
processor DC Specifications, Table 17-9 for 5V DC 
Specifications, Tables 17-17 through 17-20 for 3.3V 
processor AC specifications, and Tables 17-23 
through 17-25 for 5V processor AC specifications. 


Extended exposure to the Maximum Ratings may af- 
fect device reliability. Furthermore, although the In- 
tel486 processor contains protective circuitry to re- 
sist damage from static electric dischargé, always 
take precautions to avoid high static voltages or 
electric fields. 
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Table 17-2. Absolute Maximum Ratings 


—65°C to + 110°C 
C 


Case Temperature 
under Bias 


—65°C to + 150° 


—0.5 to Voc + 0.5V 
—0.5 to Vocs + 0.5V(1) 


Voc —0.5V to +6.5V(2) 
Voc —0.5V to + 4.6V(1) 
Vocs(1) —0.5V to +6.5V(1) 


— 1.6V to Vocs + 1.6V(1,3) 
NOTES: 


1. For IntelDX4™ processor only. 

2. All Intel486™ processors except IntelDX4 processor. 

3. Maximum voltage on any pin with respect to ground is the lesser 
of Vcc5 + 1.6V or 6.5V for the IntelDX4 processor. 


Storage Temperature 


DC Voltage on Any Pin 
with Respect to Ground 


Supply Voltage with 
Respect to Vss 


Transient Voltage on 
Any Input 


Maximum Allowable 
Current Sink on Vccs(") 
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17.3 DC Specifications 


17.3.1 3.3V DC CHARACTERISTICS 


Table 17-3 is for Intel486 SX, Intel486 DX, IntelDX2™, Write-Back Enhanced IntelDX2, and IntelDX4 proces- 
sors. 


Table 17-3. 3.3V DC Specifications 
Functional operating range: Vcc = 3.3V +0.3V; Vocs = 5V +0.25V (Note 17); Tcase = 0°C to +85°C 


Input LOW Voltage 


| Min 
feviad | 03 
> oe 
Me | Voo=0.6 


ViHC Input HIGH Voltage of CLK, CLK2 Voc — 0.6 


Output LOW Voltage 

lo. = 2.0mA 

lo. = 100 pA 

Output HIGH Voltage 

loH = —2.0mA 2.4 
lon = —100 pA amen 0.2 


Iocu UP # Active Supply Current 15 35 = 
50 “ss 


Input Leakage Current 200 = 
300 ore 


Input | InputLeakage Current = Current pa an fe 


Cin Input Capacitance 
Cout Output or |/O Capacitance 1 Zz 
14 pF 6, an 
CcoLk CLK Capacitance pF 
= we a “ 
Hip Bus Hold Low | Bus Hold Low Sustaining Current _ Current 
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NOTES: | 

1. All inputs except CLK, CLK2. (For all Intel486 processors except the IntelDX4 processor.) 

. When the processor is in Stop Grant state, the Iccy of the host processor is less than 2 mA. 

. This parameter is for inputs without internal pull-ups or pull downs and OV < Vin < Vcc. 

. This parameter is for inputs with internal pull-downs and Vij = 2.4V. 

. This parameter is for inputs with internal pull-ups and Vi_ = 0.4V. 

Fo=1 MHz; Not 100% tested. 

. For the IntelDX4 processor, this parameter is measured at: Address, Data, BEn = 4.0 mA 
Definition, Control = 5.0 mA 


NOOLON 


8. Typical values are not 100% tested. 

-9. This parameter is for Vcocs-Vcc < 2.25V. (IntelDX4 processor only.) 

10. For the IntelDX4 ‘processor only. 

11. This is the maximum current the bus hold circuit can sink without raising the node above Vj, max. IB}, should be mea- 
sured after lowering Vin to grouncl and then raising to Vi._max. (Vin = 0.8V). (Write-Back Enhanced IntelDX2 processor 
only.) | 

12. This is the maximum current the bus hold circuit can source without lowering the node voltage below Viymin. IByH 
should be measured after raising Vin to Voc (3.3V) and then lowering to Viymin. (Vin = 2.0V). (Write-Back Enhanced 
IntelDX2 processor only.) 

13. An external driver must source at least IBHLO to switch this node from low to high. (Vin = 1.3V) (Write-Back Enhanced 
IntelDX2 processor only.) . 

14, An external driver must source at least IBHHo to switch this node from high to low. (Vij <1.3V) (Write-Back Enhanced 
IntelDX2 processor only.) 

15. This parameter is for inputs with pull-downs and Vj}=2.4V. (Write-Back Enhanced IntelDX2 processor only.) 

16. All Intel486 processors except the IntelDX4 processor. 

17. Vocs should be connected to 3.3V +0.3V in 3.3V-only systems (IntelDX4 processor only.) 


Table 17-4. 3.3V Icc Values for Intel486™ SX Processor 
Functional Operating Range: Vcc = 3.3V +0.3V; Tcase = 0°C to + 85°C 


Operating 
Icc Active 
(Power Supply) 


loc Active 
(Thermal Design) 
Icc Stop Grant 


Icc Stop Clock 
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Table 17-5. 3.3V Icc Values for Intel486™ DX Processor 
Functional Operating Range: Voc = 3.3V +0.3V; Tcase = 0°C to + 85°C 


Table 17-6. 3.3V Icc Values for IntelDX2™ Processor 
Functional Operating Range: Voc = 3.3V +0.3V; Tcase = 0°C to + 85°C 


loc Active 40 MHz 
(Thermal Design) 50 MHz 
loc Stop Grant 40 MHz 

50 MHz 


Table 17-7. 3.3V Icc Values for Write-Back Enhanced IintelDX2™ Processor 
Functional Operating Range: Voc = 3.3V +0.3V; TCASE = 0°C to + 85°C 


Frequency 
loc Active 40 MHz 515 mA oA | 
(Power Supply) 50 MHz 630 mA 
Icc Active 40 MHz 309 mA 475 mA 2,3, 4 
(Thermal Design) 50 MHz 384 mA 581 mA 
Ioc Stop Grant 40 MHz 20 mA 40 mA 
50 MHz 23 mA 50 mA 
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Table 17-8. 3.3V Icc Values for IntelDX4™ Processor | 
Functional Operating Range: Voc = 3.3V +0.3V; Vocs = 5V +0.25V (Note 7); Tcase = 0°C to + 85°C © 


Operating 


Frequency Typ 


loc Active 100 MHz 

(Power Supply) 75 MHz | 

Icc Active 100 MHz 1075 mA 1300 mA 
(Thermal Design) 75 MHz 825 mA 975 mA 


100 MHz 
75 MHz 


50 mA 100 mA 
20 mA 


~ 600 pA 


loc Stop Grant 
Icc Stop Clock 


NOTES FOR TABLES 17-4 THROUGH 17-8: 

1. This parameter is for proper power supply selection. It is measured using the worst case instruction mix at Vcc = 3.6V. 
In order to support the OverDrive™ processor, care should be taken to accommodate the Maximum Power Supply 
Current Value in section 16, ‘““OverDrive Processor Socket.” 

2. The maximum current column is for thermal design power dissipation. It is measured using the worst case instruction mix 

at Voc = 3.3V. . 

3. The typical current column is the typical operating current in a system. This value is measured in a system using a typical 
device at Vcc = 3.3V, running Microsoft Windows 3.1 at an idle condition. This typical value is dependent upon the 
specific system configuration. 

4. Typical values are not 100% tested. 

5. The Icc Stop Grant specification refers to the Icc value once the Intel486 processor enters the Stop Grant or Auto HALT 
Power Down state. | 

6. The Icc Stop Clock specification refers to the Icc value once the processor enters the Stop Clock state. The Viy and Vi. 
levels must be equal to Vcc and OV, respectively, in order to meet the Icc Stop Clock specifications. 

7. Vocs should be connected to 3.3V +0.3V in 3.3V-only systems. | 
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17.3.2 5V DC CHARACTERISTICS 


Table 17-9 is for Intel486 SX, IntelSX2™, Intel486 DX, IntelDX2 Processors, and Write-Back Enhanced In- 
telDX2 processors. 


Table 17-9. 5V DC Specifications 
Functional operating range: Vcc = 5V + 0.25V; Tcase = O°C to + 85°C 


Pv | tmputtowvotage «dt 08 
Vin | tnput GH Votage ——SS*dtC 
Vor | output Low votes 
Vor | Output ich votage «dt 
iy | nput Leakage Curent 
com 


I+ 
on 
a = 


Input Leakage Current 
Output Leakage Current 


Input Capacitance 

PGA 20 

PQFP 10 
Output or 1/O Capacitance 


i 
a ae 
fasion a ll 
300 
ae aes 
ies eae 


ie 
Nn 
so fino! OU c pa | 


PGA 
PQFP 


Coik CLK Capacitance 
PGA 
PQFP 6 


VIL 
ViH 
VoL 
VOH 
lu 
HH 
He 
lLo 
Cin 


N — NM 

a 
nome) 
nn 
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NOTES: . 
1. This parameter is measured at: Address, Data, BEn 4.0 mA ° 
. Definition, Control 5.0 mA 

2. This parameter is measured at: Address, Data, BEn —1.0 mA 

| Definition, Control —0.9 mA 

. This parameter is for inputs without pull-ups or pull-downs and OV < Vin < Vcc. 

. This parameter is for inputs with pull-downs and Vij = 2.4V. 

. This parameter is for inputs with pull-ups and Vj, = 0.45V. 

. When the processor is in Stop Grant state, the Iccy of the host processor is less than 2 mA. 

Fo=1 MHz; Not 100% tested. 

. This parameter is for inputs with pull-downs and VIH=2.4V. (SRESET pin only.) | 

. This is the maximum current the bus hold circuit can sink without raising the node above Vj, max. IB}, should be mea- 

sured after lowering Vij to ground and then raising to Vi_max. (Vin = 0.8V) (Write-Back Enhanced IntelDX2 processor 
only.) 

10. This is the maximum current the bus hold circuit can source without lowering the node voltage below Viymin. IBHH 
should be measured after raising Vij to Vcc (5V) and then lowering to Vi4 min. (Vij = 2.0V) (Write-Back Enhanced 
IntelDX2 processor only.) ; 

11. An external driver must source at least Igy; 0 to switch this node from low to high. (Vin =1.6V) (Write-Back Enhanced 
IntelDX2 processor only.) , 

12. An external driver must source at least IpyHo to switch this node from high to low. (Vij, <1.6V) (Write-Back Enhanced 
IntelDX2 processor only.) 


| Table 17-10. 5V Icc Values for Intel486™ SX Processor 
Functional Operating Range: Vcc = 5V +0.25V; Tcase = 0°C to + 85°C 


loc Active 

(Power Supply) 

loc Active | 
(Thermal Design) 

loc Stop Grant | 


loc Stop Clock 


ODNAMNAYW 


Operating 
Frequency 


, Table 17-11. 5V Ic¢c Values for IntelSX2T™ Processor 
Functional Operating Range: Vcc == 5V +0.25V; Tcase = O0°C to +85°C 


| Operating 
Frequency 
loc Active 
(Power Supply) | 


Icc Active 
(Thermal Supply) 
loc Stop Grant 
loc Stop Clock 
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Table 17-12. 5V Icc Values for Intel486™ DX Processor | 
Functional Operating Range: Voc = 5V +0.25V; Tcase = 0°C to + 85°C 


Frequency 
loc Active 33 MHz 630 mA 1 
(Power Supply) 50 MHz 1000 mA 
loc Active 33 MHz 499 mA 602 mA 2, 3,4 
(Thermal Supply) 50 MHz 800 mA 956 mA 
loc Stop Grant 33 MHz. 40 mA 80 mA 5 
50 MHz N/A N/A 7 


Ico Stop Clock 200 uA 


Table 17-13. 5V Ic¢c Values for IntelDX2™ Processor | 
Functional Operating Range: Vcc = 5V +0.25V; Tcase = O0°C to + 85°C 
Operating 

Parameter | Seeratng | typ | maximum | Notes 

loc Active 50 MHz 950 mA 1 

(Power Supply) 66 MHz 1200 mA 

Icc Active 50 MHz 680 mA 906 mA 2,3,4 

(Thermal Supply) 66 MHz 901 mA 1145 mA 

loc Stop Grant 50 MHz 35 mA 70 mA 5 

66 MHz 45mA ~~ 9O0mA 
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Table 17-14. 5V loc Values for Write-Back Enhanced IntelDX2™ Processor 
Functional Operating Range: Vo = 5V +0.25V; Tcase = 0°C to + 85°C 


Frequency | : 
Icc Active 1025 mA 1 
(Power Supply) 1350 mA . 
loc Active 928 mA 2,3,4 . 
{___ (Thermal Supply) | 1287 mA : 


' NOTES FOR TABLES 17-10 THROUGH 17-14: 
1. This parameter is for proper power supply selection. It is measured using the worst case instruction mix at Voc = 5.25V. 
2. The maximum current column is for thermal design power dissipation. It is measured using the worst case instruction mix 
~ at Voc = 5V. | 
3. The typical current column is the typical operating current in a system. This value is measured in a system using a typical 
device at Vcc = 5V, running Microsoft Windows 3.1 at an idle condition. This typical value is dependent upon the specific 
system configuration. . 

4. Typical values are not 100% tested. f 

5. The Icc Stop Grant specification refers to the Ilcc value once the Intel486 processor enters the Stop Grant or Auto HALT 
Power Down state. 

6. The Icc Stop Clock specification refers to the Icc value once the processor enters the Stop Clock state. The Viy and Vi. 
levels must be equal to Vcc and OV, respectively, in order to meet the Icc Stop Clock specifications. 

7. The 50 MHz Intel486 DX does not implement SL Technology and cannot utilize Stop Grant or Stop Clock functions. 
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17.3.3 EXTERNAL RESISTORS any potential DC paths (i.e., leakage currents. If 
RECOMMENDED TO MINIMIZE external resistors are not strong enough to 
LEAKAGE CURRENTS FOR THE WRITE- “flip” the logic level, the estimated leakage cur- 
BACK ENHANCED INTELDX2 rent on the data bus and the data bus parity pins 
PROCESSOR are approximately 2 mA.) 


The data bus and data parity pins of the Write-Back According to section 9.6.2, ‘Pin State During Stop 
Enhanced IntelDX2 processor employ internal bus Grant,” data pins must be driven low to achieve the 
hold circuitry to maintain their previous logic level lowest possible power consumption. If the Write- 
while in the Stop Grant state; external resistors are Back Enhanced IntelDX2 processor is installed in an 
not required to prevent excessive current during the existing Intel486 processor socket, and the socket 


Stop Grant state for the Write-Back Enhanced In- contains existing pull down resistors, these resistors 
telDX2 processor. See Table 17-15 for specifica- should be changed to the suggested values, listed in 
tions of the bus hold circuitry. If resistors are pres- Table 17-16. The values listed are recommended to 


ent, they should be strong enough to “flip” the minimize leakage current. 
logic level of the bus hold circuitry to minimize 


Table 17-15. Write-Back Enhanced IntelDX2T™ Processor DC Keeper Specifications 


IBHL Current Required to Sustain Bus Hold LOW 17 pA 33 pA 
(max) (max) 
IBHH Current Required to Sustain Bus Hold HIGH | —20 yA | —801BA | Vin @ 2.0V 
(max) (max) 
IBHLO Current Required to “Flip” Bus Hold HIGH 210 pA 330 WA | Vin = 1.38V @ Voo=3.3V 
(min) (min) Vin = 1.6V @ Voc=5V 
IBHHO Current Required to “Flip” Bus Hold LOW —350 pA} —550 pA | Vin < 1.3V @ Voc =3.3V 
(min) (min) Vin < 1.6V @ Voc =5V 


Table 17-16. Write-Back Enhanced IntelIDX2™ 
Processor Recommended Values for Bus Keeper 


| Voc | Calculation _—|__ Suggested 
3.3V 1.3V ~ 37KO 3 KD 
350 pA 
1.6V 
5V 6V 29KN 2.7 KO. 
550 wA 


Condition 
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17.4 AC Specifications 


The AC specifications given in the tables in this section consist of output delays, input setup requirements and 
input hold requirements. All AC specifications are relative to the rising edge of the input system clock (CLK) 
unless otherwise specified. | 


17.4.1 3.3V AC CHARACTERISTICS 


Table 17-17 is for 25- and 33-MHz Intel486 SX, 33-MHz Intel486 Dx, 40-MHz IntelDX2 (20-MHz Max.), 50-MHz 
IntelDX2 (25-MHz Max.), 40-MHz Write-Back Enhanced IntelDX2 (20-MHz Max.), and 50-MHz Write-Back 
Enhanced IntelDX2 Processors (25-MHz Max.). | | 


| Table 17-17. 3.3V AC Characteristics 
Functional operating range: Vcc = 3.3V +0.3V; Tcase = 0°C to +85°C; C, = 50 pF, unless otherwise 


Bus Speed 


Parameter 


Frequency 


CLK Period Stability +250 +250 +250 17-2 | Adjacent 
clocks — 
cxngtne Pe] Tele] pw [we | wa 


cuxtowtine [ef [| [| [we | 2 | wtoov 
uw fexravime | |e] || | @ || 12 [avwoay| 
fe fe 


CLK Rise Time 


A2-A31, PWT, PCD, 
BEO-3#, M/IO#, 
| D/C#, W/R#, ADS, 
| LOCK#, BREQ, 
HLDA, CACHE #, 
HITM#, SMIACT#, 
FERR # Valid Delay 


7 A2-A31, PWT, PCD, 
BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, 
LOCK #, BREQ, 
HLDA, CACHE # Float 
Delay 


ts PCHK # Valid Delay 


BLAST #, PLOCK# 
Valid Delay 


tg | BLAST#, PLOCK# 
Float Delay 


7) 

5°) 

n°) 
Q. 

= 

0) 

fom 


+ 
o 


 * 
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Table 17-17. 3.3V AC Characteristics (Continued) a8 
Functional operating range: Vcc = 3.3V +0.3V; Vocs = 5V +0.25V (Note 1); Tcase = O0°C to + 85°C; 
C. = 50 pF, unless otherwise 7 a" 


Bus | BusSpeed 
Symbol Parameter | 20MHz | 25MHz | 33MHz | Notes 


tio DO-D31, DPO-—DP3 26 20 19 7-6 / 
Write Data Valid Delay 
t+4 DO-D31, DPO—DP3 37 28 20 17-7 3 
Write Data Float Delay 
EADS #, INV Setup 10 
Time 
EADS #, INV Hold 
Time 
ty4 KEN #, BS16#, BS8#, | 10 7-3 
WB/WT # Setup Time 
t15 KEN #, BS16#, BS8#, | 3 3 17-3 
WB/WT # Hold Time 
tye | RDY#, BRDY# Setup | 10 17-4 
Time 
t17 RDY #, BRDY # Hold 3 3 3 17-4 
Time 
HOLD, AHOLD Setup 12 10 
rege 


trea _| BOFF # | BOFF# Setup Time Time 17- | 17-3 


HOLD, AHOLD, 17-3 
BOFF # Hold Time 


FLUSH#, A20M#, 
NMI, INTR, SMI#, 
STPCLK#, SRESET, 
RESET, IGNNE# 
Setup Time 


FLUSH#, A20M#, 
NMI, INTR, SMI#, 
STPCLK #, SRESET, 
RESET, IGNNE# Hold 
Time 


too DO-D31, DPO-DP3, 17-3 
A4-A31 Read Setup 17-4 
Time 
| 


DO-D31, DPO-DP3, 17-3 
A4-A31 Read Hold 17-4 
Time 
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OTES: 

. 0-MHz operation is guaranteed when the. STPCLK# and Stop Grant bus cycle protocol is used. 

. IGNNE# and FERR# are present only in the Intel486 DX, IntelDX2, and Write- Back Enhanced IntelDX2 processors. 

. Not 100% tested, guaranteed by design characterization. 

. All timing specifications assume C, = 50 pF. See capacitive derating charts for additional timing delays due to loading. 

. A reset pulse width of 15 CLK cycles is required for warm resets (RESET or SRESET). Power-up resets (cold resets) 
require RESET to be asserted for at least 1 ms after Vcc and CLK are stable. 

. CACHE #, WB/WT#, HITM#, and INV are present only in the Write-Back Enhanced IntelDX2 processor. 


Table 17-18. 3.3V AC Characteristics for the 75/25-MHz IntelDX4™ Processor 
Voc = 3.3V +0.3V; Vocs = 5V +0.25V (Note 1); Tcase = 0°C to + 85°C; C_ = 50 pF 


aaa ES de 
tia | CLK Period Stability eee a 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK#, FERR#, 
BREQ, HLDA Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO# 


D/C#, W/R#, ADS#, LOCK # Float Delay ss | 


NQhROnog 


oO 


oh ~ 


Coa ot ~- or or or 


~ 
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Table 17-18. 3.3V AC Characteristics for the 75/25-MHz IntelDX4™ Processor (Continued) 
Voc = 3.3V +0.3V; Vocs = 5V +0. 25V (Note 1); Tcase = 0°C to + 85°C; C_ = 50 pF 


[symbor | ___ Parameter‘ win | tox | nt | Powe [ not 
Ty | b0-681,6F0-awiieoaaFesiGnay | | a | vw | sr | a 
Tie [eaaseseuotme fe | tm fre | 
Pe [ese vem Ps ee Pv PO 
KEN# ast6e,eseeseuptme |e |m|ioel 
rene, asiee,aser ostine |e | | m | va] 
[rove soresuwnme Le | | ree | 


RESET, FLUSH#, A20M¥, NMI, INTR, 
IGNNE # SRESET, STPCLK#, SMI# 
Setup Time 


RESET, FLUSH#, A20M#, NMI, INTR, 
IGNNE # SRESET, STPCLK#, SMI# Hold 
Time 


DO-D31, DPO-3, A4—A31 Read Setup 
Time 
17- 4 


hala 

. Vocs should be connected to 3.3V +0.3V in 3.3V-only systems. 

0-MHz operation is guaranteed when the STPCLK# and Stop Grant Acknowledge protocol is used. 

Not 100% tested. Guaranteed by design characterization. 

All timing specifications assume C, = 50 pF. See capacitive derating charts for additional timing delays due to loading. 
A reset pulse width of 15 CLK cycles is required for warm resets (RESET or SRESET). Power-up resets (cold resets) 
require RESET to be asserted for at least 1 ms after Voc and CLK are stable. 

For adjacent clocks, assumes frequency of operation is constant. STPCLK# input should be used to change frequency 
of operation. 


ie el hes 


@ 
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Table 17-19. 3.3V AC Characteristics for the 100/33-MHz intelDX4™ Processors 
Voc = 3.3V £0.3V; Vocs = 5V +£0.25V (Note 1); Tcase = 0°C to + 85°C; C, = 50 pF 


Symbol , Parameter 

CLK Frequency 

‘CLK Period 

1a CLK Period Stability 

CLK High Time 

CLK Low Time — 

CLK Fall Time 

CLK Rise Time 

A2-A31, PWT, PCD, BEO-3#, M/IO#,,. 
D/C#, W/R#, ADS#, LOCK#, FERR#, 
BREQ, HLDA Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK # Float Delay 


PCHK# Valid Delay 

BLAST #, PLOCK#, SMIACT # Valid Delay 
BLAST #, PLOCK # Float Delay 

DO-D31, DPO-3 Write Data Valid Delay 
DO-D31, DPO-3 Write Data Float Delay 
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tyo EADS # Setup Time 

t13 EADS # Hold Time 

t14 KEN#, BS16#, BS8# Setup Time 
tis | KEN#,BS16#, BS8# Hold Time 
tie RDY#, BRDY# Setup Time 


RDY #, BRDY # Hold Time 

HOLD, AHOLD Setup Time 

ti8a BOFF # Setup Time 

tig | HOLD, AHOLD, BOFF # Hold Time 
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[| Min | Max | unit | Figure | Notes 
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Table 17-19. 3.3V AC Characteristics for the 100/33-MHz IntelDX4™ Processors (Continued) 
Voc = 3.3V £0.3V; Vocs = 5V +0.25V (Note 1); Tcase = 0°C to + 85°C; C, = 50 pF 


[eres [reer __ [oo [ os [ot [rs [i 


RESET, FLUSH #, A20M#, NMI, INTR, 
IGNNE#, SRESET, STPCLK#, SMI# 
Setup Time | 


RESET, FLUSH #, A20M#, NMI, INTR, 
IGNNE#, SRESET, STPCLK#, SMI# Hold 
Time 


too DO-D31, DP0-3, A4—-A31 Read Setup 5 17-3, 
Time 17-4 
7 17-4 


a 

. Vocs should be connected to 3.3V +0.3V in 3.3V-only systems. 

0-MHz operation is guaranteed when the STPCLK# and Stop Grant Acknowledge protocol is used. 

. Not 100% tested. Guaranteed by design characterization. 

. All timing specifications assume C, = 50 pF. See capacitive derating charts for additional timing delays due to loading. 

. A reset pulse width of 15 CLK cycles is required for warm resets (RESET or SRESET). Power-up resets (cold resets) 
require RESET to be asserted for at least 1 ms after Voc and CLK are stable. 

. For adjacent clocks, assumes frequency of operation is constant. STPCLK# input should be used to change frequency 
of operation. 
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Table 17-20. 3.3V AC Characteristics for the 100/50-MHz IntelDX4™ Processor 


Voc = 3.3V +0.3V; Vocs = 5V +0.25V (Note 1); Tcase = 
Parameter 
a CLK Frequency 
CLK Period 
CLK Period Stability 
CLK High Time 
t3 CLK Low Time 
a CLK Fall Time 
CLK Rise Time 


tea A20-A31, PWT, PCD, BEO-3#, M/IO#, 
} D/C#, W/R#, ADS#, LOCK#, FERR#, 
BREQ, HLDA Valid Delay 
te A2-A19 


ote 
N 


PCHK# Valid Delay 


fee) 


8a BLAST #, PLOCK#, SMIACT # Valid Delay 


BLAST #, PLOCK# [Float Delay 
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~-- 
—_h 


= 
ooh 
oh 


EADS# Setup Time 

t13 EADS # Hold Time 

KEN #, BS16#, BS8# Setup Time 
15 KEN #, BS16#, BS8# Hold Time 
6 RDY #, BRDY # Setup Time 
RDY #, BRDY # Hold Time 

HOLD, AHOLD, Setup Time 


- 
—_ 
ine) 


- f ms | 
_"s Ss 
aS 


~- -- 
—_ _— 


ie BOFF # Setup Time 
tig | HOLD, AHOLD, BOFF # Hold Time 
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A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK# Float Delay 


; DO-D31, DPO-3 Write Data Valid Delay _ 
DO-D31, DPO-3 Write Data Float Delay 
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Table 17-20. 3.3V AC Characteristics for the 100/50-MHz IntelDX4™ Processor (Continued) 
Voc = 3.3V £0.3V; Vocs = 5V +0.25V (Note 1); Tcase = 0°C to + 85°C; C, = O pF 


smi | tam | tr | vt [re [Ho 


RESET, FLUSH #, A20M#, NMI, INTR, 
IGNNE#, SRESET, STPCLK#, SMI# 
Setup Time 


RESET, FLUSH#, A20M#, NMI, INTR, 
IGNNE #, SRESET, STPCLK#, SMI# Hold 
Time 
too DO-—D31, DPO-3, A4—A31 Read Setup 17-3, 
Time 17-4 
| 17-4 


NOTES: 

Vocs should be connected to 3.3V +0.3V in 3.3V-only systems. 

0-MHz operation is guaranteed when the STPCLK# and Stop Grant Acknowledge protocol is used. 

Not 100% tested. Guaranteed by design characterization. 

All timing specifications assume C, = 0 pF. I/O buffer modeling should be used to calculate additional timing delays due 

to loading. 

5. A reset pulse width of 15 CLK cycles is required for warm resets (RESET or SRESET). Power-up resets (cold resets) 
require RESET to be asserted for at least 1 ms after Vcc and CLK are stable. 


6. For adjacent clocks, assumes frequency of operation is constant. STPCLK# input should be used to change frequency 
of operation. 


aa lea 
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Table 17-21. 3.3V Intel486 Processor AC Specifications for the Test Access Port 
(All Intel486 Processors and Frequencies except the IntelDX4™ Processors) 
Voc = 3.3V £0.3V; Tcase = 0°C to + 85°C; C; = 50 pF 


Symbol Parameter | Min | Max | Unit | Notes 
TCK Frequency | | | af Maz ft 
TOK Ped se 


> | 
Nm | fo 


to7 TCK Low Time ) 40] | ns | at 0.8V 


=i ea 2 
TDI, TMS Setup Time | }s | |n |] 3 | 


(ne ad 
Nh 
co) 


tit | 
GD 


ts TDI, TMS Hold Time | p10} df ns | 

tg2 TDO Valid Delay | 3 | 3 | nm | 3 

t99_ | TDO Float Delay | | % fs | 8 

t34 All Outputs (Non-Test) Valid Delay )3 | 30 | ns | 3 | 

t35 All Outputs (Non-Test) Float Delay | | 36 | ns | 3 | 
All Inputs (Non-Test) Setup Time Pa | [nn }| 3 | 

t7 All Inputs (Non-Test) Hold Time p10} fn | 3 | 
NOTES: 


. TCK period < CLK period. 

. Rise/Fall times are measured between 0.8V and 2.0V. Rise/Fall times can be relaxed by 1 ns per 10-ns increase in TCK 
period. 

. Parameters tgq-t37 are measured from TCK. 

. Refer to Figure 17-18 for signal waveforms. 
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Table 17-22. 3.3V IntelDX4™ Processor AC Specifications for the Test Access Port 
(All IntelDX4 Processor Frequencies) 
Voc = 3.3V £0.3V; Vocs =5V +0.25V (Note 1); Tcase = 0°C to + 85°C; C, = 0 pF 


All Outputs (Non-Test) Valid Delay 


All Outputs (Non-Test) Float Delay 


All Inputs (Non-Test) Setup Time 


t37 All Inputs (Non-Test) Hold Time 


NOTES: 

1. Vocs should be connected to 3.3V + 0.3V in 3.3V-only systems. 

2. All inputs and outputs are TTL Level. 

3. Rise/Fall times are measured between 0.8V and 2.0V. Rise/Fall times can be relaxed by 1 ns per 10-ns increase in TCK 
period. 

4. TCK period < CLK period. 

5. Parameters tgq—tg7 are measured from TCK. 
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17.4.2 5V AC CHARACTERISTICS 


Table 17-23 is for 25- and 33-MHz Intel486™ SX, 33-MHz Intel486 DX, 50-MHz IntelSX2T (25-MHz Max.), 
50-MHz IntelDX2TM (25-MHz Max.), 66-MHz IntelDX2 (83-MHz Max.), 50-MHz Write-Back Enhanced IntelDX2 
(25-MHz ae and 66-MHz Write-Back Enhanced IntelDX2 (83-MHz Max.) processors. 


Table 17-23. 5V AC Characteristics 
Functional operating range: Vcc = 5V +0.25V; Tcase = O°C to +85°C; C; = 50 pF unless otherwise 
specified. (See also Table 17-24). 


: | Bus ee 
Parameter Notes | 


— nes sa 


CLK Period Stability . + 250 + 250 17-2 _ 
: clocks 


CLK High Time ae tae eee ae at | atav | 

CLK Low Te Pa [ [| [ne [wre | woev 
cLK Fale ee [re [ire Tavooev) 
CuK ie Ti Te [2 [om [are [ava 


A2-A31, PWT, PCD, BEO-3#, 3 19 3 16 17-6 
M/lO#, D/C#, W/R#, ADS#, 

LOCK #, BREQ, HLDA, SMIACT #, 
FERR#, CACHE#, HI’M# Valid 

Delay oo, 
A2-A31, PWT, PCD, BEO-3#, 

M/lO#, D/C#, W/R#, ADS#, 

LOCK #, BREQ, HLDA, CACHE # 

Float Delay 


CHK Valid Delay 2a REYES EIECS 175 
BLAST #, PLOCK # Float Delay rs{ T= tele 17-7 


DO-D31, DPO—DP3 Write Data Valid 20 18 17-6 

Delay 

DO-D31, DPO-—DP3 Write Data Float 28 20 = 
Delay 


EADS®, INV Setup Time mriaeric cs ee 
EADS#, INV Hold Time [fal [re | ara” 


KEN #, BS16#, BS8#, WB/WT # 17-3 
Setup Time 


~~ 
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Table 17-23. 5V AC Characteristics (Continued) 
Functional operating range: Vcc = 5V +0.25V; Tcase = O°C to +85°C; Si 50 pF unless otherwise 
specified. (See also Table 17-24). 


Bus ae te 


KEN #, BS16#, BS8#, WB/WT # 
a Time 


| tye | RDY#, BRDY # | RDY#,BRDY# Setup Time Time ttt Pied 4 
Co Ml a 
rie [rows anowosmpriw [| Pe fmf ma 
en [BorFeseuptine «| | e{ | m= | wa 
rs freien commana [Le [Pm [8 [ 


FLUSH #, A20M#, NMI, INTR, SMI#, 
STPCLK #, SRESET, RESET, 
IGNNE # Setup Time 


FLUSH #, A20M #, NMI, INTR, SMI#, 
STPCLK#, SRESET, RESET, 
IGNNE # Hold Time 


too DO-—D31, DPO-—DP3, A4—A31 Read 17-3, 
Setup Time 17-4 
to3 DO-D31, DPO-—DP3, A4—A31 Read 3 3 17-3, 
Hold Time 17-4 


NOTES: 

1. O-MHz operation is guaranteed when the STPCLK# and Stop Grant bus cycle protocol is used. 

2. Not 100% tested, guaranteed by design characterization. 

3. IGNNE# and FERR# are present only in the Intel486 DX, IntelDX2, and Write-Back Enhanced IntelDX2 processors. 
4. CACHE#, WB/WT#, HITM#, and INV are present only i in the Write-Back Enhanced IntelDX2 processor. 
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The following specifications are different for existing IntelSX2, IntelDX2 and Write-Back Enhanced IntelDX2 
processors. A system board that will support all of the Intel486 processors should be designed to the worst- 
case Spegncanons of the 25- and 33-MHz local bus timings. 


Table 17-24 is for 50-MHz IntelSX2T™ (25-MHz Max.), 50-MHz IntelDX2TM (25-MHz Max.), Pars IntelDX2 
(83-MHz Max.), 50-MHz Write-Back Enhanced IntelDX2 (25-MHz Max.) and 66-MHz Write-Back Enhanced 
IntelDX2 (83-MHz Max.) processors. | 


| Table 17-24. 5V AC Characteristics | | 
Functional operating range: Vcc = 5V +0.25V; Tcase = O°C to +85°C; C, = 50 pF unless otherwise 
- specified. (See also Table 17-23). a 


win [ wx | win | Max 


a es 
CuK Frenioe Taps [ef [wel 


A2-A31, PWT, PCD, BEO-3#, M/IO#, D/C#, 14 
W/R#, ADS#, LOCK #, BREQ, HLDA, 

SMIACT #, FERR#, CACHE#, HITM# Valid : 
Delay 


a: 

—— 

— PCHK # Valid Delay et | le fs | 
| tga | BLAST#, PLOCK# Valid Delay Sees ae 
P tio 
Pte 
= 


Do- D31, DPO—DP3 Write Data Valid Delay Le aa ie | 
HOLD, AHOLD Setup Time . Pat | | of ms fo 
tiga BOFF # Setup Time ps] fz] fn} | 


FLUSH #, A20M#, NMI, INTR, SMI#, 
, STPCLK#, SRESET, RESET, IGNNE # Setup 
Time 


NOTES: 
. 0-MHz operation is guaranteed when the STPCLK# and STOP GRANT bus cycle protocol is used. 
B Not 100% tested, guaranteed by design characterization. 
3. IGNNE# and FERR# are present in the IntelDX2 and Write-Back Enhanced IntelDX2 processors only. 
4. CACHE #, WB/WT #, HITM#, and INV are present only in the Write-Back Enhanced IntelDX2 processor. 
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Table 17-25. 5V AC Characteristics 50-MHz Intel486™ DX Processors 
Functional operating range: Vcc = 5V +0.25V; Tcase = 0°C to + 85°C; C; = (Note 1). 


CLK Period 


CLK Period Stability 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK#, BREQ, 
HLDA, FERR # Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK#, BREQ, 
HLDA Float Delay 


FLUSH #, A20M#, NMI, INTR, RESET, 
IGNNE # Setup Time 
FLUSH#, A20M#, NMI, INTR, RESET, 
IGNNE# Hold Time 
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pms | 77 
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2V to 0.8V 
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Table 17-25. 5V AC Characteristics 50-MHz intel486™ DX Processors (Continued) 
Functional operating range: Voc = 5V +0.25V; TcasE = 0°C to +85°C; C, = (Note 1). 


Parameter — 


nent). First order |/O buffer models for the Intel486 processor are available. Contact Intel for the latest release. 
2. Not 100% tested. Guaranteed by design characterization. 


Table 17-26. 5V Intel486 Processor AC Specifications for the Test Access Port 
= | | (All Processors and Frequencies) | 
Voc =5V +0.25V; Tcase = 0°C to + 85°C; C, = 50 pF 


Parameter | Min | 
to4 TCK Frequency id MHz 
tos TCK Period | 125 
TCK High Time 40 at 2V 
to7 TCK Low Time 40 at 0.8V 
tog TCK Rise Time 
TCK Fall Time 


TDI, TMS Setup Time 
TDI, TMS Hold Time 
TDO Valid Delay 

—TDO Float Delay 
All Outputs (Non-Test) Valid Delay 
All Outputs (Non-Test) Float Delay 
All Inputs (Non-Test) Setup Time 
All Inputs (Non-Test) Hold Time 
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NOTES: 

1a. Voc should be connected to 3.3V +0.3V in 3.3V-only systems. 

. TCK period < CLK period. : 
. Rise/Fall times are measured between 0.8V and 2.0V. Rise/Fall times can be relaxed by 1 ns per 10-ns increase in TCK 
period. . 

. Parameters tg39-t37 are measured from TCK. 

. Refer to Figure 17-18 for signal waveforms. 
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Figure 17-2. CLK Waveforms 
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Figure 17-3. Input Setup and Hold Timing 
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Figure 17-4. Input Setup and Hold Timing . 
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Figure 17-5. PCHK # Valid Delay Timing 
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Tx 


CLK 


MIN 
A2-A31, PWT, PCD, (6) | [MAX 


BEO-3#, M/IO#, 


eee 
D/C#, W/R#, ADS#, VALID n KXX VALID n+1 
LOCK#, FERR#, oe 
BREQ, HLDA, MIN 


CACHE#, HITM# @ als, 
DO-D31 RR 
DPO-3, VALID n XXX VALID n+1 
MIN 


@he 


VALID n KXX VALID n+1 


242202-G3 


Figure 17-6. Output Valid Delay Timing 


CLK 


A2-A31, PWT, PCD, (&) 
BEO-3#, M/lO#, _ 
D/C#, W/R#, ADS#, oA y 
LOCK#, FERR#, 
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CACHE# 


DO-D31, Caw a & 
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Figure 17-7. Maximum Float Delay Timing 
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Figure 17-8. Test Signal Timing Diagram 


17.5 Capacitive Derating Curves 


The capacitive derating curves illustrate output delay 
versus capacitive load for 3.3V and 5V Intel486 
processors. The derating curves show the delays for 


the rising and falling edges under worst-case condi- | 


tions. Figure 17-9 and Figure 17-10 apply to all 
3.3V Intel486 SX, Intel486 DX, and IntelDX2 


3.3V Inte 


Delay (ns) 


NOTE: 


nom = nominal value from the AC Characteristics table. 


1486™ SX, Intel486 DX, and intelDX2™ Processors (Rising) 


| | 


Capacitive Load (pF) 


This graph will not be linear outside of the capacitive range shown. 


Figure 17-9. Typical Loading Delay versus Load Capacitance under 


242202-G5 


processors. Figure 17-11 and Figure 17-12 apply to 
5V Intel486 DX and IntelDX2 processors. Figure 17- 


13 and Figure 17-14 apply to 5V Intel486 SX and 


IntelSX2 processors. Figures 17-15 through 17-17 
apply to the IntelDX4 processor. The figures apply to 
all frequencies specified for each corresponding 
product. Refer to Appendix C for bus frequencies 
above 33 MHz for Intel486 processors. 


242202-G6 


Worst-Case Conditions for a Low-to-High Transition 
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NOTE: 
This graph will not be linear outside of the capacitive range shown. 
nom = nominal value from the AC Characteristics table. 


Figure 17-10. Typical Loading Delay versus Load Capacitance under 
Worst-Case Conditions for a High-to-Low Transition 


Delay (ns) 


Capacitive Load (pF) 


242202-G8 


NOTE: 
This graph will not be linear outside of the capacitive range shown. 
nom = nominal value from the AC Characteristics table. 


Figure 17-11. Typical Loading Delay versus Load Capacitance under 
Worst-Case Conditions for a Low-to-High Transition 
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£& nom+3 
= nom+2 
is 

nom-2 

25 50 aan 

_ Capacitive Load (pF) 
242202-G9 

NOTE: 


This graph will not be linear outside of the capacitive range shown. 


| nom = nominal value from the AC Characteristics table. 


Figure 17-12. Typical Loading Delay versus Load Capacitance under 
Worst-Case Conditions for a High-to-Low Transition 


5V Intel486™ SX and IntelSX2™ Processors (Rising) 


Delay (ns) 


Capacitive Load (pF) 
242202-HO 


This graph will not be linear outside of the capacitive range shown. 
nom = nominal value from the AC Characteristics table. 


Figure 17-13. Typical Loading Delay versus Load Capacitance under | 
Worst-Case Conditions for a Low-to-High Transition 
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NOTE: 
This graph will not be linear outside of the capacitive range shown. 
nom = nominal value from the AC Characteristics table. 


Figure 17-14. Typical Loading Delay versus Load Capacitance under 
Worst-Case Conditions for a High-to-Low Transition 


intelDX4™ Processor (Falling) (3V Signals) 
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Figure 17-15. IntelDX4™ Processor Capacitive Derating Curve for 
High-to-Low Transitions (3V Signals) 
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Figure 17-16. IntelDX4™ Processor Capacitive Derating Curve 
for Low-to-High Transitions (5V Signals) 


intelDX4™ Processor (Rising) (3V/5V Signals) 
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Figure 17-17. IntelDX4T™ Processor Capacitive Derating Curve 
for Low-to-High Transitions (3V/5V Signals) 
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18.0 MECHANICAL DATA 


This section describes the package dimensions and thermal specifications for all processors in the Intel486 
processor family. 


NOTE: 
For further details about thermal and mechanical package specifications and methodologies, refer to 
the 1994 Packaging Handbook (order number 240800). 


18.1 Intel486 Processor Package Dimensions 


The processor dimensions are listed in the following order: 
¢ 168-pin PGA package; 

e 208-lead SQFP package; 

e 196-lead PQFP package. 


18.1.1 168-PIN PGA PACKAGE 


SEATING 


SEATING 
PLANE 


@B (ALL PINS) 


to 


SWAGGED 
PIN 
DETAIL 


SWAGGED 
1.52 ; PIN 
45° CHAMFER (4 PL) BASE 

(INDEX CORNER) 


242202-H4 


Figure 18-1. 168-Pin Ceramic PGA Package Dimensions 
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Table 18-1. 168-Pin Ceramic PGA Package Dimensions 


_ Family: Ceramic Pin Grid Array Package 


Millimeters 
Max 
4.57 


1.14 SOLID LID 0.025 0.045 SOLID LID 
3.5 SOLID LID 0.110 0.140 SOLID LID 


1.40 
0.51 
44.83 
40.77 
2.79 
3.30 
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Table 18-2. Ceramic PGA Package Dimension Symbols 
Description of Dimensions 
| Distance from seating plane to highest point of body 
Distance between seating plane and base plane (lid) 
Distance from base plane to highest point of body 
Distance from seating plane to bottom of body 
Diameter of terminal lead pin 
Largest overall package dimension of length 
A body length dimension, outer lead center to outer lead center 
Linear spacing between true lead position centerlines 
Distance from seating plane to end of lead 
Other body dimension, outer lead center to edge of body 


a ae 
a: ae 


NOTES: 

1. Controlling dimension: millimeter. 

2. Dimension ‘“‘e,” (“e”) is non-cumulative. 

3. Seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 inch. 
4. Dimensions “B”, ‘‘B;” and “C” are nominal. 

5. Details of Pin 1 identifier are optional. 


2-348 


intel : Intel486™ PROCESSOR FAMILY 


18.1.2 208-LEAD SQFP PACKAGE 


0.13+0.12-0.08 


0.60+0.10 
1.30 Ref 


* Note* Length Measurements same 
as Width Measurements 


Tolerance Window for 
Lead Skew from Theoretical 


“th: True Position 
ies 0.10 Max 


- Units: mm 
242202—H5 


Figure 18-2. 208-Lead SQFP Package Dimensions 
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18.1.3 196-LEAD PQFP PACKAGE 
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= 02 


[| 0.006 @|c aA®-8O©}/0® 


242202-H6 


NOTE: 
Interpret dimensions and tolerances in accordance with ANSI Y14.5M-1982. 


Figure 18-3. Principal Dimensions and Data for 196-Lead Plastic Quad Flat Pack Package | 


Description of Dimensions | Min | Max | 
Package Height: Distance from the seating plane to the highest point of body. | 0.160 | 0.175 
Standoff: The distance from the seating plane to the base plane. 


Table 18-3. Symbol List and Dimensions for 196-Lead PQFP Package 


Overall Package Dimension: Lead tip to lead tip. 
Plastic Body Dimension 


Bumper Distance 
Without FLASH 
With FLASH 


Seating Plane Coplanarity 0.000 


NOTES: 
~ 1. All dimensions and tolerances conform to ANSI Y14.5M-1982. 
2. Dimensions are in inches. 
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DETAIL C 
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om Of 


ECS 
Les 


0.00 
0.01 


9 | 
2 


DETAIL B 
242202-L9 


(0.675, -0.625) 


(0.010) MAX 


(0.730, -0.660) 
(0.739, -0.665) 


2° ANGLE ON 
3 SURFACE OF THE 
BUMPER BOTTOM HALF 


242202-H7 


Figure 18-4. Typical Lead 


18.2 Package Thermal Specifications 


The Intel486 processors are specified for operation 
when Tc (the case temperature) is within the range 
of 0°C-85°C. To may be measured in any environ- 
ment to determine whether the Intel486 processor is 
within the specified operating range. The case tem- 
perature, with and without heat sink should be mea- 
sured using a 0.005” diameter (AWG #36) thermo- 
couple with a 90° angle adhesive bond at the center 
of the package top surface, opposite the pins. Figure 
18-5 and Figure 18-6 illustrate this methodology. 


The ambient temperature (T,) is guaranteed as long 
as Tc is not violated. The ambient temperature can 
be calculated from @jc and @ ja from the following 
equations. 

Tj Ter P* O36 

Ta = Ty — P* Oya 


To = Ta + P* [0ya-95c] 
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Where: 
‘Ty, Ta, To = Junction, Ambient and Case Temper- 
7 ature, respectively. 

9 jc, 9yA = Junction-to-Case and Junction-to-Am- | 
bient thermal Resistance, respective- 
ly. 

P = Maximum Power Consumption 


The values for 0ja and @jc are given below for the 
packaging and operating frequencies. 


Note that T, is greatly improved by attaching ‘“‘fins’’ 
or a “heat sink” to the package. P (the maximum 
power consumption) is calculated by using the maxi- 
mum Icc at nominal Vcc (either 3.3V or 5V) as tabu- 
lated in the DC Charisteristics in section 17. 


242202-H8 
0.005” diam. thermocouple on the center of the pack- 
age top surface with a 90° angle adhesive bond 


Figure 18-5. Case Temperature nro 
| malinout Heat Sink 


Heat Sink — 


a = 


N 
N 
S 


242202-H9 
0.005” diam. thermocouple on the center of the pack- 
age top surface with a 90° angle adhesive bond 
through a hole drilled at the heat sink base 


Figure 18-6. Case Temperature Measurement 
| _with Heat Sink 
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_ Refer to section 16, “OverDrive Processor Sock- 


et,” for OverDrive processor thermal specifica- 


_ tions. 


18.2.1 168-PIN PGA PACKAGE THERMAL 
CHARACTERISTICS FOR 3.3V IntelDX4 
PROCESSOR | 


Desktop Applications 


All thermal measurements for the PGA package 
were taken with part in a socket mounted to a 4.5” x 
4.5" printed circuit board with 2 power plane layers | 
and 2 signal layers in a test chamber. 


Table 18-5 shows maximum ambient temperatures 
of IntelDX4 processor for each core operating fre- 
quency. The maximum ambient temperature is mea- 
sured as ambient air temperature in the system 
case. These maximum ambient temperatures are 
not valid for the OverDrive processor. 


: 


0.079" —| }— 


—{-0.063" 4-060" 


iitioonat 


242202-10 


Figure 18-7. Sample IntelDX4™ Processor 
PGA Heat Sink 


| 
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Table 18-4. Desktop PGA Package Thermal Resistance (°C/W)-6 jc and Oy, for IntelDX4™ Processor 


idle 
(2.03) Te — - a (5. eee 
ee ae ae OT 


*0.350” high omnidirectional heat sink. 


Table 18-5. Desktop PGA Package Maximum Ambient Temperature for IntelDX4™ Processor 


Airflow—ft/min (m/sec) 
Pie se-a5-30-9 
—— (0) (2.03) <a — 
es a Oe Oa 


*0.350” high omnidirectional heat sink. 


Mobile Applications for IntelDX4 Processor Table 18-6. Mobile PGA Package Thermal 


Resistance (°C/W)—0@ jc and 65, and Tampient 
The thermal resistances and maximum ambient tem- 
peratures for the IntelDX4 processor in the PGA 


package are given in Table 18-6. 


NOTE: sash a Sink 
These values should be used as guidelines f 
only, and are highly system dependent. Mo- 100 MHz 
bile Applications assume no airflow (if airflow Without Heat Sink 
exists in the system, the Desktop Application 75 MHz 


values should be used). All thermal mea- 
surements for the PGA package were taken 
with part in a socket mounted toa 45” x *0.350” high omnidirectional heat sink. 
4.5” printed circuit board with 2 power plane 

layers and 2 signal layers in a test chamber. 

Final system verification should always refer 

to the case temperature specification. 


100 MHz 


: 2-353 


Intel486™ PROCESSOR FAMILY intel 7 


18.2.2 168-PIN PGA PACKAGE: THERMAL CHARACTERISTICS FOR 5V Intel486 PROCESSORS 


Table 18-7. Thermal Resistance (C/W) O05c and 60 ja for the 168-Pin PGA Package of the Intel486™ 
Processor 


OJA vs. Airflow—ft/min. (m/sec) 


*0.350” high omnidirectional heat sink. 


Table 18-8. Maximum Tambient for the 5V, 168-Pin PGA intel486™ Processor Family 


‘Intel486T SX Processor 
Tambient °C with Heat Sink* 


Tambient °C without Heat Sink 


IntelSX2T™ Processor — 


Tambiont°CwithHeat inky | 0 | 98 | so ||| 
Tambient °C without Heat Sink 22 
Intel486™ DX and IntelDX2™ Processors** 


Tambient °C with Heat Sink* 74 
69 
65 
Tambient °C without Heat Sink 59 
46 
36 

Write-Back Enhanced IntelDX2™ Processors | 
Tambient °C with Heat Sink 68 
| 62 
Tambient °C without Heat Sink 43 
30 


NOTE: 
* 0.350” high omnidirectional heat sink. 
** For 33- and 50-MHz Intel486 DX processors and 50- and 66-MHz IntelDX2 processors. 
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18.2.3 THERMAL SPECIFICATIONS FOR 208-LEAD SQFP PACKAGE 
Table 18-9. Thermal Resistance (°C/W) 6 , for the Intel486™ Processor (for SQFP) 


0a VS. Airflow —ft/min. (m/sec) 
200 400 600 
(1.01) (2.03) (3.04) 
intel486 SX Processor without Heat Sink 


Table 18-10. Thermal Resistance (°C/W) 6jc for the Intel486™ Processor (for SQFP) 


0Jc vs. Airflow —ft/min. (m/sec) | | 
0 200 400 / 600 
(0) (1.01) (2.03) (3.04) 


Intel486 DX and IntelDX2™ Processors | 35 | 60 | 60 | 6.0 


Iintel486T™ SX Processor 


33 47 62 65 69 


Iintel486T™ DX Processor | 


IntelDX2T™ Processor 7 . 
Tambient °C without Heat Sink 
Write-Back Enhanced IntelDX2™ Processor 7 | 
Tambient°C withoutHeatSink | 40 | 63, | 8 | tH 
ee eee 
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18.2.3.1 SQFP Package Thermal Characteristics | 


for the 3.3V IntelDX4 Processor 
Desktop Applications 


All thermal measurements for the SQFP package 
were taken with part soldered to a 4.5” x 4.5” print- 
ed circuit board with 2 power plane layers and 2 
signal layers in a test chamber. 


Table 18-14 shows maximum ambient temperatures 
of IntelDX4 processor for each core operating fre- 
quency. The maximum ambient temperature is mea- 
sured as ambient air temperature in the system 
case. 


Mobile Applications 


The thermal resistances and maximum ambient tem- | 


peratures for the IntelDX4 processor in the SQFP 
package are given below. 


NOTE: 
These values should be usec! as guidelines 
only, and are highly system dependent. Mo- 
bile Applications assume no airflow (if 


intel. 


airflow exists, the Desktop Application values 
should be used). Data was taken on a 2” x 
2", 4-layer circuit board in a test chamber. 
These values have been correlated to a typi- — 
cal 8.5” x 11” x 1.5” notebook PC with the 
ambient temperature measured external 
to the system. Final system. verification 
should always refer to the case temperature 
specification. 


Table 18-12. Mobile SQFP Package Thermal 


- Resistance (°'C/W)—8@ jc and Oya and Tambient 


With Heat Sink 
75 MHz 


100 MHz 


Without Heat Sink 
75 MHz 
100 MHz 


Table 18-13. Desktop SQFP Package Thermal Resistance (°C/W)—9@ jc and 6 a for the 
. IntelDX4™ Processor 


With Heat Sink 
Without Heat Sink 


Oya VS Airflow—ft/min (m/sec) 


Table 18-14. Desktop SQFP Package Maximum Ambient Temperature for the IntelDX4™ Processor 


Tambient °C with Heat Sink 
Tambient °C without Heat Sink 


NOTE: 
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200 
== — .01) cs eo 


| 435 | 


a ae 


The SQFP Package is intended primarily for the Mobile Market. 


Airflow—ft/min — 
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18.2.4 THERMAL SPECIFICATIONS FOR 196-LEAD PQFP PACKAGE 
Table 18-15. Thermal Resistance (°C/W) 6jc and 6ya 


Oya VS Airflow—ft/min (m/sec) 
Pie ar-se-ar 
(2.03) a 04) 
ae See Se ee 


*0.350” high omnidirectional heat sink. 


Table 18-16. Maximum Tampient for the 5V, 196-Lead PQFP Intel486™ Processor Family 


Airflow—ft/min (m/sec) 
Freq. 0 200 400 ~ 600 | 
(MHz) (0) (1.01) (2.03) (3.04) 


Intel486™ SX Processor 
Tambient °C with Heat Sink* 


ee 
intel486™ DX Processor 
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APPENDIX A 
ADVANCED FEATURES 


Some non-essential information regarding the In- 
tel486 processor is considered Intel confidential and 
proprietary and is not documented in this publica- 
tion. This information is provided in the Supplement 
to the Pentium™ Processor Family User’s Manual 
and is available with the appropriate non-disclosure 
agreements in place. Please contact Intel Corpora- 
tion for details. ; 


The Supplement to the Pentium™ Processor Family 
User’s Manual contains architecture extensions for 
- the Intel486 and Pentium processors that are confi- 
-. dential and non-essential for standard applications. 
These extensions include low-level registers that 
_ provide access to features such as page extensions, 
virtual mode extensions, testing, and performance 
monitoring. 


_This information is specifically targeted at software 
developers who develop the following types of low- 
level software: 


© operating system kernels 
e virtual memory managers 
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e BIOS and processor test software 
¢ performance monitoring tools 


For software developers designing other categories 
of software, this information does not apply. All of 
the required program development details are pro- 
vided in the /nte/486™ Microprocessor Family Pro- 
grammer’s Feference Manual, which is_ publicly 
available from the Intel Corporation Literature Cen- — 
ter. To obtain this document, contact the Intel Cor- 
poration Literature Center at: 


Intel Corporation Literature Center 
P.O. Box 7641 
Mt. Prospect, IL 60056-7641 


or call 1-800-879-4683 
and reference 240486-002. 
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APPENDIX B 
FEATURE DETERMINATION 


CPUID Instruction 


The Intel486 processor implements the CPUID in- 
struction that makes information available to the 
system software about the family, model, and step- 
ping of the processor on which it is executing. Sup- 
port of this instruction is indicated by the ability of 
system software to write and read the bit in position 
EFLAGS.21, referred to as the EFLAGS.ID bit. The 
actual state of the EFLAGS.ID bit is irrelevant and 
provides no significance to the hardware. This bit is 
reset to zero upon device reset (RESET and SRE- 
SET) for compatibility with older Intel486 processor 
designs. 


Operation 


The CPUID instruction requires the software devel- 
oper to pass an input parameter to the processor in 
the EAX register. The processor response is re- 
turned in registers EAX, EBX, ECX, and EDX. 


1. When the parameter passed to EAX is zero, the 
register values returned upon instruction execu- 
tion are: 

EAX[31:0] <— 1 

EBX[31:0] < 756E6547—‘‘Genu”’, with “G” in 
the low nibble of BL 

EDX[31:0] <— 49656E69—“‘inel’’, with ‘‘i’ in the 
low nibble of DL 

ECX[31:0] <— 6C65746E—‘ntel’”’, with ‘‘n’ in 
the low nibble of CL 

The values in EBX, ECX, and EDX indicate an 

Intel processor. When taken in the proper order, 

they decode to the string ‘““Genuinelntel.”’ 


2. When the parameter passed to EAX is one, the 
register values returned upon instruction execu- 
tion are: 


EAX[3:0] <— xxxx—Stepping ID 
EAX[7:4] <— xxxx—Model 


EAX[11:8] <— 0100—Family 
EAX[15:12] <— 0000 
EAX([31:16] <— Intel Reserved 
EBX[31:0] <— 00000000 
ECX[31:0] <- 00000000 
EDX[0:0] <—- 1—FPU on-chip 


EDX[3:1] <— 1—For more information on these 
bits, see Appendix A 


~ EDX[31:4] <— Intel Reserved 


The value returned in EAX after CPUID instruc- 
tion execution is identical to the value loaded into 
EDX upon device reset. Software must avoid any 
dependency upon the state of reserved proces- 
sor bits. 


3. When the parameter in EAX is greater than one, 
the register values returned upon instruction exe- 
cution are: 


EAX[31:0] <— 00000000 
EBX[31:0] <— 00000000 
EDX[31:0] <— 00000000 
ECX[31:0] <— 00000000 


Flags Affected 


No flags are affected. 


Exceptions 


None. 


For More Information 


Refer to the Intel application note AP-485, /nte/ 
Processor Identification with the CPUID Instruction 
for more details. 


Table B-1. CPUID Instruction Description 


OPCODE Processor Core EAX Input Value Description 
Clocks 
OF A2 CPUID 14 1 Processor Identification 
9 0 or greater than 1 Intel String/Nuil Registers 
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APPENDIX C 
|[/O BUFFER MODELS 


For processor bus speeds above 33 MHz (e.g., 50 
MHz), the capacitive derating curves are not guaran- 
teed. For bus speeds of 50 MHz, |/O buffer model- 
ing techniques should be used to accurately simu- 
late (and predict) the behavior of processor signals 
in a particular environment. 


This appendix presents a sample I/O buffer model 
parameters for the IntelDX4 processor. The first sec- 
tion presents an overview of signal buffer type cata- 


-gorization. The second section presents a graphical | 
representation of IBIS (I/O Buffer Information Sheet) - 


data for each of the input, input/output, and output 
buffers types on the processor. The third section 
provides a text listing of the ae presented in the 
IBIS format. 


1/O buffer model information is available for all In- 
tel486 processors described in this datasheet. Con-. 
tact your Intel representative for the latest |/O buffer 
models for the IntelDX4 and other members of the 
Intel486 processor family. 


1/O Buffer Models ed IntelDX4 
Processor 


Each valid delay for the 50-MHz bus oe specified for 
a 0 pF load. The system designer should use I/O 


_ buffer modeling to account for signal delays due to 


loading. Table C-1 lists the buffer type to be used for 
each signal in the external interface. 


Table C-1. External interface Signal Buffer Assignment 


Signals 


IntelDX4™ 
Processor 


STPCLK#, SRESET, CLKMUL 


D1i6-—D0, DP2—DP90 
D31-D17, DP3 


A3-A2, FERR#, HLDA 


PWT, PCHK#, W/R# 


A20M #, AHOLD, BOFF #, BRDY #, BS8# 
BS16#, FLUSH#, HOLD, IGNNE #, INTR, 
KEN #, NMI, RDY#, RESET #, EADS#, SMI#, 


BE3 #-BE0#, BREQ#, D/C#, M/IO#, PCD, 


Drive Buffer | Receiver Buffer 
TYPE ue 


re 
ee 
a 


ADS#, BLAST #, LOCK#, PLOCK#, SMIACT#, wae N/A 


Sample IBIS Files for IntelDX4 Processor 


The following pages present sample IBIS file outputs for the IntelDX4 processor. 
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LI-cOeeve 


IBIS 


I/O Buffer Information Sheet 


Component: intelDX4(TM) Processor A-3 33/ S0MHz Bus 
Signals: CLK 


Simplified electrical input model 


Vcc 


GND 


Buffer Type: CLOCK BUF 
Revision: 


C comp 


ee ar EE SA Nc SA! Ree GES ne 


Beyond the Rail info 


Diode to Vec* 
ee BT ee 
| oo | 00 | ve | 00 
| 04 | 0.0 [vecras! 00 
| 05 | 02 [vecros| 00 
| 06 | 14 Yvcecros] 00 
| 07 | 30 Yvecro7| 01 
| 08 | 60 Frecoe| 10 
| 0.9 | 110 [vccro.9} 80 _| 
| 4.0 | 30.0 Precsto} 140 
ie ee 


Intel does not guaranties diode operation 
lof purposes other than ESO protection 


“NOTE: VCC = voltage at pin J1 


= 


This Information Is for modeling 
purposes only and Is not guaranteed. 


Packaging Characteristics 


€ 


ATINV4 HOSS39D0Ud wi98h191U] 


C9E-S 


Zl-Z0cere 


Component: 
Signals: 


Beyond the Railinfo « 
Diode to GND Diode to Vcec* 


i 


< <€ 
818 18 § 
&13/5 O18 |< 
© ~“ a 


N 


Z 


= 


This information Is for modeling 


purposes only and is not guaranteed. 


does not quararies chode operation 
lor purposes other than ESD protection 


l/O Buffer Information Sheet 


inteiDX4(TM) Processor A-3 
A20M¥#. AHOLD. BOFF#, BRDY#. BS16#. BS8#. EADS#, FLUSH#, HOLD. IGNNE#. INTR 
KEN#, NMI. RDY#, RESET. SRESET, SMi#. STPCLK# 


*NOTE: VCC = voltage at pin J1 


IBIS 


‘Buffer Type: INPUT #1 
Revision: 


AINV4 YOSSA0OUd wi98h19}U] 


€9E-¢ 


€i-cd0eere 


IBIS 


/O Buffer Information Sheet 
Component: _—_IntelDX4(TM) Processor A-3 33/SO0MHz Bus Buffer Type: OUT GROUP #1] 
Signals: ADS#, BLAST#, SMIACT#, A<3:2>, FERR#, HLDA, PLOCK#, LOCK# Revision: 


oe ried HIGH =a 

PRO} 1000 sete 

cael 

| 40 | -46.7 | -57.1 510 | 

| 9 | 207 | -135 | | 704 | 

Hes oo | oo | 00 | 
Z Pararae _ cna. 

E ara ee ee 2 aa 
: Rina a7 Pa) E pa ae 
= | 20 | 330] see | 35.4 | 6 | 95 | 
| 25 | 335 | 586 | 36. | = |} 46 | 65 | 
ewe Hae 

341 | soa | 369 | 


00 10 20 630 0=«640 
V_ol (volts) 


Beyond The Rail Info | 
Diode to Vcc" 
v_ [tim fv [tim 


Ramp Rate (into OpF, no pkg 


ee ee ee Se ee 
| Ailse [| 159 | 199 | 47 | | yotsins —_| 
[ron | ave | 243 


[max | unit | 
ee | 140.0 | 325.0 | mOhen | 
| oo | oo | | 00 | | 95 | 15.1] oH | 
| 04 | 00 Fveco4] 0.0 | | 56 | 97 | pF | 
| 05 | 02 Fvccos| 00 | [| 29 | 32 | pF | : 
| 06 | 11 [vecso6| 0.0 | Simplified Output Resistance 
| 07 | 30 [vcceo7] 01 | 
| 08 | 60 Pvccos] 10 | 
| 09 | 11.0 [vcceo9] 80 | a 
P10 | 300 [vecrto| 140 | E ntl. Cacao sin 
(-vcc) yf | vect2 | | 


Intel does not guarantee diode operation 
for purposes other than ESD protection 


“NOTE: VCC = voltage at pin J] 


. This information Is for modeling 
purposes only and Is not guaranteed. 
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vi-cOcere 


IBIS 


/O Buffer Information Sheet 
Component: _IntelDX4(TM) Processor A-3 33/ 50MHz Bus . Buffer Type: OUT GROUP #2 
Signats: BE<3:0>#, BREQ, D/C#, M/IO#, PWT, PCD, PCHK#, W/R# Revision: 


a pecmmneeere 


|I_ol (mA) 
i_oh (mA) 


a dated adh teen eel eee ee 
20 30 40 50 


V_ ol (volts) 


Ramp Rate (into OpF, no pkg) — 
fT min ft mox | typ | unt 
| Rise | 151 | 191 | 47 | voksns | 
a RE 


Vv 
=z 


Simplified Output Resistance 
es eee ee ee 


Sengitied Oviout Res stance calovlaed using s $0 chm load ine 


Intel does not guatantee code oper ation 
poses other than ESD protection 


"NOTE: VCC = voltage at pin J! 


This information is for modeling 
purposes only and is not guaranteed. . 


ATINV4 YOSS3D0¥d wi98h}e7U] 
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G9E-c 


Component: intelDX4(TM) Processor A-3 step 33/50MHz 
Signals: DBUS<16:0>, DP<2:0> 


(mA) 


i_ol 


“4. $44 


00 10 20 


V_ol (volts) 


Diode to GND Diode to Vcc’ 


ro 
ee 
0.0 00 

x 


| 02 | 

re ete 
| 30 J vecs0.7 | 
an 


| oo | 

zee 

ee 
-0.7 


(vec)[ ft vecr2 | 


intel does not guarantee diode operation 


for purposes other than ESD protection 


This information is for modeling 
purposes only and Is not guaranteed. 


1/0 Buffer Information Sheet 
Buffer Type: 1/O GROUP #1 


(mA) 


|_oh 


“NOTE: VCC = voltage at pin JI 


| -1050.0 | 1880.0] 


1410.0 


798 
471 
46 1 
448 


421 
-35.1 
241 


95 
65 
243, 
417 
564 
67.3 
746 
798 
82.4 
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IBIS 


1/0 Buffer Information Sheet ; 
Component: IntelDX4(TM) Processor A-3 33/ 50MHz Bus Buffer Type: |/O GROUP #] 
Signals: DBUS<31:17>,DP<3> Revision: 


Low oneal 
| -1000 | -1830 | -1380 | 
60.0 7 da ae 
| rae7 [az | 10 | 
50.0 0. 8.9 | -20. 13.5 
| oo | 0.0 
< 40.0 / ae 
E agg a | 10 | 230 | 37.2 
3 | 15 | 290] 497 | 315 | 
=e | 20 | 330 | see | 35.4 | 
H284 254 586 | 263 
au | 340 | soo | 366 | 
[35 | 341] 594 | 369 | 
0.0 faa ey. ao 942) | soz 
00 10 20 30 40 50 | 45] 344] soe | 374 


V_ol (volts) 


Om jn 
; jo 
3 3 
i ) 


372 
372 


60.0 


Beyond The Rail Info 


Diode to Vee" [mex [unit] 
ee BY RT 328.0 | mot 
| oo | 00 | vo | 00 | | 13.7 | nH 
04 | 00 [vcs 00 | 6a | oF 
ee vec+0 § | 32 | pF | 


slslsis ls 
812 18 18 | 8 
={S/S/S]6 
Shwoio an 


on’ 
° 
N 
8 


-VCC 
intel does not guasantee diode operation 
for purposes other than ESD protection 
This information Is for modeling — 
purposes only and is not guaranteed. 


*NOTE: VCC = voltage at pin J] 


9l-Z0eeve 


AINV4 HOSS300Hd w9grlequl 


LOE-S 


IBIS 


0 Buffer information Sheet 


Component: IntelDX4(TM) Processor A-3 33/ 50MHz Bus 
Signals: ABUS<31:4> 


370.u 


N 
© 


| -1000 
| 8.9 | 
rae 


05 207 
0.0 | oo | oo | 
2 05 | 20.1 | 129 | 
[a ee a cca ees 1.0 
7 | 15 | 290 | 49.7 | 
: Sala ae 
| 335 | 586 
30 | 340] 590 | 
| 35 | 344 | 594 | 
+-+-+ + +4 [40 | 362 | 597 | 
0.0 10 20 30 40 £50 | 45 | 344 | 590 
| 5.0 | 344] 60.0 | 
V_ol (volts) Sof 244 fea 
L100 [ 344 [ 600 | 


— 


Beyond The Rail info 
Olode to Vcc’ 


Vv 
00 


< 


1.0 
8.0 


< 
° 
a 
+ 
i=] 
~ 
o 
_ 


This information is tor modeling 
purposes only and is not guaranteed. 


Li-c0eere 


80.0 
60.0 


40.0 } 


20.0 


lLoh (mA) 
>) 
O° 


-20.0 


-40.0 +. 


HIGH 


Ramp Rate (into OpF, no pko 


Buffer Type: |/O GROUP #3 
Revision: 


| -95.2 | 
| 79.5 | 
[721 | 


735 


637 
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Sample Text Listing of IBIS Files for IntelDX4 Processor 


[ FRR RRR RHR EERE RR ERR RE ERE RE RE REE RE RE ee eR ee eee 


{IBIS Ver] i ee 2 
{File name] inteldx4pg.ibs 
(File Rev} 2.0 
[Date] 3/23/94 . 
[Source] File originated at Intel Corporation 
{Notes ] The following information corresponds to the INTELDX4 (TM) 


| processor and has been correlated with silicon. 

is for the PGA package only.IntelDX4 processor 
[Disclaimer] This information is for modeling purposes only, and is not 
| . guaranteed. : 


ee eee ESE EAE EAS A SS SES A ESS eS ee RA ee ee 


| 
| 


{Component ] 


This file 


INTELDX4 PROCESSOR 


‘(Manufacturer ] Intel 

[Package] 

| 

| typ min max 

R_pkg 2329m728m 3930m 

L_pkg 17.79nH 8.56nH 27.01nH 
C_pkg 6.03pF 1.89pF 10.16pF 


| , 
See ESL SEAL ESE SS SESE Se SEE Se ee EES Se ee eRe SS ae ew N 


| 


[Pin] signal_name model_name R_pin L_pin C_pin 

AO1 D20 1/02 -1866m 15.54n 3.88p 
A02 D22 I/02 1808m 13.2 0n 5.60p 
A04 D23 I/02 1468m 13.33n 3.14p 
AOS DP3 I/02 1406m 11.59n 4.50p 
A06 D24 I/02 1412m 13.02n 3.04p 
A08 D29 I/02 1274m 10.90n 4.14p 
A15 IGNNE# Inputl 1858m 13.96n 5.74p 
A16 INTR Inputl 1956m 14.47n 6.00p 
Al17 AHOLD Inputl 3414m 22.1in 9.99p 
B01 D19 I/02 1762m 13.46n 5.47p 
BO2 D21 I/02 1636m 14.27n 3.45p 
BO6 D25 I/02 1178m a Ee Olas o7 2.61p 
B08 D31 I/02 1050m 9.73n 3..52p 
B10 SMI # Inputl 948m 10.45n 2.19p 
B15 NMI Inputl 1430m 13220 3.07p / 
B17 EADS # Inputl 1728m 14.78n 3.62p 
col Dil I/Ol 2440m 18.73n 4.93p 
c02 D18 I/02 1446m 13.21n 3.10p 
C03 CLK Clockbuffer 2486m 18.99n 5.02p 
C06 D27 I/02° 964m 10.54n 2.21p 
C07 D26 I/02 892m 8.90n 3.09p 
C08 D28 I/02 752m 9.36n 1.82p 
cog D30 I/02 728m 9.22n 1.78p 
c10 SRESET Inputl 784m 9.54n L<88p 
C12 SMIACT# Outputl 1270m 12.23n 2.78p 
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FERR# 


FLUSH# 


RESET 
BS16# 
D9 
D13 
D17 
A20M# 
BS8# 
BOFF# 
D10 
HOLD 
DP1 
D8 
D15 
KEN# 
RDY# 
BE3# 
D12 


STPCLK# 


D3 
DP2 
BRDY # 
D5 
D16 
BE2# 
BE1# 
PCD 
D14 
BEO # 
D6 
D7 
PWT 
D4 
D/C# 
D2 
D1 
DPOQ 
LOCK# 
M/IO# 
W/R# 
DO 
A29 
A30 
HLDA 
A31 
Al7 
Al9 
A2l 
A24 
A22 
A20 
Al16 
Al3 
Ag 
A5 


Outputl 
Inputl 
Inputl 
Inputl 
I/O1l 
I/Ol 
I/02 
Inputl 
Inputl 
Inputl 
I/Ol 
Inputl 
I/O1 
I/Ol 
I/Ol 
Inputl 
Inputl 
Output2 
I/Ol 
Inputl 
I/Ol 
I/O1 
Inputl 
I/O1 
I/0O1 
Output2 
Output2 
Output2 
I/O1 
Output2 
I/O1 
I/Ol 
Output2 
I/O1 
Output2 
I/O1 
I/Ol 
I/O1 
Outputl 
Output2 
Output2 
I/O1l 
I/03 
I/03 
Outputl 
I/03 
I/03 
I/03 
I/03 
I/03 
1/03 
I/03 
I/03 
I/03 
I/03 
I/03 


1904m 
1342m 
1480m 
2756m 
2718m 
2100m 
1156m 
1148m 
3474m 
3452m 
2356m 
1920m 
2394m 
1864m 
858m 

2404m 
1804m 
3336m 
1912m 
3930m 
1708m 
928m 

2134m 
1528m 
1614m 
848m 

1002m 
2266m 
1160m 
954m 

1432m 
1048m 
1548m 
1000m 
1442m 
1448m 
1198m 
1038m 
1118m 
1744m 
1676m 
1532m 
1292m 
1194m 
1568m 
1608m 
2016m 
1584m 
956m 

920m 

894m 

788m 

850m 

836m 

914m 

966m 


intel486™ PROCESSOR FAMILY 


WWrR NR WwW Ww 


NON £& OUP fF W 


NP PWWW NP UWW NNO PDP WW PDO PWW PON WW hW WwW WUD WwW WDD W ~] W 
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Q13 A7 I/03 . 1084m 11.20n 2.44p 
Q14 A2 Outputl 1134m 11.48n 2.53p 
Q15 BREQ Output2 1872m 15.58n 3.89p 
Q16 PLOCK# Outputl 1630m 14.23n 3.44p 
Q17 PCHK# Output2 1616m 12.69n 5.07p 
ROL A28 1/03 1708m 14.67n $2599 
R02 A25 I/03 1604m 12.63n 5.04p 
ROS A18 I/03 1498m 13.50n 3.20p 
RO7 A15 I/03 1148m 11.56n 2.55p 
R12 All I/03 1274m 10.90n 4.14p 
R13 A8 I/03 1252m 12.13n 22750 
R15 A3 Outputl — 1504m 12 tin 4.77p 
R16 BLAST# Outputl 1698m 14.61n 3.57p 
soi A27 I/03 1900m 14.18n 5.85p 
S02 A26 I/03 1800m 15.18n 3.75p 
$03 A23 I/03 1756m 14.93n 3.67p 
S05 Al4 ; I/03 1842m 13.88n 5.69p 
S07 Al2 1/03 1530m 12.24n 4.84p 
$13 A1l0O | I/03 1444m 13 20n- 3.10p 
S15 A6 I/03 1722m 13.25n 5.36p 
S16 A4 1/03 1792m 15.13n 3.74p 
S17 ADS# | Outputl 1888m 14.12n 5.82p 


[ae ae eI III IIIT RTI RR 


(Model) Outputl 

Model_type Output 

Polarity Non-Inverting 

Enable Active-Low 

{signals ADS#, BLAST#, SMIACT#,A<3:2>,FERR#,HLDA, PLOCK#, LOCK# 
| 

| typ min max 

C_comp 3.05pF 2.9pF 3.2pF 

{Voltage range] 3.3V 3.0V 3.6V 


eA ee LSS ee eS eS ee Ae ESE PEAS SSS SES eR Se eS 


{Pulldown ] 

| voltage I(typ) I (min) I (max) 
-5.0V -960.0mA -580.0mA -1410mA 
-2.0V -190.0mA -99.0mA -292.0mA 
-1.0V -21.0mA -16.7mA -27.1mA 
-0.5V -13.30mA -8.7mA -20.5mA 


0.0V 0.0 0.0 0.0 

0.5V 12.9mA 8.3mA 20.1mA 
1.0V 23.7mA 15.1l1mA 37.2mA 
Ls 5V 31.5mA 19.7mA 49.7mA 
2.0V 35.4mA 21.6mA 56.6mA 
225V 36.3mA 22.0mA 58.6mA 
3.0V 36.6mA 22.2mA 59.0mA 
3.5V 36.9mA 22.3mA 59.4mA 
4.0V 37.0mA 22.4mA 59.7mA 
4.5V 37.1mA 22.5mA 59.8mA © 
5.0V 37.2mA 22.5mA 60.0mA 
6.0V 37.2mA 22.5mA 60.0mA 
10.0V 37.2mA 22.5mA - 60.0mA 
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WSeniea buen st ateacaeae tas aul nan cunea census senna eee 
| Note that the pullup voltage in the data table is derived from 

| the equation: 

| Vtable = Vcc - Voutput 

| For the 8.3V in the table, it is actually 8.3V below Vcc and -5Vv 

| with respected to Ground. 
| 
| 
[ 
| 
| 


KEKE KKK KKK KKK KKEK KKK KKK KKE KKK KK KKEKKKEEKEEKEKEKEEKEKKEKKRKEKEKKKEKKKKRRKKRKKEK 


Pullup] 
voltage I(typ) I(min) I(max) 


8.3V -1410mA -976.25mA -1992.8mA 
4.3V -79.8mA -55.6mA -229.06mA 
3.3V ~47.1mA -40.80mA -72.66mA 
2.8V -46.1mA -29.42mA -70.96mA 
223V -44.8mA -28.64mA -68.34mA 
1.8V -42.4mA -27.14mA -61.94mA 
1.3V -35.41mA -23.34mA -50.34mA 
0.8V -24.1mA -16.08mA -33.08mA 
O23V -9.5mA . -6.3mA -12.86mA 


0.2V 6.5mA 4.76mA 9.02mA 
-~0.7V 24.3mA 17.3mA 33.04mA 
-1.2V 41.7mA 30.48mA 55.6mA 
-1.7V 56.4mA 42.26mA 74.14mA 
-~2.2V 67.3mA 51.26mA 88.4mA 
-2.7V 74.6mA 56.96mA 96.58mA 
-3.7V 79.8mA 61.33mA 104.5mA 
-6.7V 82.4mA 63.55mA 109.5mA 
| 

[GND_clamp] 

| Voltage I(typ) JI(min) I(max) 
0.0V OmA NA NA 

-0.4V OmA NA NA 

=0..5V -~0.2mA NA NA 

-0.6V -l.lmA NA NA 

-0.7V -3.0mA NA NA 

-0.8V -6.0mA NA NA 

~0.9V -11.0mA NA NA 

-1.0V -30.0mA NA NA 

-1.2V -120.0mA NA NA 

-2.0V -180.0mA NA NA 

-5.0V -420.0mA NA NA 


KKK KKK KKK KR HK KEKE KEK KEKE K KK KEKE KEK EKER KREKEKEKEKEKKEKKEKEKEKEKKKKEKKEKEK 


| 
| 
| The data in the following POWER_clamp table is listed 
| as "Vec-relative", meaning that the voltage values are 
| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 
| Vtable = Vcc - Voutput 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| Ground and OV above Vcc. : 


ee SION NE Re AR Re Ree ER ae eee, Re ee ae RNR ee ee ea a Ee ee Om, CR RO Ne AOR eS 


[ POWER_clamp] 
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| voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 

-0.4V OmA NA NA 

-0.5V OmA NA NA 

-0.6V OmA NA NA 

-0.7V 0O.1lmA NA NA 

-0.8V 1.0mA NA NA 

-0.9V 8.0mA NA NA 

-1.0V 14.0mA NA NA 

-2.0V 100mA NA NA 


[Petree eee eek eA AA ESSA SEES SSeS ST CLES AS ESLER SS SES RE 


[Ramp] 

| typ min max 

av/dt_r 1.13/0.749n 0.93/0.868n 1.35/0.642n 
av/dt_f 0.99/0.447n 0.75/0.543n 1.27/0.387n 


LEELA eee ES RES SSeS A eT Te a ee ee ee ee ee 


[Model] Output2 
Model_type Output 
Polarity Non-Inverting 
Enable Active-Low 
|signal BE<3:0>#, BREQ, D/C#,M/I0#, PWT, PCD, PCHK#,W/R# 
| 
| typ min max 
C_comp 2.7pF 2.7pF 2.7pF 
{Voltage range] 3.3V 3.0V 3.6V 


[RELEASES SSE SESE SEES EER A SARS Se ee ee ee ee ee 


[Pulldown] 
| voltage I(typ) I(min) I (max) 

: ~5.0V -960.0mA -580.0mA -1410mA 
-2.0V -190.0mA -99.0mA -292.0mA 
-1.0V -21.0mA -16.7mA -27.1mA 
-0.5V -13.30mA -8.7mA -20.5mA 


0.0V 0.0 0.0 0.0 

0.5V 12.9mA 8.3mA 20.1mA 
1.0V 23.7mA 15.imA 37.2mA 
1.5V 31.5mA 19.7mA 49.7mA 
2.0V 35.4mA 21.6mA 56.6mA 
2.5V 36.3mA 22.0mA 58.6mA 
3.0V 36.6mA 22.2mA 59.0mA 
34 5V 36.9mA 22.3mA 59.4mA 
4.0V 37.0mA 22.4mA 59.7mA 
4.5V 37.1mA 22.5mA 59.8mA 
5.0V 37.2mA 22.5mA 60.0mA 
6.0V 37.2mA 22.5mA 60.0mA 
10.0V  37.2mA 22.5mA 60.0mA 


KKK KKK KKK KH KKK K KREMER REKREKRKEKEKRKEKKKHRKKKKEKRKKKKRK KKK KK KKK 


| 
| 
| Note that the pullup voltage in the data table is derived from 
| the equation: 

| Vtable = vcc - Voutput 
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For the 8.3V in the table, it is actually 8.3V below Vcc and -5v 


with respected to Ground. 
KKK KKHKKE KKK RE KE KEKE KKK KEKHRHREKEKEKKREKEKEKKEKEKKKRKKRKKEKKEKKEKEKKEK 


| 
| 
| 
| 
{Pullup] 
| Voltage I(typ) I(min) I(max) 
| 


8.3V -1410mA -976.25mA -1992.8mA 
4.3v -79.8mA -55.6mA -229.06mA 
3.3V -47.1mA -40.80mA -72.66mA 
2.8V -46.1mA -29.42mA -70.96mA 
2.3V -~-44.8mA -28.64mA -68.34mA 
1.8V -42.1lmA -27.14mA -61.94mA 
1.39 -35.1mA -23.34mA -50.34mA 
0.8V -24.1mA -16.08mA -33.08mA 
0.3V -9.5mA -6.3mA -12.86mA 


-0.2V 6.5mA 4.76mA 9.,02mA 

-0.7V 24.3mA 17.3mA 33.04mA 
-1.2V 41.7mA 30.48mA 55.6mA 
-1.7V 56.4mA 42.26mA 74.14mA 
-2.2V 67.3mA 51.26mA 88.4mA 
-2.7V 74.6mA 56.96mA 96.58mA 
-3.7V 79.8mA 61.33mA 104.5mA 
-6.7V 82.4mA 63.55mA 109.5mA 


(GND_clamp] 

| 

| Voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 
-0.4V OmA NA NA 
-0.5V -0.2mA NA NA 
-0.6V -l.ilmA NA NA 
-0.7V -3.0mA NA NA 
-0.8V -6.0mA NA NA 
-0.9V -11.0mA NA NA 
-1.0V -30.0mA NA NA 
-1.2V -120.0mA NA NA 
-2.0V -180.0mA NA NA 
-5.0V -420.0mA NA NA 


kkKewe keke ae kaka KKK KK KKK KKK KK KKK KKK KKK KKK RK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK 


| 
| 
| The data in the following POWER_clamp table is listed 

| as "Vcec-relative", meaning that the voltage values are 

| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 

| Vtable = Vcc - Voutput 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| 

| 

| 

[ 

| 


Ground and OV above Vcc. 
xeeKKKKEK KE KKK KEKRKEK KKK KKKEKKEKKKRHE KKK KE KKK KERR KKK KKK KKK KEKKRKEKKKKRK SK 


POWER_clamp]} 

voltage I(typ) I({min) I(max) 
0.0V. OmA NA NA 

-0.4V OmA NA NA 

-0.5V OmA NA NA 
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-0.6V OmA NA NA 
-0.7V O0.1mA NA NA 
-0.8V 1.0mA NA NA 
-0.9V 8.0mA NA NA 
~1.0V 14.0mA NA NA 
-2.0V 100mA NA NA 


| typ min max 
dv/dt_r 1.13/0.749n 0.93/0.868n 1.35/0.642n 
av/dt_£ 0.99/0.447n 0.75/0.543n 1.27/0.387n 


[III III IOI III IOI i Ike 


{Model] I/0l 
Model_type I/0 
Polarity Non-Inverting 
Enable Active-Low 
Vinl = 0.8v 

Vinh = 2.0v 

|signal DBUS<16:0>,DP<2:0> 

| ; 

| typ min max 

C_comp 2.7pF 2.7pF 2.7pF 
{Voltage range] 3.3V 3.0V 3.6V 


[RRR EEE EERE E EERE REE MERE EERE EE ERE EREREER ERE RERERERRERREEERERRERRERENKERE 


{Pulldown ] 


| voltage I(typ) I(min) I(max) 
-5.0V ~960.0mA -580.0mA -1410mA 
~2.0V -190.0mA -99.0mA -292.0mA 
-1.0V ~21.0mA -16.7mA -27.1mA 
-0.5V -13.30mA -8.7mA -20.5mA 


0.0V 0.0 0.0 0.0 

0.5V 12.9mA 8.3mA 20.1mA 
1.0V 23.7mA 15.1mA 37.2mA 
1.5V 31.5mA. 19.7mA 49.7mA 
2.0V 35.4mA . 21.6mA 56.6mA 
2\.5V 36.3mA 22.0mA 58.6mA 
3.0V 36.6mA 22.2mA 59.0mA 
3.5V 36.9mA °22.3mA 59.4mA 
4.0V 37.0mA 22.4mA 59.7mA 
4.5V 37.1mA 22.5mA 59.8mA 
5.0V 37.2mA 22.5mA 60.0mA 
6.0V 37.2mA 22.5mA 60.0mA 
10.0V 37.2mA 22.5mA 60.0mA 


HR KK KKK KKK KKK KEKE KKK KE KKKKKKKEKKKKKKKK KKK KKK KK KKK KKK KKK KKK 


| 
| 
| Note that the pullup voltage in the data table is derived from 
| the equation: ; 

| Vtable = Vec - Voutput : 

| For the 8.3V in the table, it is actually 8.3V below Vcc and -5V 
| : 

| 

| 


with respected to Ground. 
kKkkekkkekkekkekKkkkekekaKkkekkeKeKeKe KR KEK Kee KK KKK KR KKK KKK KKK KKK KKK KK KKK KKK KEK KKK 
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[Pullup] 
| voltage I(typ) I(min) I(max) 


BBV -1410mA -976.25mA -1992.8mA 
4.3V -79.8mA -55.6mA -229.06mA 
3.3V -47.1mA -40.80mA -72.66mA 
2.8V -46.1mA -29.42mA -70.96mA 
2.3V -44.8mA -28.64mA -68.34mA 
1.8V -42.41mA -27.14mA -61.94mA 
1.3V -35.1lmA -23.34mA -50.34mA 
0.8V -24.1mA -16.08mA -33.08mA 
0.3V -9.5mA -6.3mA -12.86mA 


0.2V 6.5mA 4.76mA 9.02mA 
-0.7V 24.3mA 17.3mA 33.04mA 
-1.2V 41.7mA 30.48mA 55.6mA 
-1.7V 56.4mA 42.26mA 74.14mA 
-2.2V 67.3mA 51.26mA 88.4mA 
-2.7V 74.6mA 56.96mA 96.58mA 
-3.7V 79.8mA 61.33mA 104.5mA 
-6.7V 82.4mA 63.55mA 109.5mA 
| 

[GND_clamp] 

| Voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 

-0.4V OmA NA NA 

-0.5V -0.2mA NA NA 

-0.6V -l.1lmA NA NA 

-0.7V -3.0mA NA NA 

-0.8V -6.0mA NA NA 

-0.9V -11.0mA NA NA 

~-1.0V -30.0mA NA NA 

-1.2V -120.0mA NA NA 

-2.0V -180.0mA NA NA 

-5.0V -420.0mA NA NA 


wKaeKekKkekeKe KK KKK KKK KK KKK KKK KK KK KK KKK KKK KK KKK KKK Ke KKK KKK KKK KE KKK KKK KEKKKEE 


| 
| 
| The data in the following POWER_clamp table is listed 

| as "“Vec-relative", meaning that the voltage values are 

| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 

| Vtable = Vcc - Voutput 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| 

| 

| 

[ 

| 


Ground and OV above Vcc. 
Ko KK KKK KKK KR KK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKKKKRKKKKRKKRKRKK 


POWER_clamp] 

voltage I(typ) I (min) I (max) 
o.0V OmA NA NA 

-0.4V OmA NA NA 

-0.5V OmA NA NA 

-0.6V OmA NA NA 

-0.7V O.l1mA NA NA 

-0.8V 1.0mA NA NA 

-0.9V 8.0mA NA NA 

-1.0V 14.0mA NA NA 
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. ® 


-2.0V 100mA NA NA 


| 

[Ramp ] 

| typ min max 

—av/dt_r = 1.13/0.749n 0.93/0.868n 1.35/0.642n 
av/at_f. 0.99/0.447n 0.75/0.543n 1.27/0.387n 
ie 


EAS eee SA RESELL RASA EAS EAA SAS LS Re NE A eA TS ee a ae wee 


(Model) I/02 

Model_type I/0 

Polarity Non-Inverting 
Enable Active-Low 

Vinl = 3.3v 

Vinh = 6.0v 

|siganl DBUS<31:17>,DP<3> 
| typ min max . 
C_comp 3.05pF 2.9pF 3.2pF 
(Voltage range] 3.3V 3.0V 3.6V 


[eee LEE REESE LER SLES SE SAS LE Se A A eS ee eke ee eee 


(Pulldown ] 


| voltage I(typ) I(min) I(max) 
-5.0V -960.0mA -580.0mA -1410mA 
-2.0V -190.0mA -99.0mA -292.0mA 
-1.0V -21.0mA -16.7mA -27.1mA 
-0.5V -13.30mA -8.7mA -20.5mA 


0.0V 0.0 0.0 0.0 

0.5V 12.9mA 8..3mA 20.1imA 
1.0V 23.7mA 15.1mA 37.2mA 
SV 31.5mA 19.7mA 49.7mA 
2.0V 35.4mA 21.6mA 56.6mA 
255V 36.3mA 22.0mA 58.6mA 
3.0V 36.6mA 22.2mA 59.0mA 
3.5V 36.9mA 22.3mA 59.4mA 
4.0V 37.O0mA 22.4mA 59.7mA 
4.5V 37.1mA 22.5mA 59.8mA 
5.0V 37.2mA 22.5mA 60.0mA 
6.0V 37.2mA 22.5mA 60.0mA 
10.0V 37.2mA 22.5mA 60.0mA 


KK KK KKK KKK KKK KKK KKK KKK KEKKEKKHKEKKKKKRKKKKKKK KK KKK KKK KKK KK KKK 


| 

| 

| Note that the pullup voltage in the data table is, derived from 
| the equation: 

| Vtable = Vcc - Voutput | 

| For the 8.3V in the table, it is actually 8.3V below Vcc and -5V 
| with respected to Ground. | 

| 
| 
[ 
| 
| 


Kee K KK KKK KKRKKKE KKK KKK KKK KKK KKK KKK KKK KK KKKKRKEKK KKK KKK KKK KK KK KKK 


Pullup]) 
voltage I(typ) I (min) I (max) 


8.3V -1410mA -976.25mA -1992.8mA 
4.3V -79.8mA -55.6mA -229.06mA 
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3.3V -47.1mA -40.80mA -72.66mA 
2.8V -46.1mA -29.42mA -70.96mA 
2.3V -44.8mA -28.64mA -68.34mA 
1.8V -42.imA -27.14mA -61.94mA 
1.3V -35.1lmA -23.34mA -50.34mA 
0.8V -24.imA -16.08mA -33.08mA 
0.3V -9.5mA -6.3mA -12.86mA 
-~0.2V 6.5mA 4.76mA 9.02mA 


.7V 24.3mA 17.3mA 33.04mA 
.2V 41.7mA 30.48mA 55.6mA 
.7V 56.4mA 42.26mA 74.14mA 
.2V 67.3mA 51.26mA 88.4mA 
.7V 74.6mA 56.96mA 96.58mA 
.7V 79.8mA 61.33mA 104.5mA 
.7V 82.4mA 63.55mA 109.5mA 


| ee re | 
NWNNF HO 


{GND_clamp] 

| 

| Voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 
-0.4V OmA NA NA 
-0.5V -0.2mA NA NA 
-0.6V -1.1mA NA NA 
-0.7V -3.0mA NA NA 
-Q.8V -6.0mA NA NA 
-0.9V -11.0mA NA NA 
-1.0V -30.0mA NA NA 
-1.2V -120.0mA NA NA 
-2.0V -180.0mA NA NA 
-5.0V -420.0mA NA NA 


RAK KKEKKKEKKEKEK KKK KKK KEKE KEK KKK KKK KKK KEKE KKK KKK KEKKKEKKEKRKEKEKKEKEKRKEKKKK KK 


| 
| 
| The data in the following POWER_clamp table is listed 

| as "Vcec-relative", meaning that the voltage values are 

| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 

| Vtable = Vcc - Voutput | 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| 


Ground and OV above Vcc. 
[ee ee ee ee eI III IIT IO III II III ae te tee 


| 

[POWER_clamp] 

| voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 
-0.4V OmA NA NA 
-0.5V OmA NA NA 
-0.6V OmA NA NA 
-0.7V 0O.1lmA NA NA 
-0.8V 1.0mA NA NA 
-0.9V 8.O0mA NA NA 
-1.0V 14.0mA NA NA 
-2.0V 100mA NA NA 


aticiiciicMictialicRicMiclictieiieMMaMecRicicMeMatata BeBe ReleMeleleMaRateMaleleMoleielctetalateiaieletehelielelclcleteteteheheheietetetetteh Retek tte 
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{Ramp ] 

| typ min max 
AaV/dt_r 1.13/0.749n 0.93/0.868n 1.35/0.642n 
av/dt_f 0.99/0.447n 0.75/0.543n 1.27/0.387n 


[LEP A ASSL LAS SE Se ee ee ee eae a ye a ee ee ee eg Se eae 
eae 


| 
{Model] 1/03 


Model_type 1/0 
Polarity Non-Inverting 
Enable Active-Low 
Vinl 0.8v 

Vinh 2.0v 

| 
[signal ABUS<31:4> 

| 

| typ min max 

C_comp 2.9pF 2.9pF 2.9pF 
[Voltage range] 3.3V 3.0V 3.6V 


[ee See ee SNR ee ey ee meen ee ernie ee Rae Gok Se yn whe eae REN Oey ey et Se ee ce A 


{Pulldown ] 

| voltage I(typ) I(min) I(max) 
-5.0V -960.0mA -580.0mA -1410mA 
-2.0V. -190.0mA -99.0mA -292.0mA 
-1.0V -21.0mA -16.7mA -27.1mA 
-0.5V -13.30mA -8.7mMA -20.5mA 


0.0V 0.0 0.0 0.0 . 

0.5V 12.9mA 8.3mA 20.1mA 
1.0V 23.7mA 15.1mA 37.2mA 
1.5V 31.5mA 19.7mA 49.7mA 
2.0V 35.4mA 21.6mA 56.6mA 
2.5V 36.3mA 22.0mA 58.6mA 
3.0V 36.6mA 22.2mA 59.0mA 
3.5V 36.9mA 22.3mA 59.4mA 
4.0V 37.0mA 22.4mA 59.7mA 
4.5V 37.1imA 22.5mA 59.8mA 
5.0V 37.2mA 22.5mA 60.0mA 
6.0V 37.2mA 22.5mA 60.0mA 
10.0V 37.2mA 22.5mA 60.0mA 


RRM KKK KEKE KEK KKK KEKE KKK KKK KKK KEKEKEKKKEEKEKEKEKEKKKE KKK KKK KKK KKK KKK KKK 


| 

| 

| Note that the pullup voltage in the data table is derived from 

| the equation: 

| Vtable = Vee - Voutput 

| For the 8.3V in the table, it is actually 8.3V below Vcc and -5V 
| with respected to Ground. 
| 

| 

[ 

| 

| 


KKK KKK KK KKK KKEHEKEKEK KKK KH KK KKK KKEKKEKEKKK KK aAKKEKKE KKK KKEKKEKEKKKEKEEK 


Pullup] 
voltage I(typ) I(min) I(max) 


8.3V. -1410mA -976.25mA -1992.8mA 
4.3V -79.8mA -55.6mA -229.06mA 
3.3V. -47.1mA -40.80mA -72.66mA 
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2.8V -46.1mA -29.42mA -70.96mA 
Cree sf ~-44,.8mA -28.64mA -68.34mA 
1.8V ~-42.imA -27.14mA -61.94mA 
aa Wh 5 -35.1lmA -23.34mA -50.34mA 
0.8V -24.1mA -16.08mA -33.08mA 
0.3V -9.5mA -6.3mA -12.86mA 


-0.2V 6.5mA 4.76mA 9.02mA 
-0.7V 24.3mA 17.3mA 33.04mMA 
-1.2V 41.7mA 30.48mA 55.6mA 
-1.7V 56.4mA 42.26mA 74.14mA 
-2.2V 67.3mA 51.26mA 88.4mA 
-2.7V 74.6mA 56.96mA 96.58mA 
-3.7V 79.8mA 61.33mA 104.5mA 
-6.7V 82.4mA 63.55mA 109.5mA 


{GND_clamp] 

| 

| Voltage JI(typ) I(min) I(max) 
0.0V OmA NA NA 
-0.4V OmA NA NA 
-0.5V -0.2mA NA NA 
-0.6V -l.lmA NA NA 
-0.7V -3.0mA NA NA 
-0.8V -6.0mA NA NA 
-0.9V -11.0OmA NA NA 
-1.0V -30.0mA NA NA 
-1.2V -120.0mA NA NA 
-2.0V -180.0mA NA NA 
-5.0V -420.0mA NA NA 


kekKkeKkkekeKe KKK KKK KK KK KKK KEK KKK KKK KK KK KKK KKK KKK KK KK KKK KKe KR KKK KK KKK KKK KKK KES 


| 
| 
| The data in the following POWER_clamp table is listed 

| as “Vec-relative", meaning that the voltage values are 

| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 

| Vtable = Vcc - Voutput 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| 

| 

[. 

[ 

| 


Ground and 0V above Vcc. 
wwe KKe KEKE KK KKK KKK KKK KKK KKK KKK KEKKKKKKKEKE KEKE KE KKKEKKE KKK KKK KEK K KK KEK 


POWER_clamp] 

voltage I(typ) I (min) I (max) 
o.0V OmA NA NA 
-0.4V OmA NA NA 
-0.5V OmA NA NA 
-0.6V OmA NA NA 
-0.7V O.ilmA NA NA 
-0.8V 1.0mA NA NA 
-0.9V 8.0mA NA NA 
-1.0V 14.0mA NA NA 
-2.0V 100mA NA NA 


Lee ee eee ees ee SE ee ee A ee RR RS Ae ee Se ee eR ee en ee 


[Ramp] 
| typ min max 
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av/dt_r 1.13/0.749n 0.93/0.868n 1.35/0.642n 
av/dt_£f 0.99/0.447n 0.75/0.543n 1.27/0.387n 


| Re SR Ae Ie ae Be Re Re IERIE See Ie Re Be Se AR A SR HR IEE Me I I RR I ROR He Re Re ee ee 


| 

{Model } Inputl 

Model_type Input 

Polarity Non-Inverting 

Enable Active-Low 
Vinl = 0.8v 

Vinh = 2.0v 

{signal A20M#,AHOLD, BOFF#, BRDY#,BS16#,BS8#,FLUSH#, 
| HOLD, IGNNE#, INTR, KEN#,NMI,RDY#, RESET, SRESET, SMI#,STPCLK# 
| typ min max 

C_comp 2.0pF 2.0pF 2.0pF 

(Voltage range] 3.3V 3.0V 3.6V 


| 


ee eS ee ee ee ee ee ee ee eee ae ee 


[GND_clamp] 

| Voltage I(typ) I(min) I(max) 
| 

0.0V OmA NA NA 
-0.4V OmA NA NA 
-~-0.5V -0.2mA NA NA 
-0.6V -l.ilmA NA NA 
-0.7V -3.0mA NA NA 
-0.8V -6.0mA NA NA 
-0.9V -11.0mA NA NA 
~1.0V -30.0mA. NA NA 
-1.2V -120.0mA NA NA 
-2.0V -180.0mA NA NA 
-5.0V -420.0mA NA NA 


KKK KHER KEKEKK HAKKAR KI KKK KEK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KEK KKK 


| 
| 
| The data in the following POWER_clamp table is listed 

| as "“Vec-relative", meaning that the voltage values are 

| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 

| Vtable = Vcc - Voutput 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| 

| 

| 

[ 

| 


Ground and OV above Vcc. 
KEK KKK KKK KKK KK KEHKEKHKKKEKEKAKE KEKE KKKEK HEHE KKKKREK KE KKK KKEKEKRHEKEKEKKEKKKKRKEEKRKE KKK 


POWER_clamp] 

voltage I(typ) I(min) I(max) 
0.0V OmA NA NA ; 
-0.4V OmA NA NA 

-0.5V OmA. NA NA 

-0.6V QOQmA NA NA 

-0.7V- O0.l1mA NA NA 

-0.8V 1.0mA NA NA 

-0.9V 8.O0OmA NA NA 

-1.0V 14.0mA NA NA 

-2.0V 100mA NA NA 
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| 


[Eee ee eee EERE EAR EERE OLE EE RE EERE NS Se RE Ee ee ee ee ew 


| 
{Model } Clockbuffer 


Model_type Input 

Polarity Non-Inverting 

Enable Active-Low 

Vinl = 0.8V 

Vinh = 2.0V 

lsignal CLK 

| typ min max 

C_comp 2.0pF 2.0pF 2.0pF 
[Voltage range] Sa3V 3.0 3 36V 


| 


LES LS SEARLS EEA EAE SEE ES EER REL EAR ERE ES EEL EAM ER ER ee OS 


{(GND_clamp] 

| Voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 
-0.4V OmA NA NA 
-0.5V -O.2mA NA NA 
-0.6V -1.imA NA NA 
-0.7V -3.0mA NA NA 
-0.8V -6.0mA NA NA 
-0.9V -11.0mA NA NA 
-1.0V -30.0mA NA NA 
-1.2V -120.0mA NA NA 
-2.0V -180.0mA NA NA 
-5.0V -420.0mA NA NA 


RRR KKK RK KKK KKK KEKE HK KKK KKKRKKEKHEKEKEKKE KE KEKE KKEEKKEEKREKKKKKRKEKK KKK KKK KK 


| 
| 
| The data in the following POWER_clamp table is listed 

| as "Vec-relative", meaning that the voltage values are 

| referenced to the Vcc pin. The voltages in the data tables 
| are derived from the equation: 

| Vtable = Vcc - Voutput 

| In this case, assuming that Vcc is referenced to 3.3V. 

| OV in the table actually means 3.3V with respected to 

| 

| 

| 

[ 

| 


Ground and OV above Vcc. 
KR KKK KEKE KKK KRKEK KKK KRKEKEKKEKKRKKKKEK KK KKK KKK KRKE KKK KKK KKK KRKKKKKEKKEKKRRKKR KEK 


POWER_clamp] 

voltage I(typ) I(min) I(max) 
0.0V OmA NA NA 
~0.4V OmA NA NA 
-0.5V OmA NA NA 
-0.6V OmA NA NA 
-0.7V O.1mA NA NA 
-0.8V 1.0mA NA NA 
-~-0.9V 8.0mA NA NA 
-1.0V 14.0mA NA NA 
-2.0V 100mA NA NA 


[eee SAS LALA EEE LS SEE SES ERR SS SS SR RS RNR RR See 


| 
[End] 
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APPENDIX D 
BSDL LISTINGS 
Below is a listing of a boundary scan description lan- processors. See section 11.5, “Intel486 Processor 
guage (BSDL) file for the IntelDX4 processor. Boubdary Scan,’ for a complete description of 


| | BSDL instructions and usage. 
This file is provided as an example. Contact Intel for 
design information for this and other Intel486 


IntelDX4 Processor Listing 


+ 


-- Copyright Intel Corporation 1993 . 
m_—ae RRA KEKRKRKEKEKRKRE KEKE KKK Ke KE KK KKK KKKKRKKEKE KKK KKK KK KKK EAKRaAK KK KKK KKK Ke KKK KRKKKRKRKe KKK KKRKR Keka kK 
~- Intel Corporation makes no warranty for the use of its products 

-- and assumes no responsibility for any errors which may appear in 

-- this document nor does it make a commitment to update the information 

-- contained herein. 

ae kK KKRKKKK KKK KKK KKK KK KKhkKk KK KKK Kk Keka kK kK KKK KKK KK KKK KKK KKK KKK KKKKKEK KEKE KEK 
-- Boundary-Scan Description Language (BSDL Version 0.0) is a de-facto 

-~- standard means of. describing essential features of ANSI/IEEE 1149.1-1990 

-- compliant devices. This language is under consideration by the IEEE for 

-- formal inclusion within a supplement to the 1149.1-1990 standard. The 

-- generation of the supplement entails an extensive IEEE review and a formal 
-- acceptance balloting procedure which may change the resultant form of the 
-- language. Be aware that this process may extend well into 1993, and at 

-- this time the IEEE does not endorse or hold an opinion on the language. 


wa kk RK KKK KEK KKEKKE KR KKK KKK KKKKKKeK KK KK KKK KKKRK KKK KKK KKK KKK KReEKKKK KKK KK eK KKK KEKK KKK 


3% 


—— 


-- IntelDxX4(tm) processor BSDL description 
-- This file has been electrically verified. 


entity IntelDxX4 is 


generic (PHYSICAL_PIN_MAP >: String := "PGA_17x17"); 
port (A20M >: in bit? 
ABUS2 : out bit; . 
ABUS3 >: out bit; 
ABUS : ainout bit_vector (4 to 31); -- Address bus (words) 
ADS : out bit; 
AHOLD : in bit; 
BE s OWE bit_vector (0 to 3); 
BLAST > out bit; — 
BOFF : in bits 
BRDY : in bit; 
BREQ >: out bit; 
BS8 S 2m - Dite 
BS16 >: in bit; 
CLK -. $n bit; 
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CLKMUL : in bit: 

DBUS : inout bit_vector (0 to 31); -- Data bus 
DC : out bit; 

DP : inout bit_vector (0 to 3); 

EADS : in bit; 

FERR >: out bit; 

FLUSH : in bit; 

HLDA >: out bre? 

HOLD : in bit; 

IGNNE : in bit; 

INC_PGA : linkage bit_vector (1 to 5); -- Internal NC PGA 
INTR : in bit; 

KEN : ma bit; 

LOCK : out bit; 

MIO : out bit; 

NC_PGA : linkage bit; ~- No Connect for PGA 
NC_SQFP ‘ linkage bit_vector (1 to 7); -- NC SQFP 
NMI : in bit; 

PCD : out bit; 

PCHK : out bit; 

PLOCK : out bit; 

PWT f: “OS bit; 

RDY : in bit; 

RESET A in bit? 

SMI : in. bits 

SMIACT : out bit; 

SRESET : in bit; 

STPCLK : in bit; 

TOK; TMS. TDL R ATi bit; -- Scan Port inputs 

TDO > out bit; -- Scan Port output 

UP : in bit: : 
VCC_PGA : linkage bit_vector (1 to 23); -- vcc 
VCC_SQFP : linkage bit_vector (1 to 53); -- vcc 
VCCS : linkage bit; ~- Reference Voltage 
VOLDET : linkage bit; ~- Voltage Detect Pin, PGA only 
VSS_PGA ; linkage bit_vector (1 to 28); -- VSS 
VSS_SQFP : linkage bit_vector (1 to 38); -- VSS 

WR : out bit); 


use STD_1149_1_1990.all; 


attribute PIN_MAP of IntelDX4 : entity is PHYSICAL_PIN_MAP; 


constant PGA_17x17 : PIN_MAP_STRING := -~ Define Pin Out of PGA 
"“A20M : D1I5, "“& 
"ABUS2 : Q14, “"& 
"ABUS3 : R15, "& 
"ABUS : ($16,Q012,8515,Q013,R13,011,513,R12,"& 


‘i $07,Q10,S05,R07,Q09,Q03,R05,004,008,Q05, "& 
@07,S03,Q06,R02,S02,S01,R01,P02,P03,Q01),"&. 


"ADS > S17; “& 
“AHOLD : Al7, "& 
"BE S ARDS A165 155 P17). 
“BLAST RLG6 ye 
"BOFF : Div, “& 
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“BRDY 
“BREQ 
"BS8 
"BS16 
“CLK 
"“CLKMUL | 
"DBUS 


“DC 

"DP 
"EADS 
"FERR 
"FLUSH 
"“HLDA. 
“HOLD 
"IGNNE 
“INC_PGA 
"INTR 
"KEN 
“LOCK 
"MIO 
"NC_PGA 
"NMI 
"PCD 
“PCHK 
"PLOCK 
"PWT 
"RDY 
"RESET 
"SMI 
"SMIACT 
"SRESET 
"STPCLK 
"TCK 
"TDI 
"TDO | 
"TMS 
"UP 
"VCC_PGA 


” 


"VCC5 
"VOLDET 
"VSS_PGA 


constant SQFP_208 


“A20M 
“ABUS2 
"ABUS3 


H15, "& 
O15; "& 
D16, "& 
C17, "“& 
C03, "& 
R17, "& 


(PO01,N02,N01,H02,M03,J02,L02,L03,F02,D01,E03,"& 
C01,603,D02,K03,F03,J303,D03,C02,B01,A01,B02,"& 
A02,A04,A06,B06,C07,C06,C08,A08,C09,B08)," & 


M15, "& 
(NO3,F01,H03,A05),"& 
B17, "& 
C14, "& 
C15, "& 
P15; "& : = 
E15; "& 
A1S, "& 
: (A10,A12,Ai13,B12,B13),"& 
: Al6, "& 
Fi5;: “& 
N15, "& 
N16, "& 
C13: 
B15, "& 
Ji7, "& 
Ql17, "& 
Q16, "& 
L15, "& 
F116, "& 
C16; "& 
B10, "& 
C12, "& 
C105, "8 
Gis, "& 
A03, "& 
Al4, "& 
B16, "& 
B14, "& 
Gilly. "& 


(BO7,B09,B11,C04,C05,E2,E16,G02,G16,H16,K02,"& 
K16,L16,M02,M16,P16,R03,R06,R08,RO9,R10,R11, "& 


R14),"& 
JO1l, "& 
S04, “& 


{(A07,A09,A11,B03,B04,B05,E01,E17,G601,G617,H01,H17,"& 
K01,K17,L01,L17,M01,M17,P17,Q02,R04,S06,S08,S09,"& 
$10,S11,S812,814),"& 

N17 ws 


PIN_MAP_STRING := -~- Define Pin Out of SQFP 


47, "“& 
202, "“& 
197, "& 
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"ABUS : (196,195,193,192,190,187,186,182,180,178, "& 

" 177,174,173,171,166,165,164,161,160,159,"& 

" 158,154,153,152,151,149,148,147)," & 

"ADS : 203, "& 

"AHOLD : 17, "& 

"BE : (31,32,33,34),"& 

"BLAST  : 204, "& 

“BOFF : 6, "& 

"BRDY 5, "& 

“BREQ : 30, "& 

"BS8 : 8, "& 

"BS16 e 2; "& 

"CLK : 24, "& 

"CLKMUL : 11, “& 

“DBUS > (144,143,142,141,140,130,129,126,124,123,119, "& 
" 118,117,116,113,112,108,103,101,100,99,93,"& 

" 92,91,87,85,84,83,79,78,75,74),"& 

"DC : 39, "& 

"DP : (145,125,109,90),"& 

"EADS : 46, “& 

“FERR : 66, "& 

"FLUSH : 49, "& 

“HLDA 365 Me 

“HOLD e526; os 

"IGNNE : 72, "& 

“INTR : 50, "& 

"KEN : 13, "& 

"LOCK : 207, "& 

"MIO gs, te 

"NC_SQFP : (63,64,67,70,71,96,127),"& 

“NMI : 51, “& 

“PCD : 41, “& 

"PCHK : 4, "& 

"PLOCK  : 206, “& 

"PWT 2300. “Re 

“RDY : 12, “"& 

“RESET : 48, "& 

"SMI : 65, "& 

“SMIACT : 59, "& 

“SRESET : 58, "& 

“STPCLK : 73, “& 

“TCK ee: ee 

“TDI : 168, "& 

“TDO : 68, "& 

“TMS : 167, "& 

“UP : 194, "& 

“VCC_SQFP : (2,9,14,19,20,22,23,25,29,35,38,42,44, 45,54, "& 
i 56,60,62,69,77,80,82,86,89,95,98,102,106,111,"& 
" 114,121,128,131,133,134,136,137,139,150,155, "& 
, 162,163,169,172,176,179,183,185,188,191,198, "& 
" 200,205),"& 

"VCCS5 : 3, "& 

“VSS_SQFP : (1,10,15,21,28,36,43,52,53,55,57,61,76,81, 88,94, "& 
" 97,104,105,107,110,115,120,122,132,135,138,146, "& 
" 156,157,170,175,181,184,189,199,201, 208) , "& 
“WR eee Ad meee 
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attribute Tap_Scan_In of TDI : signal is true; 
attribute Tap_Scan_Out of _TDO : signal is true; 
attribute Tap_Scan_Mode of TMS : signal is true; 
attribute Tap_Scan_Clock of TCK : signal is (25.0e6, BOTH); 


attribute Instruction_Length of IntelDX4 : entity is 4; 


attribute Instruction_Opcode of IntelDxX4 : entity is 


“BYPASS (1111),“ & 
“"EXTEST (0000)," & 
“SAMPLE  (0001)," & 
“IDCODE (0010)," & 
"RUNBIST (1000)," & 


“PRIVATE. (0011,0109,0101,0110,0111,1001, 1010,1011,1100,2101,1110)"; 
attribute Instruction_Tapture of IntelDX4 : entity is "0001"; 
-~- there is no Instruction_Disable attribute for IntelDx4 
attribute Instruction_?rivate of IntelDX4 : entity is "private"; 


attribute Idcode_Register of IntelDxX4: entity is 


a as kKaekkekkekekekekxekazekkekrkekekekeke keke Ke KKK KKK KRKKK KKK KKK aK k 


"0000" & --version 

"“1000001010001000"& --new part number 

"00000001001" & --manufacturers identity 

re --required by the standard 
attribute Instruction_Usage of IntelDX4 : entity is 

"RUNBIST (registers BIST; "& 

“result 0;" & 


“clock CLK in Run_Test_Idle;"& 
"length 1600000)"; 


attribute Register_Access of IntelDX4 : entity is 
"BIST[1] (RUNBIST)"; 


ae RS RR TE RRR RRR RR NR RR RR RRR OR RARE BRR RR RR RE RR RR RRR RRR RRR ER) 


--{ The first cell is closest to TDO } 


a LEER EERE ERE ERE RRR REE HERE REAR EKER RE REKE RAKE EKERAERERKEKEK KKH Y 


attribute Boundary_Length of IntelDX4 : entity is 109; 
attribute Boundary_Cells of IntelDx4 >: entity is "BC_2, BC_1l, BC_6"; 


attribute Boundary_Register of IntelDX4 : entity is 


"0 (BC_2, ABUS2, output3, Xx, 107, 1, Z),"& 
"1 (BC_2, ABUS3, output3, wo. 207s Ty “as 
"2 (BC_6, ABUS(4), bidir, Mo. 200: Ay ues 
"3 (BC_6,  ABUS(5), bidir, x “L07, 2h. ye he 
"4 (BC_1, UP,. input, X)," & 

"S (BC_6, ABUS(6), bidir, K>. AO Ap. Boe 
"6 (BC_6, ABUS(7), bidir, RG. OH Te: SEV te 
"7 (BC_6, ABUS(8), bidir, X, 107, 1, 2),"& 
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(BC_6, ABUS(9), bidir, Ke. SO7> “De J2h re 
(BC_6, ABUS(10), bidir, Ry W075. We “Zr pre 
(BC_6, ABUS(11), bidir, X, 107, 1, Z),"& 
(BC_6, ABUS(12), bidir, Me TOM. The 22s 
(BC_6, ABUS(13), bidir, Ke DOE. ys “Ze he 
(BC_6, ABUS(14), bidir, e S07. Ws: Dias 
(BC_6, ABUS(15), bidir, %,. LOT). Ay. “2y,e 
(BC_6, ABUS(16), bidir, i BOTs. See Aa cke 
(BC_6, ABUS(17), bidir, R 107. Dy Byte 
(BC_6, ABUS(18), bidir, Hs. 207%. Te. 2295, 
(BC_6, ABUS(19), bidir, My, LOUs.. e- Byte 
(BC_6, ABUS(20), bidir, RS: DOT. 45. see 
(BC_6, ABUS(21), bidir, Me 107 Ly -2ie%e 
(BC_6, ABUS(22), bidir, Ki ° DOT. Le Iie 
(BC_6, ABUS(23), bidir, K-07 Ts eye 
(BC_6, ABUS(24), bidir, R TOT, De “2a %e 
(BC_6, ABUS(25), bidir, Mi 107s. Te Zia 
(BC_6, ABUS(26), bidir, Xo. LOR 1, Beye 
(BC_6, ABUS(27), bidir, Bo “107, i. Bye 
(BC_6, ABUS(28), bidir, Re Oye. A - Wie 
(BC_6, ABUS(29), bidir, Ke. MOTs. ay “2 yee 
(BC_6, ABUS(30), bidir, He DOW An She 
(BC_6, ABUS(31), bidir, Ho AGTy.. “Dy. Seieghe 
(BC_6, DP(0), bidir, % AOBz: dye -2yyts 
(BC_6, DBUS(0), bidir, X, 108, 4, 2b, *& 
(BC_6, DBUS(1), bidir, X, 108, 1, 2Z),"& 
(BC_6, DBUS(2), bidir, Kr. AOS. hi CBS Re 
(BC_6, DBUS(3), bidir, Ki DOS» “Te “We 
(BC_6, DBUS(4), bidir, Ky 108k. i “Bila Pe 
(BC_6, DBUS(S5), bidir, Ky. DOB, Le “24s 
'BC_6, DBUS(6), bidir, Me DUO, dy, Bie 
(BC_6, DBUS(7), bidir, iy 108. We “2s 
(BC_6, DP(1), bidir, M, MOG, Ay ype 
(BC_6, DBUS(8), bidir, KX, 108). Te B*e 
(BC_6, DBUS(9), bidir, x, 108, ly 27, %& 
(BC_6, DBUS(10), bidir, x, 208) Dy 2), %e 
(BC_6, DBUS(11), bidir, R- LOS;. “ys Wiehe 
(BC_6, DBUS(12), bidir, R, 208, De Zhe ®s 
(BC_6, DBUS(13), bidir, Ky D068) Hon Bis ME 
(BC_6, DBUS(14), bidir, Ki. 208; Uy <274-"s 
(BC_6, DBUS(15), bidir, Ki “hOB, “Wy. Zits 
(BC_6, DP(2), bidir, X, 108, 1, 2Z),"& 
2C_6, DBUS(16), bidir, X, 108) 1, 2) "e 
(BC_6, DBUS(17), bidir, K 108. 25. “2s 
(BC_6, DBUS(18), bidir, Xo, DOG), 35. “Bip he 
(BC_6, DBUS(19), bidir, XK, “LOS, be Zo %e 
(BC_6, DBUS(20), bidir, Ko 208%. 1p “Shy ke 
(BC_6, DBUS(21), bidir, X, 108, 1, Z),"& 
(BC_6, DBUS(22), bidir, X, 108, 1, Z),"& 
(BC_6, DBUS(23), bidir, Ry 208) Lp coe 
(BC_6, DP(3), bidir, Ki: 008s. “2 Zee 
(BC_6, DBUS(24), bidir, Sy 1065. Ty. “Bi 
(BC_6, DBUS(25), bidir, Xu -WOSy Is. “ae 
(BC_6, DBUS(26), bidir, KR, 108, 1, Z)p*e 
(BC_6, DBUS(27), bidir, X,, G08). Te Ziyie"e 
(BC_6, DBUS(28), bidir, Ko MOGs. Wh. -eyyct& 
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DBUS(29), 


2-388 


"64 (BC_6, bidir, X, 108, 1, 2Z),"& 
"65 (BC_6, DBUS(30), “Bidar, Xx, “£08, “he -Bbpohe 
"66 (BC_6, DBUS(31), bidir, KX, 108, Dy. Z)e& 
"67 (BC_2, STPCLK, input, X),"& 
"68 (BC_1, IGNNE, input, X),"& : 
"69 (BC_2,  FERR, output3, KX, F054. dy Zi" 
"70 CBC 1, SMI, input, X),"& 
"741 (BC_2, SMIACT, output3, X, 106, 1, 2Z),"& 
"72 CBE Ts. SRESET, input, 4) 8 & , 
bed Bo (BC_1, NMI, input, X),"& 
"74 (BC. oi; INTR, input, X), "& 
©75 (BC_1, FLUSH, input, ome: 
6 >) Cay RESET, input, X),"& 
“77 (BC_1, A20M, input, X),"& 
"78 (BC_1, EADS, input, XX), "& 
"79 (BCU2, -PCD;,. output3, X, 106, 1, 2Z),"& 
"80 (BC_2, PWT, output3, K, 206, 1) Zyy"% 
"S81 {BC-.2, DC, output3, X, 106, Ls Z),"& 
"82 (BC_2, MIO, output3, X, 106, Ly | Sd Se 
"83 (BC_2, BE(3), output3, x; 106, 2, -Zhite 
"84 (BC_2, BE(2), output3, xX; 1206, dy. 2s 
"85 (BC_2, BE({1), output3, x; 2065: “Ay. “Z)."6 
"86 (BC_2, BE(0), output3, X, 106, 1, 2Z),"& 
"87 (BC_2, BREQ, output3, xX. “20S, Ly Zoi? s& 
"88 (BC_2, WR, output3, X, 106, 1, Z),"& 
"89 (BC_2, HLDA, output3, Ke 1085. “by. “2h 4"e 
= 30 (BCs: “CLK; input, X), "& 
"91 (BC_1, AHOLD, input, X),"& 
"92 (BC_1, HOLD, input, X),"& 
"93 (BC_1, KEN, input, X),"& 
"Q4. (BEL, RDY, input, X),"& 
"95 (BC_1, CLKMUL, input, >. me 
"96 (BC_1, BS8, input; X),"& 
"97 (BC_1, BS16, input, X),"& 
"98 (BC_1, BOFF, input, X),"& 
"99 (BC_1, BRDY, input, X),"& 
"100 (BC_2, PCHK, output3, Xs 105, +h Z)4."& 
“104 (BC_2, LOCK, output3, X, 106, er Z),"&. 
"102 (BC 2, PLOCK, output3, X, 106, se Z),"& 
"103 (BC_2, BLAST, output3, X, 106, 1, Z),"& 
"104 (BC_2, ADS, output3, X, 106, 1, 2Z),"& 
wes (BE. 2, ms Control, 1)i.%% -- DISMISC 
"106 (BC.22; *., eontrol, T)ii"s -- DISBUS 
-"107 (BC_2, sae eontrol;, 1) ."& ~- DISABUS 
"108 (BE .2)- control, 1)"; -- DISWR 

end IntelDX4; 
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APPENDIX E 
SYSTEM DESIGN NOTES. 


SMM Environment Initialization 


When the Intel486 processors are operating in Real 
Mode, the physical address at which instructions 
and data are fetched is determined by the segment 
register and an offset (i.e., CS and IP for instruc- 
tions). When a new value is loaded into a segment 
register, the new value is shifted to the left by four 
bits and stored in a segment base register that cor- 
responds to that particular segment (CSBASE, 
DSBASE, ESBASE, etc.). It is the value stored in the 
segment base register that is actually used to gener- 
ate a physical address. For example, the linear ad- 
dress to be used for fetching instructions is deter- 
mined by adding the value contained in the CS seg- 
~ ment base register with the value in the IP register. 


When the processor is in Protected Mode, the seg- 
ment registers are used as selectors to a descriptor 
table. Each descriptor in a descriptor table contains 
information about the segment in use, including the 
segments BASE address (i.e., CSBASE), the limit (or 
size of the segment), as well as protection level, 
' privileges, operand sizes, and the segment type. In 
Protected Mode, the linear address is determined by 
adding the base portion of the descriptor to the ap- 
propriate offset. 


When in System Management Mode, the processor 
operates in a pseudo-Real Mode, with address cal- 
culation performed in the Real Mode manner. How- 
ever, the processor adds the value in the segment 
base register with the value in the EIP register, rath- 
er than the IP register, so there are no limits as to 
the segment size. The physical address of an in- 
struction is obtained by adding the value in CSBASE 
to the value in EIP. 


When entering SMM, it may be necessary to initial- 
ize the segment registers to point to SMRAM (see 
section 8.4.2, ‘Processor Environment,‘ for their val- 
ue on SMM entry). If SMBASE has not been relocat- 
ed, then the necessary segment registers can be 
initialized to point to SMRAM by using the value in 
the CS register, 3000H, which points to the SMRAM 
address space. 


When an SMI# occurs after SMBASE has been 
modified, CSBASE is loaded with the new value of 
SMBASE. However, the CS selector register still 
contains the value 3000H, not the value corre- 
sponding to the new SMBASE. 


To initialize segment registers to point to the new 
SMRAM area, read the SMBASE value from the 
SMM state that was saved in memory. Because the 
data segment registers are initialized to 0, do not 
use them to access the SMM state save area. In- 
stead, perform a read relative to the CS register by 
using a CS override prefix to a normal memory read. 
Although CS still contains 3000H, CSBASE contains 
the value of SMBASE, and CSBASE is used for the 
address generation. 


Once the value of SMBASE is obtained, it must be 
shifted to the right by four bits to get the appropriate 
value to be placed in the segment registers. The CS 
register itself can be initialized by executing a far 
jump instruction to an address within SMBASE, 
which causes CS to be reloaded with a value corre- 
sponding to SMBASE. 


Example E-1 describes one method of initializing the 
segment registers when SMBASE has been relocat- 
ed. This method works if SMBASE is less than 1 
Megabyte. 
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Example E-1. Initialization of Segment Registers within SMM 


mov c1,4 


;at memory location PTR_ADDR 


;an indirect jump address 


mov dads,ax 
mov es,ax 


;address SMBASE: OFFSET 


Accessing SMRAM 


LOADING SMRAM WITH AN INITIAL SMI 
HANDLER 


Under normal conditions, the SMRAM address 
space should only be accessible by the processor 
while it is in SMM mode. However, some provision 
must be made for providing the initial SMM interrupt 
handler routine. 
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;read the value of SMBASE from the state save area 
mov si, FEF8H ;SMBASE slot in SMM state save area 
mov eax,cs:[si] ;copy SMBASE from SMBASE:FEF8H to eax 
;scale the SMBASE value to a 16-bit quantity 


ror eax,cl ;scaled value of SMBASE now in ax 


;to load cs, execute a far jump to an address that has been stored 
;store the SMBASE value and an offset to a memory location that can be used as 


mov di,PTR_ADDR ;PTR_ADDR is the location used to 
;store the jump address 

mov bx,OFFSET ;OFFSET is the address where 
;execution continues after the 


;far jump 
mov es< (diji;: bx ;store the offset for the far jump 
inc di 
ine di 
Mov cs: [{di],ax ;store the segment address for the . 
;far jump,:- which is SMBASE 
Mov bx, PTR_ADDR ;bx now contains the address of the 


; location holding the jump address 


>initialize DS and ES with the correct address of SMBASE 


s;execute a far jump instruction to load the CS register 
jmp far [bx] ;jump to address stored at memory 
i ;location pointed to by bx 


-CS now contains the correct value of SMBASE, and execution continues from the 
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Because System Management Mode must be trans- 
parent to all operating systems, the initial SMM han- 
dler must be loaded by the system BIOS. At some 
time during the power on sequence, the system 
BIOS will need to move the SMM handler routine 
from the BIOS ROM to the SMRAM. The system 
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designer must provide a hardware mechanism that 
allows access to SMRAM while SMIACT # from the 
processor is inactive. One method would be to pro- 
vide an I/O port in the memory controller that forces 
memory cycles at a given address to be relocated to 
the SMRAM. Once the initial SMM handler has been 
loaded to SMRAM, the I/O port would be disabled to 
protect against accidental accesses to SMRAM. 


The system BIOS must provide an SMM handler at 
the address 38000H. If the system designer has 
chosen to take advantage of the SMRAM relocation 
feature of the processor, this handler must change 
the SMBASE register in the SMM state save. Next, 
the BIOS must move the full featured SMM handler 
to the new address. An SMI# must be generated in 
order to change the SMBASE register before the 
BIOS passes control to the operating system. 


SMRAM HIDDEN FROM DMA AND BUS 
MASTERS 


In a system that allows DMA or other devices to take 
control of the system bus, care must be taken to 
ensure that only the master processor can access 
SMRAM. If an external bus master requests use of 
the system bus (by asserting HOLD or BOFF#) 
while the processor is executing an SMM handler 
routine, the processor would respond by passing 
control of the bus to the requesting device. The 
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system memory controller must redirect any memory 
accesses that are not generated by the processor to 
normal system memory as if SMIACT # was inactive. 


DMA accesses to the SMRAM area must be redi- 
rected to the correct address space when the initiali- 
zation routine is loading SMRAM, as well as when 
the processor is in SMM. 


It is not recommended to block bus control requests 
when in SMM, because the increased bus access 
latency could cause compatibility issues with some 
software or expansion hardware. 


ACCESSING SYSTEM MEMORY FROM WITHIN 
SMM 


In order to enter a suspend state where power is 
removed from some or all of system memory, it is 
necessary for the processor to have access to the 
entire system address space from within SMM. Ac- 
cess to system memory from within SMM requires 
that the memory controller decode both SMIACT # 
and the processor address to determine accesses 
to SMRAM. Only those memory addresses that are 
defined as being SMRAM space would be directed 
to SMRAM. If SMRAM is located at an address that 
overlays normal system memory address space (see 
section 8.6.2, ‘“SMRAM Interface,’’.), the processor 
must have a method of accessing both SMRAM (for 
code reads) and system memory simultaneously. 


Ci Ne Nae? Ne eT TF 

apst .\tHHf/ ON 
SCT Na 
Ht ER, GO TT aaa, CO 
ROY \HHf..... 

AO J Ne 

HLDA ee NS eee ee 
BOFFH —).MKM MX 
SMRAMCS# = ———-».\________/  SMRAMCS#Blockedby .\_ 


HLDA or BOFF# 
242202-L5 


Figure E-1. Blocking Other Bus Masters from Accessing SMRAM 
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Ideally, a method of accessing system memory that 
is rnapped underneath SMRAM would be provided 
by the system memory controller. The memory con- 
troller would provide a register that allows system 
memory at a given address to be remapped to a 
different address, which is not overlaid by SMRAM. 
When the SMM handler implements a suspend, it 
would first move all of system memory that is not 
underneath SMRAM to a non-volatile medium (such 
as a hard disk drive). Next, the SMRAM image would 
be transferred to the non-volatile medium. Finally, 
the memory underneath SMRAM would be ac- 
cessed and copied to the non-volatile medium with a 
processor read to the remap address space, which 
is redirected to the overlaid system memory (see 
Figure E-2). 


_ If the memory controller does not provide a method 

, of accessing overlaid system memory, it is possible 
to implement a software procedure to accomplish 
the same goal. However, the software method is 
quite complex, and a hardware method is preferred. 
A description of the software method follows. 


The ability to access the system memory that is lo- 
cated in the address space under SMRAM requires 
a method of resuming from SMM to a predetermined 
address space. This can be accomplished with the 
following procedure. 


Remap Feature 
Disabled 


CPU Accesses 
to Remap : 
Address Space Remap Feature 


Enabled 
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- When resuming from SMM, the processor continues © 


execution at the address contained in the CS and 
EIP slots within the SMM state save. However, the 
resume address cannot be changed by simply modi- 
fying the CS and EIP slots, because the processor 
will use the CS descriptor to determine the actual 
resume address. The descriptor registers are stored 
in reserved slots in the SMM state save, and they 
cannot be directly modified. 


By replacing the suspend state save with a previous- 
ly obtained image of a state save that returns to a 
known location, the SMM suspend handler can force 
a return to a given address: 


1. During initial system power up, execute an SMI # 
from a predetermined address (the address im- 
mediately preceding the address to which you lat- 
er wish to resume). This can be accomplished by 
generating an SMI# in response to an I/O in- 
struction or executing a halt instruction and using 
an SMI# to exit the halt state. 


2. Save the state save from this SMM to a safe lo- — 
cation (SMRAM). 


3. When the system needs to resume to a given 
address from some other SMI#, the stored state 
save can be substituted for the state save genes 
ated from that particular SMM. 


Remap Address for 
— SAN 
—+ KK 


Overlaid Region \ 


Normal 
Memory 
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Figure E-2. Remapping Memory That Is Overlaid by SMRAM 
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Now that SMM can be resumed at a predetermined 
address, access the entire system memory space 
from within SMM before executing a suspend: 


1. During a suspend SMM, save all system memory 
except that which is located underneath SMRAM 
to a specified (and reserved) section of the hard 
disk. The ability to access system memory re- 
quires the memory controller to decode both 
SMIACT # and the processor address, and direct 
a limited section (maybe 64 or 128K) of the proc- 
essor address space to SMRAM. All other proc- 
essor memory accesses should go to normal sys- 
tem memory. 


2. Save the contents of the SMM state save to the 
hard disk. 


3. Modify the SMM state save so that the RSM in- 
struction will return to a predefined address, 
which is not in the application that was interrupt- 
ed. The code at this address must contain the 
remainder of the suspend SMM handler. The pre- 
defined address can be anywhere in the proces- 
sor address space, because the contents of sys- 
tem memory have already been saved to disk. 


4. Execute an RSM instruction, which exits SMM 
and returns control to a predetermined address 
(which must contain the rest of the SMM suspend 
handler). 


5. Save the rest of system memory (that which is 
located underneath SMRAM) to the hard disk. 
This address space can now be accessed with 
normal move instructions, because we are no 
longer in SMM. . 


6. Save a flag (in CMOS memory) indicating that the 
next reset should cause a resume from suspend. 


7. Powerdown the memory (and possibly the proc- 
essor). 


8. When power is restored, the processor is reset 
and begins execution of the POST in BIOS. Early 
in the POST, the system should check the status 
of the suspend flag. | 


9. Load a preliminary SMM handler to location 
38000H and generate an SMI#. The SMM han- 
dler should read the SMBASE slot from the SMM 
state save that was stored to hard disk. SMBASE 
is then modified to point to the final SMRAM loca- 
tion and the system resumes from SMM back to 
the system BIOS. 


10. Restore the contents of system memory located 
underneath SMRAM from the hard disk. 
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11. Generate a second SMI#, which executes an 
SMM handler at the original value of SMBASE 
(before the suspend SMM). The SMM handler 
restores the contents of the rest of system 
memory from the hard disk, and then restores 
the original SMM state save to the SMM state 
save area in SMRAM, discarding the most re- 
cent SMM state save. 


12. Execute an RSM instruction, which returns exe- 
cution to the application that was interrupted by 
the suspend request. 


Interrupts and Exceptions During SMM 
Handler Routines 


To ensure transparency to existing system software, 
the SMM handler should not depend on interrupt or 
exception handlers provided by the operating sys- 
tem. However, in some cases it may be necessary to 
service interrupts or exceptions while in System 
Management Mode. In these cases, SMM compliant 
interrupt and exception handlers, as well as an SMM 
compliant interrupt vector table, should be provided. 


SMM COMPLIANT VECTOR TABLES 


An SMI# interrupt request can be generated while 
code is running under any of the other three proces- 
sor operating modes (Real, Virtual-86, or Protected). 
When entering the SMM handler, the processor en- 
ters a pseudo-real mode, and the beginning of the 
interrupt vector table must be located at the address 
OOOOOOOOH. Before allowing any interrupts or ex- 
ceptions to occur, the SMM handler routine must 
provide a valid interrupt vector table. Any code that 
is executed before setting up an SMM compliant in- 
terrupt vector table must be written carefully to en- 
sure that no exceptions are generated. 


The system memory controller could relocate ac- 
cesses to the SMM interrupt vector table to a loca- 
tion within SMRAM. In this case, when SMIACT # is 
active, all accesses to the lowest 1 Kbyte of the 
processor address space would be redirected to 
SMRAM, which would contain an SMM compliant 
vector table that has already been initialized. 


lf the system memory controller does not redirect 
interrupt vector table reads to an address within 
SMRAM, there are three steps required to provide 
an SMM compliant interrupt vector table: 


1. Save the contents of memory at address 
OOO00000H to SMRAM 
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2. Provide vectors for any possible interrupts or ex- 
ceptions at the appropriate location in the vector 
table 


3. Restore the original memory contents from 
~ SMRAM before exiting the SMM handler routine 


INTERRUPTS AND SUBROUTINES WITH 
SMRAM RELOCATION 


There is an additional issue that must be considered. 


if interrupts or exceptions are to be executed within 
SMM and SMRAM has been relocated. Interrupt or 
subroutine calls from within SMM operate in a man- 
ner similar to Real Mode. When a subroutine is 
called or an interrupt is recognized, the 16-bit CS 
and IP registers are pushed onto the stack to pro- 
vide a return address. 


When SMRAM is relocated to an address space 
above 1M and an interrupt or subroutine call occurs, 
only 16 bits of the EIP register are pushed onto the 
stack. When returning from the subroutine or inter- 
rupt, the processor will vector to a location where 
the upper 16 bits of EIP are zero. This can be avoid- 
ed for subroutines by using an adciress size override 
before calling the subroutine. However, the issue re- 
mains for interrupts. 


Intel486 DX, IntelDX2, and IntelDX4 
Processor Floating Point Operation 
and SMM : 


THE NEED TO SAVE THE FPU ENVIRONMENT 


When the processor enters System Management 
Mode, the context information for the interrupted ap- 
plication is automatically saved to a specific state 
save address. When the SMM handler returns con- 
_ trol to the interrupted application by. executing the 
RSM instruction, the context information from the in- 
terrupted application is restored to the processor by 
reading from the state save location. This mecha- 
nism allows the SMM handler routine to modify most 
of the processor registers without the need to explic- 
itly save them to memory. However, the registers in 
the processor’s Floating Point Unit (FPU) are not au- 
tomatically saved when the processor enters SMM. 
If the SMM handler needs to modify any of the regis- 
ters in the FPU, or if the register data will be lost due 
to entering a power down state, the SMM handler 
must first explicitly save the FPU state as it existed 
in the interrupted application. 
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There are two instances in which an SMM handler 
routine must be aware of the Floating Point Unit 
(FPU): | 
1. When removing power from the processor / FPU 


for the purpose of executing a suspend se- 
quence. 


2. When the SMM handler uses FPU instructions. 


In both of these cases, the SMM handler must save 
the state of the FPU as it was left by the interrupted 
application. : : 


The information stored by the FPU state save in- 
structions (FSAVE, FNSAVE, FSTENV, and 
FNSTENV) is dependent on the operating mode of 
the processor. The FPU state save instructions store . 
the FPU state information in one of four formats: 16- 
bit Real Mode, 32-bit Real Mode, 16-bit Protected 


. Mode, or 32-bit Protected Mode, depending on the 


processor operating mode. The content of the infor- 
mation saved also varies slightly, depending on the 
processor operating mode in which the save instruc- 
tion was executed. For example, the 32-bit Protect- 
ed Mode FNSAVE instruction saves the address of 
the last executed FPU instruction and its operands 
in the form of a segment selector and a 32-bit offset. 
In contrast, the 16-bit Real Mode FNSAVE instruc- 
tion saves the address information in the form of a 
20 bit physical address. Because the format with 
which the FPU state restore instructions (FRSTOR 
and FLDENV) recall the information is also depen- 
dent on the operating mode of the processor, the 
save and restore instructions must be executed from 
the same processor operating mode. 


SAVING THE STATE OF THE FLOATING POINT 
UNIT 


When an SMM handler routine needs to save the 
state of the Floating Point Unit, it must save all FPU 
state information necessary for the interrupted appli- 
cation to continue processing. This state information 
includes the contents of the Floating Point Unit 
stack, which requires use of the FNSAVE or FSAVE 
instruction (FSTENV does not save the contents of 
the FPU stack). If the last executed non-control 
Floating Point instruction caused an error (such as a 
divide by 0), the saved information must also include 
the address of the failing instruction and the ad- 
dresses of any operands for that instruction. Without 
these addresses, it would be impossible for the FPU 
exception handler of the interrupted application to 
correct the error and restart the instruction. 
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The FNSAVE and FSAVE instructions differ in that 
FNSAVE does not wait for the FPU to check for an 
existing error condition before storing the FPU envi- 
ronment information. If there is an unmasked FPU 
exception condition pending, execution of the 
FSAVE instruction will force the processor to wait 
until the error condition is cleared by the software 
exception handler. Because the processor is in Sys- 
tem Management Mode, the appropriate exception 
handler will not be available, and the FPU error 
would not be corrected in the manner expected by 
the interrupted application program. For this reason, 
the FNSAVE instruction should be used when saving 
the environment of the FPU within SMM. | 


Because the SMM handler does not know the proc- 
essor mode in which the interrupted application was 
executing (16 or 32 bit, Real or Protected), the SMM 
handler must execute the FNSAVE instruction in a 
mode in which all FPU state information is stored. 
The 32-bit Protected Mode format of the FNSAVE 
instruction is a super set of all other formats of the 
FNSAVE instruction. Therefore, executing the 32-bit 
Protected Mode FNSAVE instruction ensures that all 
FPU state information will be saved. 


Executing the FNSAVE instruction in 32-bit Protect- 
ed Mode requires that the processor be temporarily 
placed in Protected Mode. Rather than perform all of 
the setup details and overhead necessary to place 
the processor into Protected Mode, including the ini- 
tialization of all descriptors and descriptor tables, it 
is possible to temporarily place the processor into 
Protected Mode for the purpose of executing only a 
few carefully written instructions. This can be ac- 
complished by setting the PE bit in the CRO register, 
and then executing a short jump to clear the instruc- 
tion pipelines. 


It is important to note that any instruction that modi- 
fies a segment register will cause the processor to 
attempt to load a new descriptor from the descriptor 
table. (The occurrence of an interrupt or an excep- 
tion would cause the processor to load a new de- 
scriptor, so interrupts must be disabled during this 
sequence.) Because neither the descriptors nor the 
descriptor table have been initialized, this would 
cause the system to crash. Therefore, all segment 
registers that are to be used in the FPU state save 
instructions must be initialized before entering Pro- 
tected Mode. 
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Example E-2 gives an example of the code that can 


_ be used to place the processor in Protected Mode 


and save the FPU state. 


Note that the no wait form (FNSAVE) of the save 
instruction must be used. In the event that the previ- 
ous FPU instruction caused a floating point error, we 
do not want to wait for this error to be serviced be- 
fore executing the save instruction. Additionally, if 
the FSAVE instruction were used, the operand size 
override prefix would be incorrectly applied to the 
implicit WAIT instruction which precedes FSAVE, 
rather than to the save instruction itself. — 


Before exiting the SMM handler and returning to the 
interrupted application, the register contents. of the 
Floating Point Unit must be returned to their previ- 
ous values. This can be accomplished by executing 
the 32-bit Protected Mode format of the FRSTOR 
instruction. Example E-3 gives an example code 
segment that can be used to restore the FPU to the 
state in which it was interrupted by the SMI request. 


Note that the no wait form (FNRSTOR) of the re- 
store instruction must be used. If the FRSTOR in- 
struction were used, the operand size override prefix 
would be incorrectly applied to the implicit WAIT in- 
struction which precedes FRSTOR, rather than to 
the save instruction itself. 


Support for Power Managed 
Peripherals | 


SHADOW REGISTERS 


Before power is removed from any device, the state 
of that device must be saved in a protected memory 
space so that the device can be reinitialized to its 
previous state. If a peripheral contains a write only 
register, the value in that register can be recovered 
by providing shadow registers that are both readable 
and writeable. 


These shadow registers should be updated every 
time the peripheral registers are written, but they 
have no function other than tracking the data written 
to a particular register. 
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Example E-2. Saving the FPU State in 32-Bit Protected Mode 


;first initialize the registers used to store the state save information 
;SEGMENT is the segment to be used by 


mov ax, SEGMENT 


: ;the save instruction, 
mov ds,dx ; normally it should point to SMRAM 
mov si,OFFSET 


;set the PE bit in CRO 
mov eax,crd 
or . eax,00000001H 
mov crQ, eax 


;OFFSET is the offset used in the save 
;instruction 


:read the old value of CRO 
;set the PE bit 


;enter protected mode Sy executing a short jump to clear the prefetch queue 


jmp protect 
protect: 


;we can now save the state of the FPU in the protected mode format 


db 66H 


;use an operand size override prefix 


;to set 32-bit format 


fnsave [si] 


:FPU state saved to SEGMENT:OFFSET 


~ynow return to real mode to continue with the SMM handler (no jump is 


; required) 


mov eax,crd 
and eax, OFFFFFFFEH 
mov cr0,eax 


In addition to the write only registers in a system, 
there are several other registers that must be shad- 
owed. Any device that requires registers to be pro- 
grammed in a particular sequence must also have its 
registers shadowed. Examples in a typical personal 
computer include the programmable interrupt con- 
troller, the DMA controller, and the programmable 
timer/counter. 
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;Clear the PE bit in CRO 
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It is also possible to perform shadowing of some 
write only registers using SMM. Any time a write cy- 
cle is generated to a write only register, the system | 
can generate an SMI#. The SMM handler can use 
the processor state information saved in the SMM 
state save to save the data from the interrupted I/O 
cycle to a predetermined location in the SMRAM 
space. 
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Example E-3. Restoring the FPU State from a 32-Bit Protected Mode Save 


;first initialize the registers used to recall the state save information 


mov ax, SEGMENT 


; SEGMENT is the segment to be used by 


;the restore instruction, 


mov ds,dax 
mov Ssi,OFFSET 


j;normally it should point to SMRAM 
;OFFSET is the offset used in the 


;restore instruction 


;set the PE bit in CRO 


mov eax,crod 
or eax,00000001H 
mov crQ, eax 


;read the old value of CRO 
;set the PE bit 


;enter protected mode by executing a short jump to clear the prefetch queue 


jmp protect 
protect: 


;we can now recall the state of the FPU from the previous FNSAVE instruction 


;(in the protected mode format) 


db 66H 


;use an operand size override prefix 


;to set 32-bit format 


fnrstor 


;FPU state restored from 


; SEGMENT: OFFSET 


;now return to real mode to continue with the SMM handler (no jump is 


; required) 


mov eax,crod 
and eax, FFFFFFFEH 
mov cr0,eax 


The information contained in the SMM state save 
can be used (with the knowledge that the SMI # was 
in response to an I/O write instruction) to determine 
both the address and the data of the interrupted 
write instruction. The SMM handler can examine the 
OPCODEs of previous instructions by decrementing 
the IP (or EIP) register. Once the correct OPCODE is 
determined, it can be used with the values in the 
EAX and DX slots of the SMM state save to update 
the information in the memory used to shadow the I/ 
O register. |/O write instructions occur in one of 
three forms: 1) a write to an address that is specified 
in the OPCODE; 2) a write to an address contained 
in the DX register; or 3) a string write to an address 
contained in the DX register. 


The !/O write instructions have the following 
OPCODEs: 


;Clear the PE bit in CRO 
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Table E-1. 1/O Write Instruction OPCODEs 


[Instruction | OPCODE | Notes 
OUT x,al E6x x is the address of 
the 1/O port 
OUT x,ax E7x x is the address of 
| the 1/O port 
OUT x,eax E7x x is the address of 
the 1/O port 
OUT dx,al 
OUT dx,ax 
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The SMM handler must know whether a particular |/ 
O port is 16 or 32 bits in order to distinguish between 
16 and 32 bit I/O write cycles. 


| The SMM handler can decrement the value of IP 


contained in the state save, and then examine the | 


memory contents at that address. If the SMM han- 
dler knows that the last instruction was an I/O write 
instruction, and writes to |1/O addresses 6EH, 6FH, 
OEEH, and OEFH will not cause an SMI#, it can use 
the SMM state save data for EAX and EDX to recon- 
struct the last instruction. 


HANDLING INTERRUPTED I/O WRITE 
SEQUENCES 


In a typical personal computer, there are several 
hardware devices that require the control registers 
‘for that device to be programmed in a particular or- 
der. For example, the interrupt controller, the DMA 
controller, the programmable timer/counter, the key- 
board controller, and the real time clock all require a 
series of accesses to properly initialize the registers 
in that particular device. Some of these devices may 
require successive accesses to registers located at 
different addresses, while others may require sever- 
al control registers to be programmed through write 
cycles to the same address. 


If an SMI request interrupts an application that is in 
the process of initializing the registers in one of 
these devices, special care must be taken to ensure 
that the peripheral is returned to its original state 
when control is returned to the interrupted program. 
For some SMM handler events, it may be necessary 
to power down the device or change the state of a 
register within the device. In these cases, the SMM 
handler must return control to the interrupted appli- 
cation in such a way that the application can contin- 
ue with the correct sequential access in the inter- 
rupted sequence. 
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To accomplish this, the SMM handler must restore . 
the original values of all registers in the device, and 
restart the interrupted sequence so that the applica- 
tion may continue where it left off. This requires sys- 
tem hardware to shadow all registers that need to be 
accessed in the sequence, keep an index indicating 
which position in the sequence the register occu- 
pies, and keep a pointer so that SMM software 
knows to which register the last access was direct- 
ed. This pointer would indicate the last register of 
each sequence that was programmed in the particu- 
lar peripheral. 7 


For example, programming the master interrupt con- 
troller requires a write to I/O port 20H (ICW1) fol- 
lowed by four write cycles to 1/O port 21H (ICW2, 
ICW3, ICW4, and OCW1). If this sequence is inter- 
rupted by an SMI request, and the resulting SMM 
handler either modifies or powers down the interrupt 
controller, the SMM handler must return control to 
the interrupted application such that the following 
access to the interrupt controller would access the 
correct register in the sequence. System hardware 
must save the contents of each of the registers, as 
well as a pointer indicating which register was last 
written. 


Before returning control to the interrupted applica- 


tion, the SMM handler must initialize ICW1-ICW4 


~ and OCW‘1 to their previous values. It would then re- 


write the appropriate registers so that the first ac- 
cess by the application program would be to the lo- 
cation in the sequence following the last location it 
programmed before it was interrupted by the SMI 
request. 


A similar procedure must be followed for each of the 
peripherals that require control registers to be initial- 
ized in a particular order. 
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Intel’s 82420 PCliset enables workstation level of performance for Intel486™ CPU desktop systems. The 
Peripheral Component Interconnect Bus (PCI) is driving a new architecture for PC’s—eliminating the I/O 
bottleneck of standard expansion busses. PCI provides a glueless interface for high performance peripherals 
such as graphics, SCSI, LAN and video to be placed onto a fast local bus. By utilizing this technology and 
incorporating read/write bursts along with write buffers into the 82420 PClset, a new level of performance is 
now possible for today’s Intel486 CPU desktop systems. 


The Intel 82420 PCliset is comprised of three components: the 82424ZX Cache DRAM Controller (CDC), the 
82423TX Data Path Unit (DPU), and the 82378ZB System I/O (SIO). The CDC and DPU provide the core - 
system architecture while the SIO is a PCI master/slave agent which bridges the core architecture to the ISA 
standard expansion bus. Intel also offers two components, the 82374EB (ESC) and 82375EB (PCEB), that 
work in conjunction to bridge the PCI bus to the EISA expansion bus. Refer to the ESC and PCEB data sheets 
for information regarding the EISA bridge components. 


The chip set supports the Intel486 CPU family as well as Intel’s future OverDrive™ processor for the 
Intel486 DX2 CPU. The high performance memory subsystem supports concurrent operation between PCI bus 


masters while the CPU accesses memory. An integrated second level cache can be programmed for write- 
through or write-back operation. 
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Product Highlights 
82378ZB—System 1/0 Component 


82424ZX—Cache DRAM Controller (SIO) 
(CDC) m Supports Fast DMA Type A, B, or F 
m Concurrent Linefill ne Copyback Cycles 

Cycles 


Supports DMA Scatter/Gather 
Arbitration Logic for Four PCI Masters 
Reusable across Multiple Platforms 
Directly Drives Six External ISA Slots 


Integrates Many of Today’s Common 
I/O Functions 


208-Pin QFP Package 


m Supports Intel486 CPU Family and 
; OverDrive Processors 


m Supports OverDrive Upgrade 
m 64K-512K Level 2 Cache Support 


m Level 2 Cache Configurable as Write- 
Back or Write-Through 


m 208-Pin QFP Package 


82423TX—Data Path Unit (DPU) 
Highly Integrated 

m Four Dword Write Buffers 

m Zero Wait-States for CPU Write Cycles 

# 

a 


PCI Burst Write Capability 
160-Pin QFP Package 


Product Description 


The 82424ZX Cache DRAM Controller (CDC) is a single-chip bridge from the CPU to the PCI bus. It provides 
the integrated functionality of a second level cache controller, a DRAM controller, and a PCI bus controller. It 
also features an optimized memory subsystem. The CDC is a dual ported device with one port as the host port 
and the other as the PCI port. 


The 82423TX Data Path Unit (DPU) integrates the host data, memory data, and PCI data interface, DPU 
control/parity and four deep posted write buffers. With glue and buffers integrated directly into the DPU, the 
Intel 82420 PCiset reduces board space requirements. The DPU’s posted write buffers allow CPU write cycles 
- to be executed as 0 wait-states. 


The 82378ZB System I/O (SIO) is a dual ported device which acts as a bridge between the PCI and standard 


ISA I/O bus. The SIO integrates the functionality of an ISA controller, PCI controller, fast 32-bit DMA controller, 
and standard system I/O functions. | 
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82420EX PCISET DATA SHEET | 
82425EX PCI SYSTEM CONTROLLER (PSC) | 
AND 82426EX ISA BRIDGE (IB) 


m Host CPU m Programmable Attribute Map for First 
— 25-33 MHz Intel486™ and 1 MByte of Main Memory : 
OverDrive™ Processors , 
‘ mw 100% ISA Compatible 
— L1 Write-Back Support  —Directly Drives 5 ISA Slots 
m Integrated DRAM Controller 
— 1 to 128 MByte Main Memory ee 
— 70 ns Fast Page Mode DRAM SIMMs — 27-bit Addressability 
Supported = : 
— Supports 256 KByte, 1 MByte, and wombat? DMA Transfers 
4 MByte Double and Single Sided m One 82C54 Timer/Counter 
SIMMs — System Timer 
— Read Page Hit Timing of 3-2-2-2 at — Refresh Request 
33 MHz — Speaker Tone 
— Burst Mode PCI Master Accesses m Two 82C59 Interrupt Controllers 
— Decoupled Refresh Reduces DRAM — 14 Interrupts 
Latency — Edge/Level Sense is Programmable 
— Five RAS Lines per Channel 
m Integrated L2 Cache Controller —— PCI Interrupt Steering for Plug and 
— Write-Back and Write-Through Cache Play Compatibility 
Policies eas m X-Bus Peripheral Support 
— Direct Mapped Organization — RTC, KBC, BIOS Chip Selects 
— 64, 128, 256 or 512 KByte Cache — Control for Lower X-Bus Transceiver 
Sizes — Integrates Mouse Interrupt 
— Programmable Zero Wait-State L2 — Coprocessor Error Reporting 


Cache Read and Write Accesses 
— Two Banks Interleaved or a Single m Non-Maskable Interrupts (NMI) 


Bank Non-Interleaved Operation _ ie System Errors 7 
— No VALID Bit Required — Main Memory Parity Errors 


25/33 MHz PCI Bus Interf — ISA Parity Errors 
" — Two Be Masters es m System Power Management (intel SMM 


a : : Support) 
PCI Auto Solneweuen Support — Programmable System Management 
m Host/PCli Bridge Interrupt (SMI)—Hardware Events, 
— Converts Back-to-Back Sequential Software Events, EXTSMI# 
Memory Writes to PCI Burst Writes — Programmable CPU Clock Control 
— CPU Memory Write Posting to PCI — Fast On/Off Mode 
m PCI Local Bus IDE Interface m Generates System Clocks 


— Supports Mode 3 Timing 


160-Pin QFP Package for IB 
m 208-Pin QFP Package for PSC 


December 1994 
Order Number: 290488-003 2-401 
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The 82420EX PClset is the foundation for the Value Flexible Motherboard solution for entry-level Intel4867™ 
processor-based PCI systems. The Value Flexible Motherboard solution, including 82420EX, Intel486 proces- 
sor, 82091AA Advanced Integrated Peripherals, 82C42 Keyboard Controller, Flash BIOS, and Plug & Play 
software, drives PCI into the mainstream. The 82420EX PClset is a highly integrated solution enabling low 
cost, small form factor motherboard designs. All Intel486 processors and upgrades are supported, including L1 
write-back and Intel SMM power management. PCI Local Bus IDE is incorporated for higher performance IDE 
at no additional cost. | 


The 82420EX was designed from the ground up for PCI performance. It consists of two components—the 
82425EX PCI System Controller (PSC) and the 82426EX ISA Bridge (IB). The PSC integrates the L2 cache 
controller and the DRAM controller. The cache controller supports both write-through and write-back cache 
policies and cache sizes from 64 KBytes to 512 KBytes in an interleaved or non-interleaved configuration. The 
DRAM controller interfaces main rnemory to the Host Bus and the PCI Bus. The PSC supports a two-way 
interleaved DRAM organization for optimum performance. Up to ten single-sided SIMMs or four double-sided 
and two single-sided SIMMs provide a maximum of 128 MBytes of main memory. The PSC provides memory 
write posting to PCI for enhanced CPU-to-PCI memory write performance. In addition, the PSC provides a high 
performance PCI Local Bus IDE interface. | 


The IB is the bridge between the ISA Bus and Host Bus, and integrates the common I/O functions found in 
today’s ISA-based PC systems—a seven channel DMA controller, two 82C59 interrupt controllers, an 8254 
timer/counter, Intel SMM power management support, and control logic for NMI generation. The IB also 
provides the decode for external BIOS, real time clock, and keyboard controller. Edge/Level interrupts and 
interrupt steering are supported for PCI plug and play compatibility. The IB integrates the ISA address and data 
path, reducing TTL and system cost. In addition, the integration of system clock generation logic eliminates the 
need for external host and PCI clock drivers. 
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1.0 INTRODUCTION 


This section discusses CPU performance optimization 
techniques for the Intel486™M DX2 microprocessor. 
The reader should be familiar with the Intel486™ DX 
microprocessor as well as knowledgeable about memo- 
ry systems and cache architectures. For further refer- 
ence, the reader is directed to the following documents 
and application notes (corresponding Intel order num- 
ber is shown in parentheses): 


@ Intel486™ DX2 Microprocessor Data Book 
(241245-001) 


e Intel486™ DX Microprocessor Data Book 
(240440-004) 

@ Intel486™ DX Microprocessor Hardware Refer- 
ence Manual (240552-001) 

® Cache Tutorial 1991 (296543-002) 


© AP447: A Memory Subsystem for the Intel486™ 
Family of Microprocessors including Second Level 
Cache (240799-002) 

© 485TurboCache Module Intel486T™ DX Micro- 
processor Cache Upgrade (240722-002) 


ZB 
Wi 
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2.0 The High-Performance 
Intel486™ DX2 Microprocessor 


The Intel486 DX2 Microprocessor is functionally 
equivalent to the now-familiar Intel486™ DX Micro- 
processor. However, the Intel486 DX2 CPU’s internal 
core runs at twice the frequency of its external bus. 
This architecture enables a very high level of perform- 
ance while, at the same time, maintaining straightfor- 
ward system design. 


The Intel486 DX2 CPU is partitioned such that the 
cache and write buffers operate at the full core speed as 
illustrated in Figure 2.1. As such, the processor is only 
slowed to the external bus speed on cache misses and 
when the write-buffers are full. The Intel486 DX2 
CPU’s external bus interface is identical to its predeces- 
sor, i.e. all system cycles emanating from the CPU look 
exactly as if they would from the Intel486 DX CPU. 
The Intel486 DX2 microprocessor includes additional 
features such as JTAG boundary scan and power-down 
capability that are not covered in this section. 


66 MHz 
intel486™ DX 
Microprocessor 
Core 


Microprocessor 


os 33 MHz Intel486™ Dx 
External Bus 


241261-1 


Figure 2.1. The 66MHz Intel486™ DX2 CPU Internal Architecture 


MS-DOS, Windows, Word, Excel and Flight Simulator are trademarks of Microsoft Corporation. 


UNIX is a trademark of UNIX System Laboratories. 


Lotus, 123 are trademarks of Lotus Development Corporation. 


Turbo C is a trademark of Borland International. 
AutoCAD is a trademark of AutoDesk, Inc. 
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Performance optimization for the Intel486 DX2 CPU is | 


subtly different than for the Intel486 DX CPU due to 
the difference in the internal architecture. This is evi- 
dent if you consider that external memory latencies 
now affect the full speed core by twice as many CPU 


clocks (refer to Figure 2.2). In other words, the memo-— 


ry system should really be designed to satisfy the data 
throughput demands of a 50/66MHz CPU. The next 
few sections examine the situation in detail so that edu- 
cated trade-offs can be made during system design. The 
discussions will focus on CPU-cache memory perform- 
ance; I/O performance and other architectural issues 
are not addressed in this applications note. 


In an ideal system, all CPU cycles operate at zero-wait 


states and the theoretical maximum performance of the 
CPU is achieved. However, short of spending a lot of 
money on SRAMs, a real system always falls short of 
the ideal zero wait-states. There are many cache-memo- 
ry designs that differ in both architecture and imple- 
mentation. However, it may be impossible to design a 
system that performs better than all others across all 
applications; different applications generate different 
- types of CPU bus activity and the cache-memory sys- 
tem will perform differently in each case. A range of 
statistical parameters may be used to illustrate this 
point: 

@ Internal Cache (L1) Hit Rate 

¢ Number of prefetches 

® Number of operand reads 

® Number of operand writes 


¢ Bus Utilization (amount of time spent using the 
CPU bus). 


_ These parameters are examined next and will be useful 
information for cache-memory design trade-offs. 


50 MHz/66 MHz 


25 MHz/33 MHz 


Intel486™ DX 
CPU 
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2.1 Internal Cache Hit Rates 


_ The Intel486 DX2 CPU maintains the same unified 


code/data, four-way interleaved, 8K-byte internal 
cache as the Intel486 DX CPU. The internal cache 
(L1) hit rate is shown in Figure 2.3 for some different 
Operating systems and applications. These hit rates 
were obtained from instruction traces captured from 
specific applications. Note that the L1 hit rate for the 
Intel486 DX2 CPU will be almost identical to that for 
the Intel486 DX CPU; the 2X-internal frequency does 
not significantly alter the cache miss statistics. 


The DOS applications included Auto Cad, Lotus123, 
Excel, Turbo C, and Flight Simulator. Lotus123 had 
the highest hit rate at 99% while Auto Cad was the 
lowest at 89.6%. The Windows 3.0 results included in- 
structions executed while clipping an image, drawing a 
dialog box, executing Excel and while executing Page- 
maker. The category UNIX-iSPEC refers to applica- 
tions within the UNIX SPEC benchmark suite that are 
mostly integer intensive, whereas UNIX-fSPEC refers 
to those which are floating point intensive. The last 
category UNIX-TP!1 refers to the hit rates typical while 
running the TP1 transaction processing benchmark. 


These results illustrate the different nature of different 
software on the bus characteristics. Single-threaded 
DOS applications have a typically higher hit rate com- 
pared to the multi-tasking nature of UNIX and Win- 
dows benchmarks. The UNIX floating point bench- 
marks show the lowest hit rates; this is partly due to the 
large data structures typical of floating point applica- — 
tions that do not fit well into the L1 cache. It is also 
due to the nature of the operations performed on those 


data structures; i.e. the application does not exhibit 


very much temporal or spatial locality. 


@ External Bus Speed 


@ Sensitivity to Memory 
Latency (Internal Core 
Frequency) 


Intel486™ DxX2 


CPU 


241261-2 


Figure 2.2. The Intel486™ DX2 Microprocessor is More 
Sensitive to Memory Latency than Its Predecessor 
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Windows 
UNIX-fSPEC 
UNIX-TP 1 
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Figure 2.3. There will be a Range of L1 Hit Rates for Different Applications/Operating Systems 


TP1 is a UNIX multiuser multithreaded application 
with heavy amounts of disk and I/O as well as compu- 
tation. The L1 cache hit rate is low since it has many 
active contexts due to multiple requests per user. 


High L1 hit rates are typical of many prevalent and 
commonly used DOS benchmarks - some have hit rates 
approaching 100%. This is unfortunate since they fail 
to properly account for the more realistic external 
cache and memory demands of most DOS and Win- 
dows applications. These demands will continue to be 
true with the advent of newer graphical-user-interface- 
based multi-tasking operating systems and applications. 
With these benchmarks, there is a danger of misrepre- 
senting system performance for the Intel486 DX micro- 
processor. For the Intel486 DX2 microprocessor, this 
misrepresentation can be even more damaging since L1 
cache misses incur a penalty that 1s twice the number of 
external clock cycles - since the CPU core 


Prefetches 
(49.5%) 


(33.5%) 


Hit Rate 
93.0% 


now runs twice as fast internally. In other words, the 
Intel486 DX2 CPU is twice as sensitive to wait states 
compared to the Intel486 DX CPU. To properly gauge 
the external cache and memory performance, more de- 
manding benchmarks (e.g. UNIX SPECmarks) or real 
application benchmarks should be used. 


2.2 Bus Cycle Mix 


Different applications cause the CPU to generate a dif- 
ferent number of reads, writes and instruction prefetch- 
es. The reads and prefetches are filtered by the internal 
L1 cache before reaching the external CPU bus. All 
writes propagate through to the external bus since the 
internal cache follows a write-through protocol. This is 
shown in Figure 2.4 for an instruction trace captured 
from an integer SPEC benchmark. . 


Reads/Prefetches (24.5%) 


Writes 
(17%) Writes (75.5%) 


Internal Bus: 
Number of Cycle 
Types 


External Bus: 
Number of Cycle 


Types 
a 241261-—4 


Figure 2.4. The Effect of L1 Cache Hits on the External Bus Cycle Mix for an Integer SPEC Benchmark 
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As with the Intel486 DX CPU, the bus cycle mix for 
the Intel486 DX2 CPU consists of mostly write cycles. 
However, the exact ratio of reads to writes is again 
application dependent. For example, for Lotus123 that 
has a L1 hit rate of 99%, writes make up 99.5% of 
external bus cycles. 


2.3 Bus Utilization 


Bus utilization refers to the amount of time that the 
CPU spends executing bus cycles for a given applica- 
tion. It is a measure of the amount of bus traffic gener- 
ated by a particular application on the CPU’s external 
bus. This metric is illustrated in Figure 2.5 where the 
CPU bus is busy for 75% of the twelve external clock 
cycles shown. 


intel. 


Bus utilization is dependent on the application and the 


external cache/memory system. Different applications 
generate different amounts of bus traffic. For example, 


"some applications may have small data structures that 


fit easily within the internal cache and therefore smaller 
amounts of external bus cycles are generated. 


The Intel486 DX2 CPU will have a larger percentage 
of bus utilization for the same application as compared 
to the Intel486 DX CPU. This is due to the faster CPU 
core that can now operate twice as fast and that will try 
to generate twice aS many bus cycles in the same 
amount of time. However, since the external CPU bus 
remains at a 1X-frequency, it experiences heavy 
amounts of bus traffic as it trys to keep up with the 2X- 
internal core. In other words, with faster internal core 
execution, less time is spent idling on the external bus. 


Different external cache/memory systems also affect 
the bus utilization; faster cache/memory systems allow 
CPU cycles to complete faster and therefore free up the 
bus more. 


12 clocks 
[qe a =, Og we Ge ee ey Po yk 
ADS# L_J LI 


External Clock 


BRDY# L_J l J 


busy idle busy 


25% 25% 50% 
( ) ( ) ( 241261-5 


. Figure 2.5. Definition of Bus Utilization 
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2.4 Profiles of Some Applications traces were subsequently used in a CPU-cache-memory 

simulator to extract the desired information. The re- 
The Intel486 DX2 CPU bus characteristics of some dif- sults shown here assume an ideal zero wait-state memo- 
ferent applications and operating systems are shown in —_—s‘ry system; i.e. all bus cycles complete in zero wait- 


Tables 2.1 through 2.3. These results are derived from states. 
traces captured from the actual applications. These 


Table 2.1. Bus Profiles of UNIX Applications with a Zero Wait-State Memory System 


UNIX Applications | SPEC1 | GCC | UNIXMIX1 | 
Total Number of CPU clocks simulated | 12.14M | 12.54M 12.44M 


Overall L1 hit rate 
Prefetch hit rate 
Read hit rate 


Write hit rate 


Number of external bus cycles 
% bus code prefetches 
% bus data reads 
% bus data writes 


Bus Utilization 51.6% 46.4% 51.4% 


Table 2.2. Bus Profiles of Windows Applications with a Zero Wait-State Memory System 


Windows Applications | Word | Excel—Calc 
Total Number of CPU clocks simulated | 27.02M 7.39M 30.06M 


Overall L1 hit rate 
Prefetch hit rate 
Read hit rate 
Write hit rate 


Number of external bus cycles 
% bus code prefetches 
% bus data reads 
% bus data writes 


Bus Utilization 40.9% 60.1% 56.0% 


Table 2.3. Bus Profiles of DOS Applications with a Zero Wait-State Memory System 


DOS Applications | Excel | Turbo | Auto Cad 


Total Number of CPU clocks 11.1M 13.9M 16.1M 
simulated 


Overall L1 hit rate 
Prefetch hit rate 
Read hit rate 

Write hit rate 


Number of external bus cycles 
% bus code prefetches 
% bus data reads 
% bus data writes 


Bus Utilization 41.0% 40.6% 55.5% 
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The UNIX applications are described as: 


e SPEC1: A mixture of integer SPEC benchmark suite 
programs running concurrently. 


® GCC: SPEC benchmark suite GNU C ae 
compiling itself. 


@¢ UNIXMIX1: A mixture of UNIX utility programs 
like awk and grep, running concurrently. 


The Microsoft Windows 3.0 applications are described 
as: | 


© Word: Microsoft Word for Windows converting a 
document for import (no VGA activity, includes 
kernel calls). 


@ Excel—Calc: Microsoft Excel for Windows running 
a calculation. 


e Pagemaker: Pagemaker for Windows formatting a 
document (no VGA activity, includes kernel calls) 


The three DOS applications shown are described as: 


e Excel: Microsoft Excel (DOS version) recalculating 
a spreadsheet. 


¢ Turbo C: Borland’s Turbo C compiler compiling a 
large C program. 


¢ Auto Cad: Auto Desk’s Auto Cad program comput- 
ing and displaying a drawing (reason for low hit 
rate) 


Note that most DOS applications will typically use the 
external bus much less than UNIX or Windows appli- 
cations. On the other hand, heavy duty DOS applica- 
tions such as Auto Cad will actually exhibit a larger 
demand for the external bus similar to the demands of 
UNIX and Windows. Also, recall that the bus utiliza- 
tion numbers shown assume a zero wait-state memory 
system; more realistic external cache/memory systems 
with a finite number of wait-states will actually experi- 
ence a larger percentage of bus utilization. Finally, note 
that since the L1 hit rate for the DOS applications is 
high, the external bus mix consists of mostly writes. 
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Now that we have considered the characteristics of L1 
hit rates, bus cycle mix and utilization, we can examine 
the impact that the cache-memory system has on over- 
all Intel486 DX2 CPU performance. To examine these 
effects, traces were captured from three applications 
(one from each operating environment) and were used 
in a. CPU-cache-memory simulator to measure the 
CPU performance under different conditions. The 
traces used were SPECI (UNIX), Pagemaker (Win- 


2.5 Wait States Explained 


dows) and Turbo C (DOS). The simulator used is an 


accurate and convenient method of comparing per- 
formance by varying different parameters independent- 
ly. This allows us to develop some heuristic rules to 
guide system design. Before continuing, the following 
convention is defined to denote memory performance: 


Notation Convention: 
A memory system’s performance is abbreviated as: 
Lead-off clocks - Burst 2 - Burst 3 - Burst 4, Write 
clocks 
e.g. 3-1-2-1, 3 would look like: 


BURST READ 


A zero-wait-state case corresponds to a 2-1-1-1, 
memory system. 


Figure 2.6. Memory Performance Notation 


2.5.1 THE IDEAL ZERO WAIT STATE MEMORY 
SYSTEM 


As a Starting point, the ideal zero-wait-state memory 
system was characterized for three applications. For 
I/O cycles, it was assumed that a constant 8 wait-states 
were required. Since the I/O instructions were a small 
portion of the instruction traces used, any inaccuracy 
due to this assumption will be insignificant. 
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The total execution times reported are as follows: 


Table 2.4. Number of Internal Clocks (millions) 
Needed to Complete Application Trace 


[speci | pagemaker | Turbo 


Intel486 DX CPU 10.76 26.68 13.34 


Intel486 DX2 CPU | 12.14 30.06 13.90 


As an example, we can now compare the actual time 
required to complete the applications between a 
66MHz Intel486 DX2 CPU and a 33MHz Intel486 DX 
CPU. The number of clock cycles is multiplied by 15ns 
for the Intel486 DX2 CPU and by 30ns for the Intel486 
DX CPU. 


Table 2.5 Total Execution Time in Seconds 


[| spect Pagemater] Turbo 6 


Intel486 DX2 CPU 451 ms 209 ms 


Note that although the Intel486 DX2 CPU’s internal 
clock rate is twice that of the corresponding Intel486 
DX CPU, the relative improvement is less than 100%. 
This is due to cache miss reads and write cycles that 
run at the external bus speed. Note that the improve- 
ment is much greater for Turbo C which has a lower 
cache miss rate and low bus utilization. 


2.5.2 ADDING WAIT STATES 


As wait states are added to the ideal zero wait state 
memory system, performance degrades. Three memory 
parameters are of interest in characterizing the memory 
performance. They are: 


AP-469 


@ Number of wait states added to the first ready of a 
read (a.k.a. the lead-off cycle). e.g. One wait-state 
with zero wait-state burst = 3-1-1-1 


¢ Number of wait states during the remainder of the 
burst cycle. e.g. a zero wait-state lead-off with a one 
wait-state burst = 2-2-2-2 


¢ Number of wait states on a write cycle. e.g. a one 
wait-state write takes three clocks. 


To examine the impact of adding wait-states to each of 
the parameters above, three series of simulations are 
done where wait states are added to each memory pa- 
rameter separately while the other memory parameters 
are held constant: 


Zero 
wait- 1 wait- 2 wait- 3 wait- 
states state states states 


e Lead-off 
Series: 

e Burst Series: 2-1-1-1,2 2-2-2-2,2 2-3-3-3,2 2-4-4-4, 2 

® Write Series: 2-1-1-1,2 2-1-1-1,3 2-1-1-1,4 2-1-1-1,5... 


2-1-1-1,2 3-1-1-1,2 4-1-1-1,2 5-1-1-1,2... 


As the number of wait states increases, the number of 
clocks needed to complete the application increases 
(and the CPU performance decreases). This series of 
measurements is used to separate out the dependency of 
the CPU performance on the different memory parame- 
ters. This information is useful for subsequent external 
cache/memory design trade-offs. 


The number of clocks needed to complete the ‘applica- 
tion relative to the zero wait-state case is referred to as 
the relative total execution time. This metric will be 
used in the following graphs instead of the reciprocal of 
total execution time which would be CPU performance; 
this is so that any inherent linear relationships between 
wait-states and execution time can be more easily rec- 
ognized. To translate the total execution time back to 
CPU performance, the following table is provided for 
convenience (100% refers to the zero wait-state case): 


Table 2.6. CPU Performance versus Total Execution Time 


PRELIMINARY 


Total Execution Time | 100.00% | 110.00% | 120.00% | 130.00% | 140.00% | 150.00% | 160.00% 
CPU Performance 100.00% 90.91% 83.33% 76.92% 71.43% 66.67% 62.50% 
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Figures 2.7 and 2.8 show the total execution time as 
wait states are added for the SPECI! trace described 
earlier. 


As would be expected, as wait states are added, the 
relative total execution time for the Intel486 DX2 CPU 
increases faster than the Intel486 DX CPU. (however, 
note that the absolute total execution time for a In- 
tel486 DX2 CPU will never be greater than a Intel486 
DX CPU of the same external bus speed). Also note 
that the order of importance of the memory parameters 
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changes between the Intel486 DX and the Intel486 
DX2 CPU. For the Intel486 DX CPU, burst perform- 
ance is the most important, with read-lead-off perform- 
ance being slightly more important than writes. This 
conclusion was presented in Chapter 4 of the original 
Intel486 DX Microprocessor Hardware Reference 
Manual (Order Number 240552-001). However for the 
Intel486 DX2 CPU, while burst performance is still the 


most important, write performance becomes more im- 
portant than read-lead-off performance. 


Number of Wait States 241261-7 


Figure 2.7. intel486 DX2 CPU Performance Degradation as Wait States 
are Added - for the SPEC1 Application Trace (UNIX) 
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Figure 2.8. Intel486 DX CPU Performance Degradation as Wait States 
are Added - for the SPEC1 Application Trace (UNIX) 
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Figures 2.9 and 2.10 show the total execution time as 
wait states are added for the GCC trace described earli- 
er. 


Compared to the results for the SPEC1 application 
trace, the GCC results with the Intel486 DX2 CPU still 
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show the burst cycles as being the most sensitive to wait 
states. However, note that the lead-off cycle is now 
more important than write cycles. This is due to the 
small percentage of bus writes (57.4%) while running 
the GCC application as compared with the SPEC1 
trace. 
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Figure 2.9. Intel486 DX2 CPU Performance Degradation as Wait States 
are Added - for the GCC Application Trace (UNIX) 
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Figure 2.10. Intel486 DX CPU Performance Degradation as Wait States 
are Added - for the GCC Application Trace (UNIX) 
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The results for the Pagemaker Application under Win- earlier. Basically, the degradation of the Intel486 DX2 

dows are shown in Figures 2.11 and 2.12. . CPU’s relative execution time increases faster as wait- 
states are added as compared to the Intel486 DX CPU. 

The same conclusions can be made for the Pagemaker 

application under Windows as for the SPEC1 results 
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Figure 2.11. Intel486 DX2 CPU Performance Degradation as Wait States 
| are Addeci - for the Pagemaker Application Trace (Windows) 
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Figure 2.12. intel486 DX CPU Performance Degradation as Wait States 
are Added - for the Pagemaker Application Trace (Windows) 
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Finally, the results for the Turbo C application under external bus utilization of the application. However, the 
DOS are shown in Figures 2.13 and 2.14. degradation is still about twice what it is for the In- 

tel486 DX CPU. Note that the write importance is 
The rate of performance degradation for the Intel486 about equal to the burst importance in this case. This 
DX2 CPU with the Turbo C application is less than the can be attributed to the greater percentage of writes in 
UNIX and Windows examples. This is due to the lower the bus cycle mix for this application. 
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Figure 2.13. Intel486 DX2 CPU Performance Degradation as Wait States 
are Added - for the Turbo C Application Trace (DOS) 
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Figure 2.14. Intel486 DX CPU Performance Degradation as Wait States 
are Added - for the Turbo C Application Trace (DOS) 
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2.5.3 WAIT STATES AND CPU STALLS 


The main reason why the relative performance of the 
Intel486 DX2 CPU degrades faster than the Intel486 
DX CPU is that for every external wait state, two inter- 
nal clock delays are caused. In fact, a zero wait state 
cycle on the external bus of the Intel486 DX2 CPU is 
already a two-wait state cycle as experienced by the 2X- 
clock internal core as shown in Fig. 2.15. The imagi- 
nary 2X-clock versions of the signals ADS# and 
BRDY # illustrate what the cycle might have looked 
like if the internal bus frequen cy was equal to the ex- 
ternal. . 


Extending this fact, this means that a one wait-state 
cycle for the Intel486 DX2 CPU is actually equivalent 
to a four wait-state cycle for the 2X-clock internal CPU 
core. 


The internal cycle start indication conditions may also 
have a one internal clock cycle synchronization penalty 
if it is active in the wrong phase of the external clock 
(also shown in Figure 2.15). 


- As the effective number of wait states increases, the 
~ CPU will stall program execution differently for each of 
the three memory parameters described above. The 
stall conditions for each memory parameter are elabo- 
rated below. 
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Figure 2.15. A Zero-Wait State Write for the Intel486™ DX2 CPU 
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2.5.3.1 Delay Till First Ready of a Read 


Wait states incurred on the first ready of a external 
read (the lead-off cycle) affect both data reads and code 
prefetches. For data reads, the CPU’s execution is 
stalled under most conditions; no other operation can 
happen in parallel until the first ready of the line fill is 
received. For code prefetches, execution is stalled if the 
processor is fetching code as a result of a code branch 
(therefore flushing the prefetch buffers). 


For most applications, the read lead-off delay increases 
the execution time the least compared to the other pa- 
rameters. This is because writes cycles usually make up 
the dominant share of the bus cycles. However, there 
are exceptions to this case; for example, with the GCC 
trace, only 57.4% of the bus cycles were writes. 


2.5.3.2 Wait states on Bursts 


Adding wait states to the burst cycle increases the exe- 
cution time the most. The burst is usually the result of a 
cache line fill or a code prefetch. Adding wait states to 
this parameter ties up the bus for the longest periods of 
time compared to adding the same number of wait 
states to the other parameters. As a result, all subse- 
quent external bus requests are stalled as the CPU waits 
for the burst cycle to complete. These include stalls 
while the CPU waits to do a read cycle. A longer burst 
cycle also delays the rate at which the internal write 
buffers can be emptied since the write buffers must also 
wait for the external bus to free up. This causes stalls as 
described below for write cycles. 
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Finally, longer code prefetch bursts will slow down 
CPU execution if the prefetch was a result of the pre- 
fetch queue being flushed. This is especially so if the 
instruction required extends beyond the first dword of 


the burst and therefore the CPU must wait for subse- 
quent dwords before execution can start. 


2.5.3.3 Write Wait States 


There are three conditions under which a longer write 
cycle will stall CPU execution as additional wait states 
are added. These conditions are: 


1. The write buffers are full and cannot accept any 
more writes. 


2. A read cannot bypass the write buffers and must 
wait for them to be flushed. 


3. A read bypasses the write buffers but must wait for 
an existing write cycle to complete. 


Before these effects are elaborated, it is worthwhile to 
reexamine the operation of the internal write buffers. 


The Intel486 DX2 CPU uses the same four-deep write 
buffers as the Intel486 DX CPU. The write buffers can 
accept data writes from the execution core as fast as 
one per clock. Once a write request is buffered, the 
internal unit that generated the request is free to contin- 
ue processing. When all write buffers are full, any sub- 
sequent write transfer will stall inside the processor un- 
til a write buffer becomes available. 


The bus interface unit can re-order pending reads in 
front of buffered writes. This is done because pending 
reads can prevent an internal unit from continuing, 
whereas buffered writes need not have a detrimental 
effect on processing speed. Writes are propagated to the 
external bus in the same first-in-first-out order in which 
they are received from the internal unit. However, a 
subsequently generated read request (data or instruc- 
tion) may be reordered in front of buffered writes. As a 
protection against reading invalid data (reading stale 
data from a location in main memory when the location 
has been modified in the write buffers), this reordering 
of reads will only occur if all buffered writes are inter- 
nal cache hits. Because an external read will only be 
generated for a cache miss, and will only be reordered 
in from of buffered writes if all such writes are internal 
cache hits, any read generated on the external bus will 
never read a location that is about to be written by a 
buffered write. 


This reordering can only happen once for a given set of 
‘buffered writes, because the data returned by the read 
cycle could otherwise replace data about to be written 
from the write buffers. 


The first condition that causes CPU stalls is when the 
write buffer is full. Write wait states decrease the rate 
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at which the write buffer can be emptied. Since the 
Intel486 DX2 runs at a 2X-internal frequency, the like- 
lihood of filling up the write buffers increases when 
compared to the Intel486 DX CPU as shown in Figure 
2.16. 


66 MHz 
Intel486™ DX2 
CPU Core 


33MHz~ - 
Intel486™ DX 


Write Buffers 


Writes 


33 MHz External 
Memory System 


33 MHz External 
241261-16 
Figure 2.16. The Intel486 DX2 CPU’s 


Write Buffers are More Heavily 
Used than the Intel486 DX CPU’s 


The second situation that degrades performance is dur- 
ing reads which cannot bypass the write buffers - either 
because the buffered writes were cache misses or be- 
cause a read reordering had already occurred. These 
reads will be stalled until the write buffers are emptied. 
The more wait states required for writes on the external 
bus, the longer these stalls will last. 


Finally, reads which can bypass the write buffers may 
be stalled by a write already in progress on the external 
bus. This condition is illustrated in Fig 2.17 for both 
the Intel486 DX and Intel486 DX2 CPUs. Note that 
for this example, although both the Intel486 DX2 and 
Intel486 DX CPUs take the same amount of time to 
complete the instruction stream, the Intel486 DX2 
CPU is stalled longer (waiting for the write to com- 
plete) relative to its own internal 2X clock. 


Out of the three conditions described above, stalls on 
write buffers full and stalls because of reads on busy 
writes dominate the increase in execution time as wait 
states are added to write cycles as shown in Table 2.7 
for the SPEC1 trace. (The results shown in Figure 2.7 
assume that the read and burst cycles complete in zero 
wait-states). These effects will be discussed again later 
when the addition of external write buffers is consid- 
ered. | 
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Figure 2.17. Reads Stalled as a Result of a Write Already In Progress 


Table 2.7 Percentage of Total Execution 
Time Stalled under the Three Different 
Write Stall Conditions 


Intel486 DX Intel486 DX2 
CPU 


CPU 
Writewaitstates|_ o | 1 | 2 | o | 1 | 2 | 


Stalls on write 0.0%10.1% 

buffers full 

Reads cannot 0.1%/0.1% 0.3% |0.5%|0.5% 
overtake writes 


_ 


3.0 Memory Design Optimization 


Some Intel486 DX2 CPU-based designs will include a 
memory system without an external cache. This section 
covers the design of such a cacheless memory system. 
Different memory architectures are discussed and the 
benefits of improving write performance through the 
addition of external write buffes will also be considered 
(see Figure 3.1). 


0.3% 


Main memory performance will be important for exter- 
nal cache-based designs also, especially for applications 
with low external cache hit rates. It is recommended 
that the performance impacts of design choices in a 
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Figure 3.1 The Two Cacheless Intel486 
DX2 CPU-Based Systems Considered 


As discussed in the previous sections, the Intel486 DX2 
Microprocessor requires a fast memory system for opti- 
mum performance. Memory systems that may have 
been adequate for Intel486 DX CPU designs running 
DOS applications may be suboptimal for the Intel486 
DX2 CPU, especially running today’s more demanding 
operating systems and applications. 


Main memory page-mode operation and interleaving 
techniques are important for Intel486 DX2 CPU per- 
formance. These are commonly used in existing, well- 
designed Intel486 DX CPU memory systems. Howev- 
er, some systems still use non-interleaved memory de- 
signs borrowed from Intel386 DX systems. These will 
be less than optimal for a high performance Intel486' 
DX2 CPU workstations design. 


cacheless memory design are understood even if you - 


have already specified an external cache in your design. 
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3.1 Page Mode DRAM 


Page-mode main memory controllers can be imple- 
mented in several fashions. Typical memory systems 
utilize paging for all accesses - during the beginning of a 
read, during read bursts and during write cycles; i.e. the 
RAS# line is held active after all accesses and only 
returned inactive during a page miss. Alternatively, 
paging may be used only for the burst portion of a read 
cycle; the RAS# line always returns inactive after the 
read or write cycle has been completed. This method is 
more commonly used in conjunction with a write-back 
external cache as discussed in Section 4.4. 


For read burst cycles, the 16-byte linefill of data or 
code will always lie within a DRAM page, thereby al- 
lowing the data or code to be strobed out of memory 
with a series of back-to-back CAS# pulses. Paging al- 
lows for a much faster burst cycle compared to the case 
where a full RAS#-CAS# cycle is required for each 
dword. A page mode burst read access to a single bank 
of memory is shown in Fig. 3.2. 


A paged memory system also allows for faster back-to- 
back write cycles. As was true for the Intel486 DX 
CPU, the Intel486 DX2 CPU generates writes in 
strings of two, about 60%-70% of the time, and writes 
in strings of three about 40%-50% of the time. This bus 
characteristic accounts for a large page hit rate for 
writes; therefore, it is faster to perform the back-to- 
back write cycles in a fast page mode rather than per- 
forming a full RAS #-CAS# cycle for each write. 


At this point, it is worthwhile to examine the page hit/ 
miss ratio for the three applications considered in the 
previous section. These results are shown in Table 3.1 
and assume no external cache and a page size of 8192 
bytes. 
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Table 3.1 Page Hit Ratios 


Read Page Hit 31.2% 26.4% 25.4% 
Read Page Miss | 68.8% 73.2% 74.6% 
Write Page Hit 65.5% 68.8% 68.9% 
Write Page Miss | 34.5% 31.2% 31.1% 
Note the low page hit ratio for CPU reads. This is due 
to the internal cache of the Intel486 DX2 CPU that 
filters read requests and tends to make the cache miss 


reads more randomly distributed throughout main 
memory. 


3.2 Interleaving 


Interleaving involves the use of more than one bank of 
memory; different banks are controlled separately. As 
an access is occurring, the other banks are being read- 
ied for the next access. Interleaving can be implement- 
ed in several ways. Horizontally interleaved banks gen- 
erate accesses for consecutive locations in memory. For 
example, one bank can be designated as an odd dword 
and another for the even dword. Vertically interleaved 
banks separate large contiguous regions of memory be- 
tween banks; i.e. multiple DRAM pages are open. 
Memory controllers often combine both methods. 


Horizontal interleaving can be combined with paging to 
generate very quick burst reads. Two 32-bit banks can 
generate a zero wait-state burst as detailed in the Intel 
Applications Note AP447 “A Memory Subsystem for 
the Intel486™ DX Family of Microprocessors includ- 
ing Second Level Cache.” Fig. 3.3 illustrates a burst 
read cycle from a paged-interleaved memory system. 
The signals CASO# and CAS1# drive each of the two 
32-bit banks of memory in this example. 
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Figure 3.2. A Page Mode Burst Read (Page Hit on Lead-Off Cycle) 
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Figure 3.3. Burst Read Cycle froma Paged-Interleaved Memory System 
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Some implementations of a two-bank interleaved mem- Table 3.2. Memory Systems used 
ory may be limited to a 1-2-1 burst cycle for the last for the No-Cache System Test 
three dwords. This is mainly limited by the amount of 
time it takes to invert the A3 address line between the 
second and third dwords. 


3.3 Memory Read Performance belle 
Considerations ae 

System C 

Seven memory systems with different read performance System D 

parameters are examined; write performance is kept System E 

constant during these simulations. The memory param- | System F 

eters are as follows: System G 


Systems A through D represent the performance of. 
some typical page mode memory controllers while the 
performance of systems E through G would require a 
paged-interleaved memory controller. 


The results for the Intel486 DX2 CPU with these mem- 
ory systems is shown in Figure 3.4 through Figure 3.6 
for the SPECI, Pagemaker and Turbo C traces used 
previously. The graphs show the total execution time 
relative to the ideal zero-wait state memory system. 
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Figure 3.4. Total intel486 DX2 CPU Execution Time 
versus Memory Read Performance - for SPEC1 (UNIX) 
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Figure 3.5. Total Intel486 DX2 CPU Execution Time versus 
Memory Read Performance - for Pagemaker (Windows) 
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Figure 3.6. Total intel486 DX2 CPU Execution Time versus 
Memory Read Performance - for Turbo C (DOS) 


The interesting points to note here are: 


3.4 Memory Write Performance 


e As expected, the DOS application suffers the least Considerations 
from slow memory performance. 


e Burst performance is very important. Note the im- 
provement from system A to B, system D to E and 
even from system F to G (where one clock was re- 


moved from the third burst). 


There are several methods of improving the write per- 
formance of the memory system. These methods are 
first described; the benefits of the different methods are 
discussed later. 


¢ Since the read page hit ratio is lower than 50%, The most common method for improving write per- 
improving the read page miss lead-off cycle is more formance is to employ page mode accesses to DRAM. 
important than the read page hit lead-off cycle. Note As shown in Table 3.1, the page hit ratios for write 
the improvement from system B to C versus the im- cycles favor the use of page mode accesses. An example 


provement from system B to D. 


PRELIMINARY 


of a DRAM write cycle is shown in Fig. 3.7. On page 
hits, back-to-back three clock write cycles can be main- 
tained. A page miss write would of course take an addi- 
tional number of clocks to allow for the RAS# pre- 
charge time. 
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Figure 3.7. Page Mode DRAM Allow for Fast Back-to-back Writes 


Memory write performance can be improved further by 
two related methods: write buffering and pipelining. If 
one external write buffer is added to the memory sys- 
tem shown in Fig. 3.7, the write cycles in Fig. 3.8 may 
be observed: 


In the example shown, the one level of write buffering 
allows the first ready signal to be returned one clock 
earlier. The first write cycle finishes in two clocks (zero 
wait states); however, if the CPU puts out many back- 
to-back writes (as is typical), the memory system will 
still be limited to a throughput of three clock writes 
subsequent to the first write cycle. If the CPU write is 
not followed immediately by any bus traffic, the one 
write buffer does relieve the CPU quickly, ee if 
the write was a page miss. 


More than one level of write buffering is sometimes 


employed. This would allow multiple writes to be ac- 
cepted at zero wait states before wait states of the main 


External 
Clock 


ADS# 


BRDY# 


RAS# 


CAS# 


Determine 
page hit/miss; 


memory system affect the CPU bus. To get the maxi- 
mum benefit from multiple write buffering, CPU reads 
that occur when there are more than one writes pend- 
ing in the external write buffers should be allowed to 
bypass the writes and be executed as soon as the exist- 
ing memory write is complete. This is similar or course 
to the internal write buffers of the Intel486 CPU. If the 
read cycle’s address corresponds to an address already 
in the write buffers, the read must wait until the corre- 
sponding write completes so that the read does not 

. fetch stale data from memory. In other words, care 
must be taken to ensure data consistency when using 
external write buffers. 


Write pipelining extends the use of buffering by over- 
lapping the memory controller operations during the 
write cycle. An example is shown in Fig. 3.9 below. The 
data phase of the last write cycle (CAS# pulse) is over- 
lapped in time with the address phase of the next mem- 
ory write cycle (Page hit/miss decoding, etc.). 
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Figure 3.8. Adding One Write Buffer to the Memory System 
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Figure 3.9. Pipelining the Writes to Memory 


With pipelining, it is possible to achieve a throughput 
of many two-clock back-to-back page hit writes. (An 
example of write pipelining can be found in Intel Appli- 
cations Note AP447) Note that pipelining may affect a 
subsequent read cycle; if the CPU read occurs immedi- 
ately after the write, the beginning of the read will be 
delayed until the DRAM write cycle has completed. 
This is especially true for page miss writes where the 
write may take several clocks to complete. Note also 
that pipelined memory write systems can be combined 
with write buffering; this helps for the case where many 
back-to-back page miss writes occur. 


Note that both write buffering and/or pipelining re- 
quire a data path device between the CPU and main 
memory (see Fig. 3.10). This is needed to capture the 
data from the CPU before RDY# or BRDY # is re- 
turned, after which point the data will become invalid. 
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Figure 3.10. A Data Latch is Required 
for Write Buffering or Pipelining 


To understand the performance benefits of the various 
methods described, systems B and F from the earlier 
simulations for read performance are repeated with dif- 
ferent write performance parameters (refer to Table 
3.3). Systems B and F were chosen as typical represen- 
tations of paged and paged-interleaved memory con- 
trollers respectively. 
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The memory systems above were simulated again for the Intel486 DX2 CPU with the three applications. The results 
are shown in Fig. 3.11 through Fig. 3.13 below: 
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Figure 3.11. Total Execution Time versus Write Performance- for SPEC1 (UNIX) 
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Figure 3.13. Total Execution Time versus Write Performance - for Turbo C (DOS) 
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From the results shown, the following significantly im- 


proved CPU performance: 


© Reducing the number of: clocks for page hit writes 
(system B2 to B3 and F2 to F3) 


¢ Reducing the number of clocks for page miss writes | 


(system B3 to B4 and F3 to F4) | 


The following caused marginal improvement in the exe- 
cution time: 


e Adding one level of buffering (from memory sys- 
tems B1 to B2 and F1 to F2) 


e Adding more than two write buffers (from B4 to B5 
to B6 and F4 to F5 to F6) resulting in only a 1-2% 
improvement. 


- These results may be somewhat surprising considering 


the earlier graph showing performance degradation as 


saa Fe 
| ntel ; 

the number of write wait states increases (Fig. 3.7). One 
would expect that adding write buffers would compen- 
sate for the slower memory write system more than 
they do. In order to understand the results, consider the 
statistics in Table 3.4 for processor execution stalls as a 


result of write activity as described in Section 2.5.3. The 
Statistics are shown for the SPEC] trace. - 


Note that the Stalls Because of Reads On Busy Writes 

dominates the increase in execution time for memory 

system F1 compared to the zero wait state case. Adding 

one write buffer (from system F1 to F2) - in an attempt 

to improve performance - decreases the percentage of 
stalls on a full write buffer from 5.3% to 4.5%. Howev- 

er, while the number of Stalls Because of Reads on 

Busy Writes did decrease, the wait states were simply 

transferred to stalls while waiting for the first ready of a- 
read and no net imp rovement is observed. One exam- 

ple of this situation is illustrated in Fig. 3.14. 


Table 3.4. Stall Statistics for the Write Buffers for the SPEC1 (UNIX) Trace 


On Write 
- Buffers Full 


Zero Wait State Case 
Memory System F1 


System F1 plus one write buffer 
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Percentage of Total Execution Time Stalled: 


Because of reads 
on Busy Writes 


Because a 
Read Cannot 
Overtake a 
Write 


Waiting for 
First Ready 


0.9% 
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Figure 3.14. Adding Write Buffers Does Not Improve Stalls Because of Reads on Busy Writes 
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In the example shown, without external memory write — 


buffers, the cache miss read shown stalls for four clock 


cycles while waiting for the write cycle to complete. 


With memory write buffers, the CPU need not wait to 


start the read cycle since the write completed in zero | 


wait states. However, since main rnemory is still occu- 


pied with the original write cycle, the read is still de-' 


layed externally while the write completes. The net ef- 
fect is that the read-stalls because of write traffic does 
not decrease; th e write traffic has simply been trans- 
ferred from the CPU bus to the memory bus where it 
has to contend with the next read cycle. 


Note that there will be instances where the addition of 
external write buffers to a cacheless memory system 
does benefit a sequence of bus cycles. This would be the 
case for applications with very low external read traffic 
and large amounts of write traffic. In this case, the 
write buffers do benefit the heavy write traffic while the 
reads on busy writes will be a lower percentage of total 
stalls. 


3.5 Viability of Intel486 DX2 System 
without an External Cache 


As shown in this section, the CPU performance of a 
cacheless, main-memory-only Intel486 DX2 CPU 
based system will range from good to fair depending on 
the application. The correct cost-performance point 
_will dictate the viability of such a product. For the 
Windows and UNIX applications tested, with a good 
memory design, the Intel486 DX2 CPU will get to 


WRITE 
THROUGH 
CACHE 
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either serial 
or parallel 


Figure 4.1. Different Cache Architectures Discussed 


intel. 


about 120% of the execution time of the ideal zero wait 
state case with the examples shown. The reciprocal of 
total execution time is‘;CPU performance; wh ich works 
out to 83% of maximum in this case. Of course, other 
system design factors will come into play, such as re- 
fresh requirements, other bus master memory require- 
ments, etc. 


More exotic memory architectures may improve the 
performance of the cacheless Intel486 DX2 CPU sys- — 
tem design over what has been discussed here. Howev- 
er, the next section will address the more straightfor- 
ward method of increasing CPU performance further: 
adding an external cache. 


4.0 CACHE DESIGN OPTIMIZATION 


‘An external cache will supplement the on-chip 8K 


cache of the Intel486 DX2 CPU. The requirement for 
an external cache is more important for the Intel486 
DX2 CPU than it was for the Intel486 DX CPU for all 
the reasons discussed in the previous sections. 


Many cache architectures have been implemented with 
the Intel486 DX CPU. Caches differ depending on 
their size, associativity, serial vs. parallel implementa- 
tions, write-through vs. write-back policies, etc. This 
section will focus on optimizing the performance of a 
uniprocessing system, i.e. the CPU is the major con- 
sumer of main memory bandwidth. The architectures 
discussed are shown in Figure 4.1. 
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4.1 Overall Effect of an External 
Cache on CPU Performance 


The same memory systems tested in the previous sec- 
tion are tested again with a 128K 2-way associative 
write-through parallel cache. This will yield the im- 
provement achieved by the decrease in the effective 
number of read and burst wait states. The results are 
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shown in Figures 4.2 through 4.4 for the three applica- 
tions tested earlier. 


The addition of an external write-through cache reduc- 
es the performance degradation caused by main-memo- 
ry wait states for the lead-off cycle of a read and for 
wait states during the remainder of a burst. The impact 
of these wait states was discussed in Section 2.5.3. 
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Figure 4.2. Adding an External Cache Decreases Execution Time - for SPEC1 (UNIX) 
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Figure 4.3. Adding an External Cache Decreases Execution Time - for Pagemaker (Windows) 
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Figure 4.4. Adding an External Cache Decreases Execution Time - for Turbo C (DOS) 


For the UNIX and Windows applications, the addition 
of the external cache improved the execution time by 
15%-35% depending on the memory design. The cache 
used in this case - a 128K two-way set associative cache 
- does an excellent job of buffering the CPU perform- 
ance from the memory system performance. However, 
note that even with the external cache, the execution 
time is still 12%-18% above the zero wait state case for 
these two applications. This is due to the write perform- 
ance of the memory system s ince the cache policy in 
this example is write-through. Further improvement on 
the write performance is investigated later in this sec- 
tion. 


For the DOS application, the addition of the cache 
brings the performance of the Intel486 DX2 within 5% 
_ of the ideal zero wait state case. This is of course due to 
the lower miss rate of the CPU’s internal cache and the 
application’s low bus utilization. _ 
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4.2 Effect of Cache Size and 
Associativity 


A 128K two-way-associative, write-through, parallel, 


_ external cache was used in the previous section. As the 


size and associativity of the cache are varied, the CPU 
performance varies. This is shown in Fig 4.5 for the 
memory systems B and F as used earlier (see Table 4.1). 
Both one-way (direct mapped) and two-way set associa- 
tive caches are tested with the SPEC1 application trace. 


Table 4.1. Memory Systems for the 
Write-Through Cache Test 


Read Read Write 
Page Page Page 
Hit Miss © Hit 
System B | 4-2-2-2 | 8-2-2-2 
System F | 3-1-2-1 | 6-1-2-1 
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Figure 4.5. Total Execution Time for the Intel486 DX2 CPU 
as a Function of Cache Size and Associativity - for SPEC1 


Fig 4.6 illustrates the external cache hit rates for CPU read cycles. The hit rates are directly related to the total 
execution time; higher hit rates result in shorter execution times. 
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Figure 4.6. L2 Read Hit Rate as a Function of Cache Size and Associativity (UNIX) 
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4.3 Improving the Performance ofa 
Write-Through Cache 


With a write-through cache, goocl memory write per- 
formance is necessary to achieve the best possible per- 
formance with the Intel486 DX2 microprocessor. All of 
the methods for improving the write performance for a 
cacheless system, discussed in section 3.4, also apply for 
the write-through cache-based system. 


4.3.1 MEMORY WRITE PIPELINING 


_ The previous results for a write-through cache assumed 
a non-pipelined memory system with a page-hit write 
performance of three clocks and a. page-miss perform- 
ance of six. The most effective method of increasing the 
memory write performance further is the use of memo- 
ry write pipelining. The write performance can be im- 
proved so that continuous back-to-back page-hit write 
cycles can complete in zero wait-states. Pipelining can 
also reduce the number clocks required for a page-miss 
write cycle. As the write performance improves using 
this technique, the write-through cache system can 
come close to that of the ideal zero wait state system. 
These results are shown in Section 4.3.3 to follow. 
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4.3.2. EXTERNAL WRITE BUFFERS 


Adding one or more write buffers to a external write- 
through,cache-based system improves performance by 
a larger amount compared to the cacheless case. Figure 
4.7 illustrates why. 


The addition of external write buffers allows the memo- 
ry write cycle to be “hidden” from the CPU bus if the 
next CPU cycle happens to be a external cache hit read. 
And since the external cache read hit ratio is high, most 
of the delays which were present in a cacheless system 
under these circumstances are removed. In essence, the 
on-chip cache/write-buffers have been duplicated exter- 
nally to provide a multiple level architecture (see Fig. 
4.8). 
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Figure 4.7. Adding External Write Buffers to an External Cache Reduces Execution Time 
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Figure 4.8. A Hierarchy of Caches and Write Buffers 


4.3.3 PERFORMANCE WITH AN EXTERNAL Fig. 4.9 shows the results of the memory systems tested 
WRITE-THROUGH CACHE with a 128K, two-way associative, write-through, paral- 
| | - lel cache and the Intel486 DX2 CPU using the SPEC1 

To quantify the benefits of improving the write per- application trace. 


formance, the systems in Table 4.2 were tested. 


Table 4.2. Memory Systems Used 
for Write-Through Cache Test 
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Figure 4.9. Improving the Write Performance Benefits a Write Through Cache - for SPEC1 
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As the write performance increases, the CPU perform- 
ance approaches that of the zero wait state case. The 
improvement from systems B1 to B2 and from F1 to F2 
illustrate the benefit of write buffering with an external 
cache. The improvement from systems B2 to B3 and 
from F2 to F3 reflect the benefit of memory write pipe- 
lining. Finally, the improvement from systems B3 to B4 
and from F3 to F4 show how reducing the page-miss 
write performance also increases performance. 


4.4 Write-Back Caches 


If correctly implemented, a write-back external cache 
can provide good performance for a uniprocessing In- 
tel486 DX2 CPU based system. Serial write-back 
caches have typically been used to reduce bus utiliza- 
tion for multiprocessing systems. The design complexi- 
ty of a write-back cache controller is typically an order 
of magnitude higher than for a write-through cache 
controller. However, correct implementation is abso- 
‘lutely necessary if significant performance gains are to 
be realized with the Intel486 DX2 CPU. 


A write-back cache is different from a write-through 
cache in that it allows cache write hits to modify the 
cache line without updating main memory. The cache 
has tags that include a bit called the modified (dirty) 
bit. This bit is set if the cache location has been written 
with new information and therefore contains informa- 
tion that is more recent than the corresponding infor- 
mation in main memory. If a subsequent read miss oc- 
curs and the line being fetched needs to fill the cache 
location that is currently being occupied by the modi- 
fied line, the cache controller must then write the modi- 
fied cache line back to main memory; hence coherency 
is maintained. 


If a CPU write is not a cache hit, the cache controller 
has the option of allowing the write to propagate 
through to memory or to fetch the cache line from 
memory to be merged with the new write data. The 
cache line fill in the second option is called a write-allo- 
cation. In the following discussions, it is assumed that 
no write-allocations are being performed. 


4.4.1 MAIN MEMORY CONTROLLER 
CONSIDERATIONS 


The addition of an external write-back cache changes 
the characteristics of the main memory bus traffic. 
Since the cache effectively filters all CPU requests, the 
cycles that do propagate to main memory tend to be 
more distributed in their locations. This decrease in 
temporal and spatial locality will reduce the DRAM 
page hit rate as shown in Table 4.3 for a 128K, two-way 
associative, write-back cache with the SPECI applica- 
tion trace. Compare these results to the prior results in 
Table 2.1 for a cacheless system. 
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Table 4.3. Page Hit Ratios for a 
Write-Back Cache - for SPEC1 


Reads: PageHits | 17.1% | 25.6%| 24.7% 
Page Misses | 13.8% | 13.9% | 12.9% 
55.8% | 40.8% 


4.7% | 21.6% 


Writes: Page Hits 58.9% 
Page Misses | 10.2% 
Therefore, it is less beneficial with a write-back cache to 


implement a page-mode main memory controller. 


Of course, page mode DRAM accesses within the burst 
cycle are still important to retrieve the four words of a 


_ cache line quickly. This is also true for the write-back 


cycle where four dwords of the cache line must be writ- 
ten to memory. Memory controllers should be designed 
to support a burst write cycle instead of having to write 
each dword separately. 


4.4.2 WRITE-BACK CYCLE 


The write-back cycle is the sequence where a cache line 
fill from main memory has to displace a modified line 
that was already in the cache. The method in which the 
modified line is written back to main memory has an 
impact on overall CPU performance. Before analyzing 
the write-back cycle, consider first the architectures 
shown in Fig. 4.10. 


In the simplest implementation, a write-back cache will 
share the data bus with the CPU and main memory as 
shown in configuration X. If this is the case, then dur- 
ing a write-back cycle, the modified line must be writ- 
ten back to main memory before the cache linefill can 
commence. This has a detrimental effect on perform- 
ance since the CPU must wait while the write-back oc- 
curs. This sequence is shown in Figure 4.11. 


With a data path device between the CPU-Cache bus 
and main memory as shown in configuration Y, the 
cache controller is able to defer the write-back of the 
modified data till after the linefill has completed. The 
CPU can continue execution after the linefill as long as 
subsequent cycles are all cache hits. 


In configuration Z, a wider cache bus exists between 
the SRAM and the data path devices. This allows the 
modified data to be transferred more quickly from the 
SRAM to the data path device, thereby allowing the 
cache linefill to commence even sooner. 
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Figure 4.10. Different Architectures will Effect CPU Performance with a Write-Back Cache 
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Figure 4.11. Different Implementations of the Write-Back Cycle 
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DX2 microprocessor performance with different cache 
sizes- and associativities. 
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The results are shown in Fig. 4.12 for the Intel486 DX2 
CPU running the SPEC! trace. 


Table 4.4. Memory Systems used for Write-Back Cache Test 
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Figure 4.12. Intel486™ DX2 CPU Total Execution Time with Different Cache 
Size, Associativity, Memory Speed and Write-Back Method - for SPEC1 


The addition of a write-back cache does an excellent job 
of decoupling the CPU performance from the main 
memory performance as shown with memory systems 
A and B. However, note that memory system B (with 
the delayed line fill) performs poorly - even worse than 
a good write-through cache - unless a significant 
amount of cache memory is added to reduce the miss 
rate. 


5.0 CONCLUSION 


This document has shown that good memory perform- 
ance is especially important for the Intel486 DX2 mi- 
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croprocessor. Business workstation designs will require 
excellent CPU performance and will consequently have 
to incorporate well-designed, high-performance cache 
and memory systems. 


In optimizing memory performance, an external cache 
is essential for hiding slow main memory access times. . 
Write-through external caches offer good performance 
if coupled with good memory write performance. 
Write-back external caches can also offer excellent per- 
formance if designed correctly. Parallel write-back 
caches that cannot defer the write-back cycle till after a 
cache line fill will perform worse than a good write- 
through cache design. 
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1.0 INTRODUCTION 


As the Intel Architecture evolves, with the addition of 
new generations and models of processors (8086, 8088, 
Intel 286, Intel386™, Intel486TM, and Pentium™ 
processors), it is essential that Intel provides an increas- 
ingly sophisticated means with which software can 
_identify the features available on each processor. This 
identification mechanism has evolved in conjunction 
with the Intel Architecture as follows: 


e Originally, Intel published cocle sequences that 
could detect minor implementation differences to 
identify processor generations. 


@ Later, with the advent of the Intel386 processor, 
Intel implemented processor signature identifica- 
tion, which provided the processor family, model, 
and stepping numbers to software at reset. 


e As the Intel Architecture evolved, Intel extended 
the processor signature identification into the 
CPUID instruction. The CPUID instruction not 
only provides the processor signature, but also pro- 
vides information about the features supported by 
and implemented on the Intel processor. 


- The evolution of processor identification was necessary 


because, as the Intel Architecture proliferates, the com- 
puting market must be able to tune processor function- 
ality across processor generations anc models that have 
differing sets of features. Anticipating that this trend 
will continue with future processor generations, the In- 


tel Architecture implementation of the CPUID instruc- 


tion is extensible. 


This Application Note explains how to use the CPUID 
instruction in software applications, BIOS implementa- 
tions, and tools. By taking advantage of the CPUID 
instruction, software developers can create software ap- 
plications and tools that can execute compatibly across 
the widest range of Intel processor generations and 
models, past, present, and future. 
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New Intel processor signature and feature bits informa- 
tion can be obtained from the user’s manual, program- 
mer’s reference manual or appropriate documentation 
for a processor. In addition, Intel can provide you with 
updated versions of the programming examples includ- 
ed in this application note; contact your Intel represent- 


1.1 Update Support 


tive for more information. 


2.0 DETECTING THE CPUID 
INSTRUCTION 


Intel has provided a straightforward method for detect- 
ing whether the CPUID instruction is available. This 
method uses the ID flag in bit 21 of the EFLAGS regis- 
ter. If software can change the value of this flag, the 
CPUID instruction is available. The program examples 
at the end of this Application Note show how to use the 
PUSHFD instruction to change the value of the ID 
flag. . 


3.0 OUTPUTS OF THE CPUID 
INSTRUCTION | 


Figure 1 summarizes the outputs of the CPUID in- 
struction. 


The CPUID instruction can be executed multiple 
times, each time with a different parameter value in the 
EAX register. The output depends on the value in the 
EAX register, as specified in Table 1. To determine the 
highest acceptable value in the EAX register, the pro- 
gram should set the EAX register parameter value to 0. 
In this case, the CPUID instruction returns the highest 
value that can be recognized in the EAX register. 
CPUID instruction execution should always use a pa- 
rameter value that is less than or equal to this highest 
returned value. Currently, the highest value recognized 
by the CPUID instruction is 1. Future processors 
might recognize higher values. 


intel. 


The processor type, specified in bits 12 and 13, indicate 
whether the processor is an original OEM processor, an 
OverDrive™ processor, or is a dual processor (capable 
of being used in a dual processor system). Table 2 
shows the processor type values that can be returned in 
bits 12 and 13 of the EAX register. 


3.1 Vendor-ID String 


If the EAX register contains a value of 0, the vendor 
identification string is returned in the EBX, EDX, and 
ECX registers. These registers contain the ASCII string 
GenuinelIntel. 


OUTPUT IF EAX =0 


34 


0 
INTEGER 


HIGH VALUE 


VENDOR ID 


OUTPUT IF EAX =1 


PROCESSOR 
SIGNATURE 
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While any imitator of the Intel Architecture can pro- 
vide the CPUID instruction, no imitator can legitimate- 
ly claim that its part is a genuine Intel part. Therefore, 
the presence of the GenuineIntel string is an assur- 
ance that the CPUID instruction and the processor sig- 
nature are implemented as described in this document. 


PROCESSOR TYPE 


31 
FEATURE FLAGS 


FAMILY 
MODEL 
STEPPING 


0 
EDX* | BIT ARRAY (Refer to Table 5) 


*EBX and ECX are Intel reserved. Do not use. 
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Figure 1. CPUID Instruction Outputs 
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Table 1. Effects of EAX Contents on CPUID Instruction Output 


AP-485 
Parameter 
EAX = 0 
EAX = 1 


1 < EAX < highest value 
EAX > highest value 


Outputs of CPUID 


EAX <— Highest value recognized 

EBX:EDX:ECX <— Vendor identification string = 
EAX <— Processor signature 

EDX <— Feature flags 

EBX:ECX <— Intel reserved (Do not use.) 

Currently undefined 

EAX:EBX:ECX:EDX <— Undefined (Do not use.) 


Table 2. Processor Type 


Bit Position ; Value 
13,12 00 


ee 
a 


NOTE: 
_ 1. Not applicable to Intel 386 and Intel486 processors. 


3.2 Processor Signature 


Beginning with the Intel386 processor family, the 
processor signature has been available at reset. With 
processors that implement the CPUID instruction, the 
processor signature is available both upon reset and 
upon execution of the CPUID instruction. Figure 1 
shows the format of the signature for the Intel486 and 
Pentium processor families. Table 3 shows the values 
that are currently defined. (The high-order 18 bits are 
undefined and reserved.) 
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Description 
; Original OEM Processor | 
OverDrive™ Processor 


Dual Processor(1) 


Intel reserved (Do not use.) 


Q 
[ae 
oO 
= 
< 
@ 
=} 
2 
3° 
=] 
i?) 
ie) 
= 
p— 
3 
co 
2, 
> 
oo 
ON 
MN 
m 
‘ave! 
0 
et 
o 
— 
_> 
oo 
nN 
) 
va 
S&S 
jo] 
Qu 


> 


IntelDX2 processors do not support the CPUID in- 


struction. Therefore, the processor signature is only 


available upon reset for these processors. Refer to the 
programming examples at the end of this Application 
Note to determine which processors support the 
CPUID instruction. 


On Intel386 processors, the format of the processor sig- 
nature is somewhat different, as Figure 2 shows. Table 
4 gives the current values. 


‘ 
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Table 3. Intel486™ and Pentium™ Processor Signatures | | 


1 a a 
[c100 | eeoana0oGr| ox [ranean Pecan 
ee ee 
el ot on a 
Caieo-| cot | — oor exe and inl Dk2 Over Rocomrs 
Tasco | oer | wom | mate recenon 
raico | ein | — ow | Wit Back anced iE Reames 
Fat00 | 1000 | — ea [ san Ks Overeve Prnesery 
|o101 | 0001 ~—s | ~— x0 ‘| Pentium™ Processors (510\60, 567\66) 

}oto1 | 0010S | xm Pentium Processors (735\90, 815\100) 

che ao 


0101 0101 Reserved for Pentium OverDrive Processor for IntelDX4 
Processor | 

0101 0010 Reserved for Pentium OverDrive Processor for Pentium Processor 
(510\60, 567\66) 

0101 0100 Reserved for Pentium OverDrive Processor for Pentium Processor 
(735\90, 815\100) 


NOTES: 
1. Intel releases information about stepping numbers as needed. 
2. This processor does not implement the CPUID instruction. 


RESET ——————> EDX 


MAJOR STEPPING 
MINOR STEPPING 
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Figure 2. Processor Signature Format on Intel386™ Processors 


Table 4. Intel386™ Processor Signatures 


Famil malor uee | Description 
y Stepping ~ Stepping() P 


a ae Intel386™ DX Processor 
om Intel386 SX Processor 
0010 0011 0000 | xox Intel386 CX Processor 


intel386 EX Processor 
Intel3886 SL Processor 
RAPIDCAD™ Processor 


NOTE: | 
1. Intel releases information about minor stepping numbers as needed. 
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3.3 Feature Flags 


When a value of 1 is placed in the EAX register, the 
CPUID instruction loads the EDX register with the 
feature flags. The feature flags indicate which features 
- the processor supports. A value of 1 in a feature flag 


can indicate that a feature is either supported or not | 


supported, depending on the implementation of the 
CPUID instruction for a specific processor. Table 5 
lists the currently defined feature flag values. For 


z ; 
intel : 
future processors, refer to the programmer’s reference 


manual, user’s manual, or the appropriate documenta- 
tion for the latest feature flag values. 


Developers should use the feature flags in applications 
to determine which processor features are supported. © 
By using the CPUID feature flags to predetermine 
processor features, software can detect and avoid in- 
compatibilities that could result if the features are not 
present. 


Table 5. Feature Flag Values 


| Name Description When Fiag = 1 


= ery Floating-Point Unit On-Chip 
Virtual Mode Extension 


af 
Page Size Extension 
rao [| 


Machine Check 


APIC | On-Chip APIC 
40-311) ae 


NOTE: 


The processor contains an FPU that supports the 
Intel387 floating-point instruction set. 


Exception 18 is defined for Pentium processor style 
machine checks, including CR4.MCE for controlling the 
feature. This feature does not define the model-specific 
implementation of the machine-check error logging _ 


reporting and processor shutdowns. Machine-check 
exception handlers may have to depend on processor | 
version to do model-specific processing of the 
exception or test for the presence of the standard 
machine-check feature. 


The 8-byte (64-bit) compare and exchange instructions 
is supported (implicitly locked and atomic). 


Indicates that an integrated APIC is present and 
hardware enabled. (Software disabling does not affect 
this bit.) 


1. Some non-essential information regarding Intel486 and Pentium processors is considered Intel confidential and proprie- 
‘tary and is not documented in this publication. This information is provided in the Supplement to the Pentium™ Proces- 
sor User’s Manual and is available with the appropriate non-disclosure agreements in place. Contact Intel Corporation for 


details. 
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4.0 USAGE GUIDELINES 


This document presents Intel-recommended feature- 
detection methods. Software should not try to identify 
features by exploiting programming tricks, undocu- 
mented features, or otherwise deviating from the guide- 
lines presented in this Application Note. The following 
is a list of guidelines that can help programmers main- 
tain the widest range of compatibility for their software. 


® Do not depend on the absence of an invalid opcode 
trap on the CPUID opcode to detect CPUID. Do 
not depend on the absence of an invalid opcode trap 
on the PUSHFD opcode to detect a 32-bit proces- 
sor. Test the ID flag, as described in Section 2.0 and 
shown in Section 6.0. 


® Do not assume that a given family or model has any 


specific feature. For example, do not assume that, 


because the family value is 5 (Pentium processor), | 


there must be a floating-point unit on-chip. Use the 
feature flags for this determination. 


¢ Do not assume that the features in the OverDrive 
processors are the same as those in the OEM version 
of the processor. Internal caches and instruction ex- 
ecution might vary. 


@ Do not use undocumented features of a processor to 
identify steppings or features. For example, the In- 


tel386 processor A-step had bit instructions that — 


were withdrawn with B-step. Some software at- 
tempted to execute these instructions and depended 
on the invalid-opcode exception as a signal that it 
was not running on the A-step part. This software 
failed to word correctly when the Intel486 processor 
used the same opcodes for different instructions. 
That software should have used the stepping infor- 
mation in the processor signature. 


¢ Do not assume that a value of 1 in a feature flag ind 
icates that a given feature is present, even though 
that is the case in the first models of the Pentium 
processor in which the CPUID instruction is imple- 
mented. For some feature flags that might be de- 
fined in the future, a value of 1 can indicate that the 
corresponding feature is not present. 


¢ Programmers should test feature flags individually 
and not make assumptions about undefined bits. It 
would be a mistake, for example, to test the FPU bit 
by comparing the feature register to a binary 1 with 
a compare instruction. 
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@ Do not assume that the clock of a given family or 
model runs at a specific frequency and do not write 
clock-dependent code, such as timing loops. For in- 
stance, an OverDrive Processor could operate at a 
higher internal frequency and still report the same 
family and/or model. Instead, use the system’s tim- | 
ers to measure elapsed time. 


¢ Processor model-specific registers may differ among 
processors, including in various models of the Penti- 
um processor. Do not use these registers unless iden- 
tified for the installed processor. 


5.0 BIOS RECOGNITION FOR INTEL 
OVERDRIVET™ PROCESSORS ~ 


A system’s BIOS will typically identify the processor in 
the system and initialize the hardware accordingly. In 
many cases, the BIOS identifies the processor by read- 
ing the processor signature, comparing it to known sig- 
natures, and, upon finding a match, executing the cor- 
responding hardware initialization code. 


The Pentium OverDrive processor is designed to be an 
upgrade to any Intel486 family processor. Because 
there are significant operational differences between. 
these two processor families, processor misidentifica- 
tion can cause system failures or diminished perform- 
ance. Major differences between the Intel486 processor 
and the Pentium OverDrive processor include the type 
of on-chip cache supported (write-back or write- 
through), cache organization and cache size. The Over- 
Drive processor also has an enhanced floating point 
unit and System Management Mode (SMM) that may 
not exist in the OEM processor. Inability to recognize 
these features causes problems like those described be- 
low. | 


In many BIOS implementations, the BIOS reads the 
processor signature at reset and compares it to known 
values. If the OverDrive processor’s signature is not 
among the known values, a match will not occur and 
the OverDrive processor will not be identified. Often 
the BIOS will drop out of the search and initialize the 
hardware based on a default case such as initializing the 
chipset for an Intel486 SX processor. Following are two 


- common examples of system failures and how to avoid 


them. 
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Example 1 


If (for the Pentium OverDrive processor) the system’s 


hardware is configured to enable the write-back cache. 


but the BIOS fails to detect the Pentium OverDrive 
processor signature, the BIOS may incorrectly cause 


the chipset to support a write-through processor cache. . 


This results in a data incoherency problem with the bus 
masters. When a bus master accesses a memory loca- 
_ tion (which was also in the processor’s cache in a modi- 
fied state), the processor will alert the chipset to allow 
it to update this data in memory. But the chipset is not 
programmed for such an event and the bus master in- 
stead receives stale data. This usually results in a sys- 
tem failure. | : 


; Example 2 


If the BIOS does not recognize the OverDrive proces- 
sor’s signature and defaults to an Intel486 SX proces- 
sor, the BIOS can incorrectly program the chipset to 
ignore, or improperly route, the assertion of the floating 
- point error signaled by the processor. The result is that 
floating point errors will be improperly handled by the 
Pentium OverDrive processor. The BIOS may also 
completely disable math exception handling in the 
OverDrive processor. This can cause installation errors 
in applications that require hardware support for float- 
ing point instructions. 
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Hence, when programming or modifying a BIOS, be 
aware of the impact of future OverDrive processors. 
Intel recommends that you include processor signa- 
tures for the OverDrive processors in BIOS identifica- 
tion routines to eliminate diminished performance or 
system failures. The recommendations in this applica- 
tion note can help a BIOS maintain compatibility 
across a wide range of processor generations and mod- 
els. 


6.0 PROPER IDENTIFICATION 
SEQUENCE 


The cpuid3Sa.asm program example demonstrates 
the correct use of the CPUID instruction. (See Exam- 
ple 1.) It also shows how to identify earlier processor 
generations that do not implement the processor signa- 
ture or CPUID instruction. This program example con- 
tains the following two procedures: 


® get_cpu_type identifies the processor type. Fig- 
ure 3 illustrates the flow of this procedure. 


e get_fpu type determines the type of floating- 
point unit (FPU) or math coprocessor (MCP). 


This procedure has been tested with 8086, 80286, 
Intel386, Intel486, and Pentium processors. This pro- 
gram example is written in assembly language and is 
suitable for inclusion in a run-time library, or as system 
calls in operating systems. 
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Yes 
cpu_type=0 


is the 

CPUID 

instruction 

supported 
9 


cpuid_fiag = 1; indicates 
CPUID instruction present. 

Execute CPUID with input of 0 
to get vendor ID string and 

input values for EAX. 


If highest input value is at least 1, 
execute CPUID with input of 1 in 
EAX to obtain model, stepping, 
family, and features. 

Save in cpu_type, stepping, 
model, and feature_flags. 


Does the 

vendor ID = 

“Genuinelntel” 
? 


end_get_cpu_type 
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Figure 3. Flow of Processor get_.cpu__type Procedure 
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7.0 USAGE PROGRAM EXAMPLE 


The cpuid3b.asm and cpuid3b.c program exam- 
ples demonstrate applications that call get cpu_ 
type and get. fpu_type procedures and interpret 
the returned information. The results, which are dis- 
played on the monitor, identify the installed processor 
and features. The cpuid3b.asm example is written 


get_cpu_type* 


get_fpu_type 


o 
¢ 
e 
é 
e 
e 
’ 
¢ 
¢ 
e 
e 
s 
¢ 
‘ 
¢ 
s 
e 
e 
’ 
’ 
s 
‘ 
’ 
‘ 
? 
a 
e 
‘ 
’ 
ra 
6 
, 
¢ 
e 
e 
6 
e 
¢ 
e 
’ 
é 
¢ 
‘ 
¢ 
é 
¢ 
é 
¢ 
é 
ul 
o 
, 
‘ 
é 
¢ 
4 
e 
e 
é 
¢ 
é 
’ 
‘ 


*See Figure 3. 


Part of ‘ 
cpuid3a.asm ; 


intel. 


‘in assembly language and demonstrates an application 


that displays the returned information in the DOS envi- 
ronment. The cpuid3b.c example is written in the C 
language. (See Examples 2 and 3.) 


Figure 4 presents an overview of the relationship be- 
tween the three program examples. 


Part of 
cpuid3b.c and 
cpuid3b.asm 
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Figure 4. Flow of Processor identification Extraction Procedures 


2-452 


| ntl . : AP-485 


Example 1. Processor Identification Extraction Procedure 


; Filename: cpuid3a.asm 
j Copyright 1993, 1994 by Intel Corp. 


; This program has been developed by Intel Corporation. You 
have Intel's permission to incorporate this source code into 

: your product, royalty free. Intel has intellectual property 
rights which it may assert if another manufacturer's processor 
mis-identifies itself as being "GenuineIntel" when the CPUID 

; instruction is executed. - 


Intel specifically disclaims all warranties, express or 

: implied, and all liability, including consequential and other 
: indirect damages, for the use of this code, including 
liability for infringement of any proprietary rights, and 
including the warranties of merchantability and fitness for a 
; particular purpose. Intel does not assume any responsibility 
for any errors which may appear in this code nor any 

; responsibility to update it. 


This code contains two procedures: 

; _get_cpu_type: Identifies processor type in _cpu_type: 
0=8086/8088 processor 

2=Intel 286 processor 

3=Intel386(TM) family processor 

: 4=Intel486(T™) family processor 

; 5=Pentium(T) family processor 


; _get_fpu_type: Identifies FPU type in _fpu_type: 
; O=FPU not present 

: 1=FPU present 

2=287 present (only if _cpu_type=s3) 
3=387 present (only if _cpu_type=3) 


; This program has been tested with the MASM assembler. 

; This code correctly detects the current Intel 8086/8088, 
; 80286, 80386, 80486, and Pentium(tm) processors in the © 
; real-address mode. 


; To assemble this code with TASM, add the JUMPS directive. 
; jumps ; Uncomment this line for TASM 


TITLE cpuid3a.asm 
DOSSEG 
.model small 


CPU_ID MACRO 
241618-5 
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ara intel. 


db Ofh ; Hardcoded CPUID instruction 
db Oa2h 7 | 
ENDM 
.data 
public -_cpu_type 
public _fpu_type 
public _cpuid_flag 
public _intel_CPU 
public _vendor_id 
public _cpu_signature 
public _features_ecx 
public _features_edx 
public _features_ebx 
_cpu_type db 0 
_fpu_type db 0 
. _cpuid_flag db 0 
_intel_CPU db 0 
_vendor_id db BE mm ae ee " 
intel _id db "GenuineiIntel" 
_cpu_Signature dd 0 
_features_ecx dd 0 
_features_edx ad 0 
_features_ebx dd 0. 
fp_status dw 0 | 
.code 
.8086 


. Kk K RK KKK KKK KK KK KK KKK KKK KKK KKK KKK K KKK KKK KKK KKK KKK KKK KKK KK KES 
Uy - re 2 


public _get_cpu_typ 
_get_cpu_type proc ° 


This procedure determines the type of processor in a system 
and sets the _cpu_type variable with the appropriate 

3 value. If the CPUID instruction is available, it is used 

to determine more specific details about the processor. 

All registers are used by this procedure, none are preserved. 
To avoid AC faults, the AM bit in CRO must not be set. 


Intel 8086 processor check 
Bits 12-15 of the FLAGS register are always set on the 
8086 processor. 


check_8086: 
pushf ; push original FLAGS 
pop ax ; get original FLAGS 
mov cx, ax ; save original FLAGS 


and ax, Offfh ; clear bits 12-15 in FLAGS 
241618-6 
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push ax ; save new FLAGS value on stack 
popf ; replace current FLAGS value 
pushf ; get new FLAGS 

pop ax ; store new FLAGS in AX 

and ax, Of000h ; if bits 12-15 are set, then 
cmp ax, Of000h ; . processor is an 8086/8088 
mov _cpu_type, 0 ; turn on 8086/8088 flag 

je end_cpu_type ; Jump if processor is 8086/8088 


; Intel 286 processor check 
; Bits 12-15 of the FLAGS register are always 
; Intel 286 processor in real-address mode. 


clear on the 


.286 
check_80286: 
or ex, O0£f000h ; try to set bits 12-15 
push cx ; save new FLAGS value on stack 
popf ; replace current FLAGS value 
pushf ; get new FLAGS 
pop ax ; store new FLAGS in AX 
and ax, Of000h ; if bits 12-15 are clear 
mov _cpu_type, 2 ; processor=80286, turn on 80286 flag 
jz end_cpu_type ; 1f£ no bits set, processor is 80286 


Intel386 processor check 

; The AC bit, bit #18, is a new bit introduced in the EFLAGS 
: register on the Intel486 processor to generate alignment 

ci faults. 

; This bit cannot be set on the Intel386 processor. 
.386 ; it is safe to use 386 instructions 


check_80386: 


pushfd j; push original EFLAGS 

pop eax 3 get original EFLAGS 

mov ecx, eax ; save original EFLAGS 

xor eax, 40000h ; flip AC bit in EFLAGS 

push eax ; save new EFLAGS value on stack 
popfd ; replace current EFLAGS value 
pushfd ; get new EFLAGS 

pop eax ; store new EFLAGS in EAX 

xor eax, ecx ; can't toggle AC bit, processor=80386 
mov _cpu_type, 3 ; turn on 80386 processor flag 
jz end_cpu_type ; jump if 80386 processor 

push ecx 

popfd ; restore AC bit in EFLAGS first 


Intel486 processor check 


: Checking for ability to set/clear ID flag (Bit 21) in EFLAGS 
; which indicates the presence of a processor with the CPUID 
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: instruction. 
, .486 
‘check_80486: | | 
| mov _cpu_type, 4 ; turn on 80486 processor flag 

mov eax, ecx | ; get original EFLAGS 
xor eax, 200000h ; flip ID bit in EFLAGS 
push eax _ j save new EFLAGS value on stack 
popfd ; replace current EFLAGS value 
pushfd ; get new EFLAGS 
pop eax ; store new EFLAGS in EAX 
xor eax, eCx ; can't toggle ID bit, 


je end_cpu_type ; processor=80486 


: Execute CPUID instruction to determine vendor, family, 
; model, stepping and features. For the purpose of this 


; code, only the initial set of CPUID information is saved. 
mov. _cpuid_flag, 1 ; flag indicating use of CPUID inst. 
push ebx | ; save registers 
push esi 
push edi 
mov eax, 0 ; set up for CPUID instruction 
CPU_ID ; get and save vendor ID 
mov dword ptr _vendor_id, ebx 
mov dword ptr _vendor_id[+4], edx 
mov dword ptr _vendor_id[+8], ecx 
Mov si, ds 
mov es, si 
mov si, offset _vendor_id 
mov di, offset intel_id 
mov ex, 12 ; should be length intel_id 
_cld 3 set direction flag | 
repe cmpsb ; compare vendor ID to "GenuineIntel" 
jne end_cpuid_type ; if not equal, not an Intel processor 
mov _intel_CPU, 1 ; indicate an Intel processor 
cmp . eax, 1 ; make sure 1 is valid input for CPUID 
gp. end_cpuid_type ; if not, jump to end 
mov eax, 1 pa 
CPU_ID ; get family/model/stepping/features 
mov _cpu_signature, eax 4 | 
mov _features_ebx, ebx 
Mov _features_edx, edx 
mov _features_ecx, ecx 
shr eax, 8 ; lsolate family 
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and eax, Ofh 
mov _cpu_type, al ; set _cpu_type with family 


end_cpuid_type: 


pop edi ; restore registers 
pop esi 
pop — ebx 
.8086 
end_cpu_type: 
ret 


_get_cpu_type endp 


a KRKKKKEKEKKKKKEKKEKEKKEKKEKKKKRKKRKRKRKKRKEKRKEKKRKAKK KK KKK KKK KKK KK KKK KK KKKKRKRKR KEK KK KEK 
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public _get_fpu_type 
_get_fpu_type proc 


; This procedure determines the type of FPU in a system 
and sets the _fpu_type variable with the appropriate value. 
: All registers are used by this procedure, none are preserved. 


; Coprocessor check 

The algorithm is to determine whether the floating-point 

; status and control words are present. If not, no 

; coprocessor exists. If the status and control words can 

be saved, the correct coprocessor is then determined 

; depending on the processor type. The Intel386 processor can 
work with either an Intel287 NDP or an Intel387 NDP. 

The infinity of the coprocessor must be checked to determine 
; the correct coprocessor type. 


fninit ; reset FP status word 

mov fp_status, 5ad5ah; initialize temp word to non-zero 
fnstsw fp_status ; save FP status word 

mov ax, fp_status ; check FP status word 

cmp al, 0 ; was correct status written 

mov _fpu_type, 0 ; no FPU present 

jne end_fpu_type 


check_control_word: 


fnstcw fp_status ; save FP control word 

mov ax, fp_status ; check FP control word 

and ax, 103fh ; selected parts to examine 

cmp ax, 3fh 7 ; was control word correct 

mov _fpu_type, 0 

jne end_fpu_type ; incorrect control word, no FPU 
Mov _fpu_type, 1 


; 80287/80387 check for the Intel386 processor 
- 241618-9 
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check_infinity: 


cmp 
jne 
fidl 
fldz 
fdiv 
fld 
fchs 
fcompp 
fstsw 
mov 
mov 
sahf 
jz 
mov 


end_fpu_type: 


ret 


_get_fpu_type 
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end 


_cpu_type, 3 
end_fpu_type 


fp_status 
ax, fp _status 
_fpu_type, 2 


end_fpu_type 
_fpu_type, 3 


endp 


> must use default control from FNINIT 


form infinity 

8087/Intel287 NDP say +inf = -inf 
form negative infinity 

Intel387 NDP says +tinf <> -inf 
see if they are the same 

look at status from FCOMPP 


store Intel287 NDP for FPU type 
see if infinities matched 

jump if 8087 or Intel287 is present 
store Intel387 NDP for FPU type 


241618-10 


AP-485 


Example 2. Processor Identification Procedure in Assembly Language 


Filename: cpuid3b.asm 
Copyright 1993, 1994 by Intel Corp. 


This program has been developed by Intel Corporation. You 
have Intel's permission to incorporate this source code into 
your product, royalty free. Intel has intellectual property 
rights which it may assert if another manufacturer's processor 
mis-identifies itself as being "GenuineIntel" when the CPUID 
instruction is executed. 


Intel specifically disclaims all warranties, express or 
implied, and all liability, including consequential and other 
indirect damages, for the use of this code, including | 
liability for infringement of any proprietary rights, and 
including the warranties of merchantability and fitness fora 
particular purpose. Intel does not assume any responsibility 
for any errors which may appear in this code nor any 
responsibility to update it. 


This program contains three parts: 
Part 1: Identifies processor type in the variable _cpu_type: 


Part 2: Identifies FPU type in the variable _fpu_type: 


Part 3: Prints out the appropriate message. This part is 
specific to the DOS environment and uses the DOS 
system calls to print out the messages. 


This program has been tested with the MASM assembler. 

If this code is assembled with no options specified and linked 
with the cpuid3a.asm module, it correctly identifies the | 
current Intel 8086/8088, 80286, 80386, 80486, and Pentium(tm) 
processors in the real-address mode. 


To assemble this code with TASM, add the JUMPS directive. 
jumps ; Uncomment this line for TASM 


TTE cpuid3b.asm 
DOSSEG 

.model small 
.stack 100h 


.data 

extrn _cpu_type: byte 
extrn _fpu_type: byte 
extrn _cpuid_flag: byte 
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extrn _intel_CPU: byte 
extrn _vendor_id: byte 
extrn _cpu_signature: dword 
extrn _features_ecx: dword 
extrn _features_edx: dword 
extrn _features_ebx: dword 


; The purpose of this code is to identify the processor and 
coprocessor that is currently in the system. The program 

; first determines the processor type. Then it determines 
whether a coprocessor exists in the system. Ifa 
coprocessor or integrated coprocessor exists, the program > 

; identifies the coprocessor type. The program then prints 
the processor and floating point processors present and type. 


.code 
.8086 . 
start: mov ax, @data 

. mov ds, ax ; set segment register 
mov es, ax ; set segment register 
and sp, not 3 ; align stack to avoid AC fault 
call _get_cpu_type ; determine processor type 
call _get_fpu_type 
call print 
Mov ax, 4c00h ; terminate program 
int 21h 


° KAEKKKKKKKREKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KR KEK 
’ 


extrn _get_cpu_type: proc 


—KaKKKKKKKKKKKKRKKRKKKKKKRKKK KK KKKRKKRKR KKK KKKKRKRKK KKK KKK KKKKKKRKRKKKKKKRKRKRKEK 
, : 


extrn _get_fpu_type: proc 


KEKE KKEKEKKEKEKKEKEEKKKEKKKEKKKRKRKRKEKKKEKKRKKKKKKKRKKRKRKKKK KK KK KKK KKK KKK KRKKRKKRKEK 
4 ; 


FPU_FLAG equ 0001h 

VME_FLAG | equ 0002h 

PSE_FLAG equ 0008h 

MCE_FLAG equ 0080h 

CMPXCHG8B_FLAG equ 0100h 

APIC_FLAG equ 0200h 
.data 


id_msg db "This system has as" 
cp_error db "n unknown processors" 
cp_ 8086 db "n 8086/8088 processors" 
cp_286 db "n 80286 processors" 


241618-12 


2-460 | | : 


AP-485 


intel. 


cp_386 "n 80386 processors" 

cp_486 "n 80486DX, 80486DxX2 processor or" 
" 80487SX math coprocessor$s" 

cp_486sx "n 80486SX processors" 

fp_8087 "and an 8087 math coprocessor$" 

fp_287 "and an 80287 math coprocessor$" 

fp_387 " and an 80387 math coprocessors" 


intel486_msg 
intel486dx_msg 
intel486sx_msg 
inteldx2_msg 
intelsx2_msg 
inteldx4_msg 
inteldx2wb_msg 


" Genuine Intel486(TM) processors" 
Genuine Intel486(TM) DX processors" 

" Genuine Intel486(TM) SX processors" 

" Genuine IntelDX2(TM) processors" 

" Genuine IntelSX2(TM) processor$" 

" Genuine IntelDX4(TM) processors" 

" Genuine Write-Back Enhanced" 

" IntelDX2(T) processors" 

" Genuine Intel Pentium(TM) processors" 
"n unknown Genuine Intel processors" 


pentium_msg 
unknown_msg 


GEGCEREEEEE FEE FEE & 


; The following 16 entries must stay intact as an array 


intel_486_0 dw offset intel486dx_msg 

intel _486_1 dw offset intel486dx_msg 

intel 486 _2 dw offset intel486sx_msg 

intel_486_3 dw offset inteldx2_msg 

intel 486 4 dw offset intel486_msg 

intel _486_5 dw offset intelsx2_msg 

intel _486_6 dw offset intel486_msg 

intel_486_7 dw offset inteldx2wb_msg 

intel _486_8 dw offset inteldx4_msg 

intel _486_9 dw offset intel486_msg 

intel _486_a dw offset intel486_msg 

intel_486_b dw offset intel486_msg 

intel_486_c dw offset intel486_msg 

intel 486d aw offset intel486_msg 

intel _486_e dw offset intel486_msg 

intel_486_f dw offset intel486_msg 

; end of array | 

family_msg ab 13,10,"Processor Family: $" 

model_msg db 13,10,"Model: SM 

stepping_msg db 13,10,"Stepping: ! 

cr_lf db 23: 105."S" 

turbo_msg db 13,10,"The processor is an OverDrive(TM) " 
db " processors" 

dp_msg db 13,10,"The processor is the upgrade processor" 
db "in a dual processor system$" 

fpu_msg db 13,10,"The processor contains an on-chip FPUS$" 
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mce_msg db 13,10,"The processor supports Machine Check" 
db " Exceptionss" | . os 
cmp_msg db 13,10,"The processor supports the CMPXCHG8B" 
db " instructions" 
vme_msg ab 13,10,"The processor supports Virtual Mode" 
| db " Extensionss" 
pse_msg db 13,10,"The processor supports Page Size" 
db " ExtensionssS" | 
apic_msg db 13,10,"The processor contains an on-chip" 
db " APICS" 
not_intel db "t least an 80486 processor." 
db 13,10,"It does not contain a Genuine Intel" 
db " part and as a result, the",13,10,"CPUID" 
db " detection information cannot be determined" 
db "at this time.$" 
ASC_MSG MACRO msg , 
LOCAL ascii_done ; -local label 
add al, 30h 
cmp al, 39h ; is it 0-9? 
jle ascii_done 
add al, O7h 
ascii_done: 
mov byte ptr msg[20], al 
mov dx, offset msg 
mov ah, 9h 
int 21h 
ENDM 
.code 
.8086 


print proc 


; This procedure prints the appropriate cpuid string and 

; numeric processor presence status. If the CPUID instruction 
: was used, this procedure prints out the CPUID info. , 

; All registers are used by this procedure, none are preserved. 


mov ax, offset id_msg 3} print initial message 

Mov ah, 9h 

int 2ih 

emp _cpuid_flag, 1 ; if set to 1, processor 

; supports CPUID instruction 

je an print_cpuid_data ; print detailed CPUID info 
print_86: 

emp _cpu_type, 0 

jne print_286 
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mov dx, offset cp_8086 

mov ah, 9h or 

int 21h 

cmp _fpu_type, 0 

je end_print 

mov dx, offset fp_8087 

mov ah, 9h 

int 21h 

jmp end_print 
print_286: 

cmp _cpu_type, 2 

jne print_386 

mov dx, offset cp_286 

mov ah, 9h 

int 21h 

cmp _fpu_type, 0 

je end_print 
print_287: 

mov dx, offset fp_287 

mov ah, 9h 

int 2ih 

jmp end_print 
print_386: 

cmp _cpu_type, 3 

jne print_486 

mov dx, offset cp_386 

mov ah, 9h 

int 21h 

cmp _fpu_type, 0 

je end_print 

cmp _fpu_type, 2 

je print_287 

mov dx, offset fp_387 

Mov ah, 9h 

int 21h 

jmp end_print 
print_486: 

cmp _cpu_type, 4 

jne print_unknown ; Intel processors will have 

mov dx, offset cp_486sx + CPUID instruction 

cmp _fpu_type, 0 

je print_486sx 

mov dx, offset cp_486 
print_486sx: 

mov ah, 9h 

int 2ih 

jmp end_print 
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print_unknown: 
Mov dx, offset cp_error 
jmp print_486sx 


print_cpuid_data: 


jmp print_common 
print_unknown_type: 7 


print_common: 
mov ah, 2h 
int 2ih 
; print family, model, and stepping 
print_family: 
mov al, _cpu_type 
ASC_MSG family_msg 


print_model: 


ASC_MSG modeil_msg 
print_stepping: 


and al, Ofh 
ASC_MSG stepping_msg 


print_upgrade: 


test ax, 1000h 
jz check_dp 
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.486 
cmp _intel_cPU, 1 ; check for genuine Intel 
jne not_GenuineIntel ; processor 
print_486_type: | 
cmp _cpu_type,. 4 ; if 4, print 80486 processor 
jne print_pentium_type 
mov ax, word ptr _cpu_signature 
shr ax, 4 
and eax, Ofh ; isolate model 
mov dx, intel_486_0[eax*2] 
jmp print_common 
print_pentium_type: | 
cmp _cpu_.type, 5 if 5, print Pentium processor 
jne print_unknown_type 
Mov dx, offset pentium_msg 


; if neither, print unknown 


j; print family msg 


mov ax, word ptr _cpu_signature 
shr ax, 4 
and — al, Ofh 


; print model msg 


mov ax, word ptr _cpu_signature 


; print stepping msg 


mov ax, word ptr _cpu_signature | 


; check for turbo upgrade 
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mov 
mov 
int 
jmp 


check_dp: 
test 
jz 
mov 
mov > 
int 


print_features: 
Mov 
and 
jz 
mov 
mov 
int 


check_MCE: 
mov 
and 
jz 
Mov 
mov 
int 


check_CMPXCHG8B: 
mov 
and 
jz 
mov 
mov 
int 


check_VME: 
mov 
and 
jz 
mov 
mov 
int 


check_PSE: 
mov 
and 
jz 
mov 
mov 


ax, 


dx, offset turbo_msg 
ah, 9h 

21h 

print_features 


ax, 2000h ; check 


print_features 


dx, offset dp_msg 
ah, 9h 
21h 


ax, word ptr _features_edx 
FPU_FLAG : 
check_MCE 

dx, offset fpu_msg 

ah, 9h 

21h 


check 


ax, word ptr _features_edx 
ax, MCE_FLAG ; check 
check_CMPXCHG8B 


dx, offset mce_msg 
ah, 9h 
21h 


ax, word ptr _features_edx 


ax, CMPXCHG8B_ FLAG ; check 
check_VME 

ax, offset cmp_msg 

ah, 9h 

21h 

ax, word ptr _features_edx 

ax, VME_FLAG ; check 
check_PSE 

dx, offset vme_msg 

ah, 9h 

21h 


ax, word ptr _features_edx 


ax, PSE_FLAG ; check 
check_APIC 

dx, offset pse_msg 

ah, 9h 


for 


for 


for 


for 


for 


for 
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dual processor 


FPU 


MCE 


CMPXCHG8B 


PSE 
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int 2ih 


check_APIC: . 
mov ax, word ptr _features_edx . 
and ax, APIC_FLAG ; check for APIC 
jz end_print 
mov dx, offset apic_msg 
mov ah, 9h 
int 21h 


jmp end_print 


not_GenuinelIntel: 
mov dx, offset not_intel 
mov ah, 9h 
int 21h 


end_print: . 
mov offset cr_lf 
mov 9h 
int 
ret 
endp 


end 


241618-18 


2-466 : : 


intel. aP-488 


Example 3. Processor identification Procedure in the C Language 


/* Filename: cpuid3b.c * / 
/* Copyright 1994 by Intel Corp. sid 
(a | 
/* This program has been developed by Intel Corporation. You *) 


/* have Intel's permission to incorporate this source code into */ 
/* your product, royalty free. Intel has intellectual property 1a 
/* rights which it may assert if another manufacturer's processor*/ 
/* mis-identifies itself as being "GenuineIntel" when the CPUID */ 


/* instruction is executed. : . * / 
| ies it | 
/* Intel specifically disclaims all warranties, express or ly | 
/* implied, and all liability, including consequential and other */ 
/* indirect damages, for the use of this code, including dt 
/* liability for infringement of any proprietary rights, and x} 
/* including the warranties of merchantability and fitness for a */ 
/* particular purpose. Intel does not assume any responsibility */ 
/* for any errors which may appear in this code nor any bale 
/* responsibility to update it. */ 
is */ 
/* This program contains three parts: ak 
/* Part 1: Identifies CPU type in the variable _cpu_type: x7 
= : ay | 
/* Part 2: Identifies FPU type in the variable _fpu_type: ay 
ae xy 
/* Part 3: Prints out the appropriate message. aad 
ae ay, 
/* This program has been tested with the Microsoft C compiler. ia | 
/* If this code is compiled with no options specified and linked */ 
/* with the cpuid3a.asm module, it correctly identifies the * 
/* current Intel 8086/8088, 80286, 80386, 80486, and ay 
/* Pentium(tm) processors in the real-address mode. : sy 
#define FPU_FLAG 0x0001 

#define VME_FLAG 0x0002 

#define PSE_FLAG 0x0008 

#define MCE_FLAG 0x0080 

#define CMPXCHG8B_ FLAG 0x0100 

#define APIC_FLAG 0x0200 


extern char cpu_type; 
extern char fpu_type; 
extern char cpuid_flag; 
extern char intel_CPU; 
extern char vendor_id[12]; 
extern long cpu_signature; 
extern long features_ecx; 
extern long features_edx; 
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extern long features_ebx;. 
main() { 
get_cpu_type () ; 
get_fpu_type() ; 
print(); 
} 
print() { 
printf ("This system has a"); 
if (cpuid_flag == 0) { 
switch (cpu_type) { 
case 0: 
printf("n 8086/8088 ee 
if (fpu_type) printf(" and an 8087 math coprocessor") ; 
break; 
case 2: 
printf('n 80286 processor") ; 
if (fpu_.type) printf (" and an 80287 math coprocessor"); 
break; 
case 3: 
printf('"n 80386 processor") ; 
if (fpu_type == 2) 
printf(" and an 80287 math coprocessor") ; 
else if (fpu_type) : 
printf(" and an 80387 math coprocessor") ; 
break; : 
case 4: 
if (fpu_type) printf("n 80486DX, 80486DX2 processor or \ 
80487SX math coprocessor") ; | 
else printf("n 80486SX processor") ; 
break; 
default: 
printf("n unknown processor") ; 
} 
} else { 
/* using cpuid instruction */ 
if (intel_CPU) { 
if (cpu_type == 4) { 
switch ((cpu_ eignatires>4)e0xf) { 
case 0: 
case 1: 
printf(" Genuine Intel486(TM) DX processor") ; 
break; 
case 2: 
printf(" Genuine Intel486(TM) SX processor") ; 
break; 
case 3: 
printf(" Genuine IntelDX2(TM) processor") ; 
break; 
case 4: 
printf(" Genuine Intel486(TM) processor") ; 
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break; 
case 5: ; 
printf(" Genuine IntelSX2(TM) processor") ; 
break; 
case 7: 
printf(" Genuine Write-Back Enhanced \ 
IntelDX2 (T) processor") ; 
break; 
case 8: 
printf(" Genuine IntelDX4(TM) processor") ; 
break; 
default: 
printf(" Genuine Intel486(T) processor") ; 
} 
} else if (cpu_type == 5) 
printf(" Genuine Intel Pentium(TM) processor") ; 
else 
printf("n unknown Genuine Intel processor") ; 
printf("\nProcessor Family: %X", cpu_type) ; 
printf ("\nModel: %X", (cpu_signature>>4) &0xf) ; 
printf ("\nStepping: $X\n", cpu_signature&0xf) ; 
if (cpu_signature & 0x1000) 
printf ("\nThe processor is an OverDrive(TM) upgrade 
\processor"); 
else if (cpu_signature & 0x2000) 
printf("\nThe processor is the upgrade processor \ 
in a dual processor system") ; 
if (features_edx & FPU_FLAG) 
printf("\nThe processor contains an on-chip FPU") ; 
if (features_edx:-& MCE_FLAG) 
printf("\nThe processor supports Machine Check \ 


Exceptions") ; 
if (features _edx & CMPXCHG8B_FLAG) 
printf("\nThe processor supports the CMPXCHG8B \ 
instruction") ; 
if (features_edx & VME_FLAG) 
printf("\nThe processor supports Virtual Mode \ 
Extensions"); 
if (features_edx & PSE_FLAG) 
printf("\nThe processor supports Page Size \ 
Extensions") ; 
if (features_edx & APIC_FLAG) 
printf("\nThe processor contains an on-chip APIC") ; 
} else { 
printf("t least an 80486 processor.\nIt does not \ 
contain a Genuine Intel part and as a result, the\nCPUID detection \ 
information cannot be determined at this time.") ; 
} 


} 
Wrinee,: Wi). 
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INTRODUCTION 


Since the introduction of the Intel386™ SL microproc- 
essor and the subsequent introduction of the 
Intel486™M SL microprocessor, the SL Architecture 
has become the de facto standard for mobile computers. 
Given the industry acceptance of SL Architecture, Intel 
is extending the SL Architecture to the SL Enhanced 
Intel486 microprocessor family. This application note 
describes how the same features of the SL Architecture 
can be implemented on the SL Enhanced Intel486 
CPUs. Although this application note is written for 
people with experience designing Intel486 SL CPU- 
based mobile computers, the information provided in 
this document will also be useful for anyone interested 
in learning more about the SL Enhanced Intel486 
CPUs. 


The first section of this document highlights the differ- 
ences between the Intel486 SL CPU and the SL En- 
hanced Intel486 CPU. Section two describes the archi- 
tectural differences in System Management Mode 
(SMM). Section three discusses power management fea- 
tures of the Intel486 SL CPU and the SL Enhanced 
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Intel486 CPU. Section four explains reset implementa- 
tion of the Intel486 SL CPU and the SL Enhanced 
Intel486 CPU. 


1.0 COMPARISON OF THE SL 
ENHANCED Intel486™ CPU AND 
Intel486 SL CPU 


The SL Enhanced Intel486 CPU supports many of the 
features available in the SL Architecture. The major 
difference between the SL Enhanced Intel486 CPU and 
the Intel486 SL CPU is level of integration. The 
Intel486 SL CPU is a highly integrated CPU with 
memory controller, ISA/PI-bus controller, and power 
management built into it. The SL Enhanced Intel486 
CPU has retained all the SMM and power management 
features from the SL Architecture. Features not sup- 
ported by the SL Enhanced Intel486 CPU can easily be 
implemented by external hardware. Table 1 highlights 
the differences between the SL Enhanced Intel486 CPU 
and the Intel486 SL CPU. 


Table 1. Feature Comparison of the SL Enhanced Intel486 CPU and the Intel486 SL CPU 


Yes 


Clocking Options 


CPU Frequency 


SL Enhanced Intel486 SL 
Intel486 CPU CPU 


Yes . 
168 lead PGA, 196 lead PQFP 196 lead POFP, 
208 lead SQFP 208 lead SQFP_ |. 
1X clock input or 2X clock input 2X clock input 


Intel486 SX CPU: 25 MHz, 33 MHz 
Intel486 DX CPU: 33 MHz, 50 MHz 
Intel486 DX2 CPU: 40 MHz, 50 MHz, 66 MHz 


c Yes 
ee Lae 
Yes 


Yes | 
Yes 


25 MHz, 33 MHz 
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2.0 SYSTEM MANAGEMENT MODE 
IMPLEMENTATION | 


System Management Mode (SMM), first introduced in 
the SL Architecture for notebook computers, provides 


a unique environment for software to perform power 


management functions much more efficiently. Since 
then, SMM has found its way into many new applica- 
tions. The SMM hardware interface on the SL En- 
hanced Intel486 CPU is similar to that of the Intel486 
SL CPU except for the handshaking protocol. The SL 
Enhanced CPUs handshake through the SMI and 
SMIACT# signals (see Figure 1), and the Intel486 SL 
CPUs handshake through the SMI and SMRAMCS # 
signals. | 


SMIACT# 


SYSTEM 
LOGIC 
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Figure 1. Basic SMI # Hardware Interface 


2.1 System Management Interrupt 


The system interrupts the normal program execution 
and invokes SMM by generating a System Management 
Interrupt (SMI#) to the CPU. On the Intel486 SL 
CPU, the SMI # input is held low as long as the CPU is 
in SMM. With the SL Enhanced Intel486 CPU, SMI # 
input only needs to remain active for a single clock 
provided the SMI setup and hold times, t20 and t21, 
are met. SMI# will also work correctly if it is held 
active for an arbitrary number of clocks. 


NOTE: 
A: Setup time for recognition on I/O instruction boundary 
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Figure 2. SMI # Timing when Servicing an I/O Trap 
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2.1.1 GENERAL DESIGN CONSIDERATIONS 


For Intel486 SL processor-based systems, the 82360SL 


I/O generates the SMI request. For any SMI to be rec- 


ognized by the SL Enhanced Intel486 CPU, the system 


logic must ensure all of the required timings are met. 
The following sections discuss the timing requirements 
that must be observed by the SMI generation logic in- 
terfacing to the SL Enhanced Intel486 CPU. 


2.1.1.1 I/O Trapping 


I/O trapping has proven to be very useful in the SL 
Architecture for device power management. Trapping 
the last I/O access prior to entering SMM prevents the 
CPU from accessing a powered-down device. With the 
exception of the SMFILO (System Management 
FILO), the SL Enhanced CPU supports the same I/O 
Instruction Restart option under SMM featured in the 


_ Intel486 SL CPU. 


The I/O Instruction Restart feature of the SL En- 
hanced Intel486 CPU is used the same way as the I/O 
Instruction Restart feature of the Intel486 SL CPU. 
When the I/O Instruction Restart option is enabled (by 
setting offset OFFOOH in the SMRAM to OFFH), the 
RSM instruction microcode modifies the restored EIP 
to point to the instruction immediately preceding the 
SMI # request, so that the I/O instruction can be re-ex- 
ecuted. For the CPU to trap the last I/O access correctly, 
the external hardware must ensure the SMI# signal is 
asserted at least three CPU clock periods prior to assert- 
ing the RDY# signal (see Figure 2). 
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2.1.1.2 Back-to-Back SMIis 


For back-to-back SMIs, the SMI# input must be held 
inactive for at least four clocks after it is de-asserted to 
reset the edge-triggered SMI detection logic. Otherwise, 
the second SMI# request may not be recognized (see 
Figure 3). 


2.2 SMI Active (SMIACT #) 


A new pin called SMIACT # (SMI ACTive) which in- 
dicates that the CPU is operating in SMM has been 
added to the SL Enhanced Intel486 CPU. The CPU 
asserts SMIACT # in response to an SMI interrupt re- 
quest on the SMI# pin. SMIACT # is driven active by 
the CPU before accessing the SMRAM. SMIACT# 
remains active until the last access to SMRAM when 
the CPU restores (reads) its state from SMRAM. After 
the RSM instruction is executed, the CPU de-asserts 
the SMIACT # signal. 


The SMIACT# § signal is equivalent to the 
SMRAMCS # signal on the Intel486 SL microproces- 
sor except that SMIACT# on the Intel486 SL CPU is 
active all the time and cannot be used as a chip select 
signal for external SMRAM. On the Intel486 SL micro- 
processor, the SMRAM is enabled automatically when- 
ever the CPU is switched into SMM. A similar mecha- 
nism can also be implemented by using the SMIACT # 
signal. Whenever the SMIACT*# signal is active, the 
SMRAM will be enabled by the system logic. 


2.2.1 GENERAL DESIGN CONSIDERATIONS 


As previously mentioned, one of the many uses of the 
SMIACT¥# output is to enable SMRAM when the 
CPU is operating in SMM. Most importantly, the 
SMIACT #¥ output is used by the system logic to main- 
tain system integrity while the CPU is in SMM. 
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If part of the system memory is overlaid by the 
SMRAM while the CPU is in SMM, the system logic 
should ensure that only the CPU and SMI handler have 
access to the SMRAM< area. Accesses to addresses 
overlaid by a bus master or DMA controller when 
SMIACT # is active should be re-directed to the system 
memory underneath the SMRAM and not the 
SMRAM itself. 


While inside SMM, the CPU should be protected from 
system activities such as CPU RESET, interrupt re- 
quests, and NMI, etc. The SMIACT# can be used by 
the system logic to block off these system activities 
while the CPU is in SMM. 


2.3 SMRAM Interfacing 


SMRAM resides in a unique address space so that the 
software operating under SMM is transparent to the 
normal address space. On the Intel486 SL microproces- 
sor, the size of SMRAM can be either 32 Kbytes or 
64 Kbytes. Depending on the size of the SMRAM, the 
SMRAM area can be located in either 38000H- 


3FFFFH or 30000H-3FFFFH. 


On the SL Enhanced Intel486 CPU, the size of the 
SMRAM can be between 32 Kbytes and 4 Gbytes. The 
location of the SMRAM is determined by the SMBASE 
(SMRAM BASE ADDRESS) register, and defaults to 
SMBASE + 8000H, which is 38000H after CPU RE- 
SET. The first SMI after a CPU RESET always begins 
executing instructions at 38000H. 
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Figure 3. Back-To-Back SMI # Timing 
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2.3.1 SMRAM INITIALIZATION 


The SL Enhanced Intel486 CPU family provides a new. | 


control register, SMBASE for changing the SMRAM 
base address (see Figure 4). The SMRAM base address 
can be changed after CPU RESET by invoking a dum- 
my SMI call to change the SMBASE register. 


31 0 


SS 


_ Figure 4. SMBASE Register 


SMRAM Base Address 
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In the SL Enhanced Intel486 CPU, a new slot is added 
to the CPU dump area inside the SMRAM at offset 
7EF8H for changing the SMRAM base address. Dur- 
ing the execution of the RSM instruction, if the reloca- 
tion bit is set, the CPU will read this slot and initialize 
the CPU to use the new SMBASE during the next SMI. 
From then on, the CPU will do its context save to the 
new SMRAM area pointed to by the SMBASE, store 
the current SMBASE in the SMM Base slot (offset 
7EF8H), and then execute the new jump vector based 
on the current SMBASE. 


SMBASE must start at a 32K aligned boundary. Pro- 
gramming the SMBASE register to values that are not 
32K aligned will cause the CPU to enter the shutdown 
state when executing the RSM instruction. After the 
SMRAM base address is changed, the new starting ad- 
dress for the SMI jump vector is calculated by adding 
8000H to the new SMRAM base address. The starting 


address for CPU state dump area will be remapped to 


the new SMRAM base address plus OFFFFH. 


A new bit (bit 17) has been added to the SMM Revision 


Identifier on the SL Enhanced Intel486 CPU to indi- 
cate whether the processor supports relocation of the 
SMRAM base address. With the SL Enhanced Intel486 
_CPU, the SMBASE relocation bit is always set to one 


to indicate the processor supports SMBASE relocation. | 


2.3.2 GENERAL DESIGN CONSIDERATIONS 


Since the memory controller is embedded inside the 
Intel486 SL CPU, many design issues with SMRAM 
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interface are handled internally by the CPU. For the SL 


‘Enhanced Intel486 CPU, these issues must be handled 


by the external system logic interfacing to the CPU. | 


2.3.2.1 Accessing SMRAM 


Before the CPU can execute code inside SMM, the 
SMRAM must be loaded with valid SMM code. If the 
SMRAM is not initialized with code prior to entering 
SMM, executing invalid code out of the SMRAM can 
place the CPU in an unknown state. Thus, the external 
memory controller must provide a mechanism to bring 
the SMRAM into system address space without invok- 
ing SMM. This will allow software such as BIOS to 
load the SMM code into SMRAM. _ 


The Intel486 SL CPU provides a hardware mechanism 
to access memory overlaid by SMRAM. Although sys- 
tem logic is not required to provide a mechanism to 
access memory located underneath SMRAM, it may be 
much easier to implement a suspend state (0-volt sus- 
pend or 5-volt suspend) if such a mechanism is provid- 
ed. 


2.3.2.2 Cache Coherency 


Since the Intel486 SL CPU does not support a second 
level cache, cache coherency with SMRAM is handled 


- completely inside the CPU. The CPU’s internal cache 


is automatically emptied before entering SMM and af- 
ter exiting SMM. 


The SL Enhanced Intel486 CPU does not flush its 


cache before entering SMM or after leaving SMM. 


Cache flushing is not required if the SMRAM is located 
in a non-cacheable area in the memory address space or 
in an external address space which is not visible to the 
system. If the SMRAM is located in a cacheable area 
that overlays system memory, both the CPU internal 
cache and any second level caches must be flushed be- 
fore entering SMM. If SMRAM is cacheable, the CPU 


internal cache and any second level caches must also be 


flushed when exiting SMM. The following steps must 
be taken by the system logic to maintain cache coheren- 
cy when SMM overlays normal system memory: 


1. Before entering SMM, the FLUSH# pin should be 
asserted when SMIACT # is driven active to empty 
the CPU cache. 


2. The KEN# pin must be driven inactive to.stop ac- 
cesses to the SMRAM area from filling in the cache 
line if SMRAM is not cacheable. 


a 
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3. Upon leaving SMM, if SMRAM is cacheable, the 
CPU cache is emptied by asserting the FLUSH# 


pin within one CPU CLK after the SMIACT# pin 
is de-asserted. 


It is the responsibility of the system logic to ensure that 
the setup and hold times for FLUSH # and SMIACT # 
signals are met. 


2.3.2.3 External Write Buffers 


Like the Intel486 SL processor, the SL Enhanced 
Intel486 CPU empties its internal write buffers before 
entering SMM to prevent data in the write buffers from 
being written to SMRAM space. If a system supports a 
second level cache, the second level write buffers must 
also be emptied before the CPU enters SMM. It is pos- 
sible that the CPU is in SMM before the second level 
write buffers are completely emptied by the memory 
controller. In case the second level write buffer is not 
completely emptied, the SMIACT # signal can be used 
to direct the memory write cycles to either SMM space 
or memory space. 


2.3.2.4 A20M# Pin 


The A20M# pin on the Intel486 CPU is provided to 
emulate the address wraparound at the 1 Mbyte bound- 
ary which occurs on the 8086 microprocessor (see Fig- 
ure 5). The SMRAM space on the Intel486 SL CPU is 
always below 1 Mbyte memory address space. Memory 
above 1 Mbyte can either be accessed through the 
ISAWINDOW register or the MCWINDOW register. 
The A20M # signal is automatically driven low when- 
ever the CPU is in SMM. When A20M # is active, all 
external bus cycles will drive A20 low, and all internal 
cache accesses will be performed with A20 low. 


A20 GATE 


CONTROL LOGIC 
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The SL Enhanced Intel486 CPU does not provide any 
memory mapping mechanism to access memory above 
1 Mbyte. To access memory above 1 Mbyte inside 
SMM, the software has to disable the A20M# manual- 
ly through the keyboard controller. Also, if the 
SMRAM is located above 1 Mbyte and A20M # is not 
enabled before entering SMM, the system will crash. In 
this case, the CPU will attempt to access SMRAM at 
the relocated address with A20 low, and will thus fetch 
invalid code. 


For these reasons, the A2ZOM# should be driven inac- 
tive prior to entering SMM and remain inactive as long 
as the CPU is in SMM. This can be accomplished by 
blocking the assertion of A20M#_ whenever 
SMIACT # is active. The state of the A2ZOM# should 
be saved upon entry to SMM and restored to its origi- 
nal state after leaving SMM. 


2.4 SMM Environment Initialization 


When the CPU is running in SMM, the processor is in 
a pseudo “real mode” environment, but without the 
64 Kbyte limit. After the SMRAM base address regis- 
ter has been relocated, the CPU segment registers will 
have values shown in Table 2 when an SMI¥# occurs. 
The CS selector register still contains the value 3000H, 
not the value corresponding to the new SMBASE. The 
rest of the registers are still initialized to zero. 


If the SMRAM base address has not been relocated, the 
segment registers can be initialized in the same way as 
with the Intel486 SL processor, i.e., using the CS regis- | 
ter which defaults to 3000H: Otherwise, the segment 
registers must be initialized correctly to point to the 
new SMRAM memory space. 
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Figure 5. A20M# Interface Logic 
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Normally, the data segment registers are initialized to 


point to the SMRAM base address. Upon entering . 


SMM, the CS BASE segment register is initialized to 
point to the SMRAM base address. The location of the 
SMRAM base address can be determined by reading 
the SMBASE register in the SMRAM at offset 
OFEF8H (the location of the SMRAM base address can 
also be stored in another memory location such as 
CMOS RAM by the BIOS which can be retrieved by the 
SMM program). 


The SMBASE contains a 32-bit address and has to be 
shifted to the right by four bits to generate a 16-bit 
segment address before it can be placed in the data 
segment selector registers. The CS selector register can- 
not be initialized by writing directly to it. It has to be 
initialized by executing a far jump instruction to an 
address within the SMRAM to force the CS selector 
register to point to the SMRAM base address. 


When the CPU is in SMM, the operand size and the 
address size are still 16 bits but there are no limits to 
segment size. The physical address of an instruction is 
obtained by adding the value in CS segment base regis- 
ter to the value in EIP register, rather than the IP regis- 
ter. To access data anywhere within the four Gbyte 
logical address space, operand-size override (opcode 
66H) and address-size override (opcode-67H) prefixes 
can be used as needed. Alternatively, SMRAM data 
located above | Mbyte can also be accessed by using 
32-bit displacement registers. 


Table 2. Register Values after SMI #_ 


Segment 
Register 
CS 
DS 
ES 
FS 
GS 
SS 


SMBASE | 4 Gbytes 
OH 4 Gbytes 
OH 4 Gbytes 
OH 4 Gbytes 
OH 4 Gbytes 


OH 


4 Gbytes 


3.0 POWER MANAGEMENT 


One of the most important power management features 
on the Intel486 SL processor is CPU clock control. The 
clock control scheme on the SL Enhanced Intel486 
CPU is similar to the Intel486 SL CPU, with the 
Intel486 SL CPU being driven by a 2X clock, and the 
SL Enhanced Intel486 CPU available with both the IX 
and 2X clocking options. 
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. The 2X clock input is twice the internal frequency of 


the CPU, whereas the IX clock frequency is the same . 
internal frequency of the CPU. With the IX clock, the 
two internal clock phases, “phase one” and “phase 
two”, are generated by an internal Phase Lock Loop 
(PLL). The CPU clock input for a IX clock cannot be 
changed dynamically because the PLL requires a con- 
stant frequency CLK input (to within 0.1 %). 


3.1 STPCLK# Interrupt 


As with the Intel486 SL CPU, the SL Enhanced 
Intel486 CPU provides an interrupt mechanism, 


‘STPCLK #, which allows system hardware to control 


the power consumption of the CPU by stopping the 
internal clock to the CPU. Unlike the normal inter- 
rupts, INTR and NMI, the STPCLK # interrupt does 
not initiate interrupt acknowledge cycles or interrupt 
table reads. | 


The Stop Clock feature on the SL Enhanced Intel486 
CPU has been improved, allowing the input to the 
STPCLK # to be driven asynchronously as well as syn- 
chronously. The major difference between asynchro- 
nous and synchronous control is that the STPCLK # 
interrupt latency is much shorter with asynchronous | 
control. 


With the Intel486 SL CPU, the STPCLK# input is 
controlled asynchronously through software. The 
STPCLK # is asserted after doing a dummy I/O read 
to the STPCLK register in the 82360SL or executing an 
HLT instruction. The STPCLK# signal will remain 
asserted until a system event wakes up the CPU. If the 
STPCLK # input is driven asynchronously, both setup 
and hold times tz9 and t2; must be met for the 
STPCLK # interrupt request to be recognized. 


After a STPCLK # interrupt request is recognized by 
the CPU, the processor will stop execution on the next 
instruction boundary, stop the pre-fetch unit, and then. 
empty all internal pipelines and the write buffers. Final- 
ly, a special Stop Grant bus cycle is generated. The pin 


state during a Stop Grant cycle is shown in Table 3. 


The interrupt acknowledge cycle is terminated when . 
the system logic returns RDY# or BRDY#. At this 
point the CPU is in the Stop Grant state and the inter- 
nal clock is stopped. The Stop Grant cycle is similar to 
the HALT cycle except that the address bus has the 
value 04H instead of OOH. 
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Table 3. Pin State During Stop Grant Cycle 


Address Bus 0000 0010H (Ag = 1) 
BE3 # -BEO# 1011 (Same as HALT) 


Using the STPCLK # input, the SL Enhanced Intel486 
CPU can be put into low power states similar to the 
Global Standby and Suspend states as with the Intel486 
SL microprocessor. 


3.2 Global Standby Implementation 


In an Intel486 SL processor-based system, the 82360SL 
puts the CPU in a low power standby state (CPU Icc 
~ 20 mA-50 mA) when the system is in Global Stand- 
by. A similar state called Stop Grant State is provided 
by the SL Enhanced Intel486 CPU. The Stop Grant 
state can be entered by simply asserting the external 
STPCLK # interrupt pin. Once the STPCLK # inter- 
rupt is acknowledged by the CPU (ie., after the Stop 
Grant cycle is placed on the bus), the CPU is in the 
Stop Grant state. 


While in the Stop Grant state, the CPU still responds to 
RESET or SRESET and requests a cache invalidation 
(i.e, HOLD, AHOLD, BOFF# and EADS#). How- 
ever, the CPU does not recognize any other inputs 
while in the Stop Grant state. Input signals to the CPU 
will not be recognized until one CPU clock cycle after 
STPCLK # is deasserted. 


>: 2 


CLK2 


CLK2 
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To emulate Global Standby, the stop clock control log- 


ic must be able to de-assert the STPCLK# signal 
whenever there is system activity (i.e, INTR, IRQ, 
NMI, and SMI#). Typically, the stop clock de-asser- 
tion logic is implemented by logic which latches the 
incoming interrupt requests from the system. The CPU 
returns to its normal state within 10-20 CPU clock 
cycles after exiting the Stop Grant state. 


As mentioned before, the CPU does not recognize any 


interrupt request while the STPCLK # input is active. 


To prevent the interrupt request from getting lost, the 
interrupt request logic must ensure the interrupt signal 
is held active for at least one CPU clock after the 
STPCLK # input is de-asserted. 


3.2.1 SUSPEND IMPLEMENTATION 


From the Stop Grant state, the CPU can go into a 
lower power state similar to the suspend state offered in | 
the Intel486 SL processor. After the CPU is in the Stop 
Grant state, the CPU can enter the lowest power Stop 
Clock state (~ 100 »A—200 A) by stopping the CPU 
clock input. The CPU clock input can be driven to 
either logic high or logic low during Stop Clock state. 
The CPU will not generate any acknowledge cycle 
when entering stop clock state. 


For a 2X clock input, the clock input to CLK2 can be 
stopped on either a logic high or logic low independent 
of the clock phase. The CPU will go back to Stop Grant 
state as soon as the CPU clock is re-started. Upon exit 
from Stop Clock State, the CPU clock input must be re- 
started in the same state when it was stopped (see Fig- 
ure 6). 
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Figure 6. CLK2 Phase Coherence in CLK2 Stop and Restart 
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For a CPU with a 1X clock input, the CPU clock can 
be stopped in the same manner as a CPU with a 2X 
clock input. Because of the phase lock loop, the CPU 
will not return to the Stop Grant state right after the 
CPU clock input has been re-started. To allow time for 
the PLL to stabilize, the CPU clock input must be held 
at a constant frequency for a period of time equal to the 
PLL startup latency (as specified in the data book) be- 
fore the CPU will return to the Stop Grant state. 


As long as the CPU clock input is stopped, the system 
logic must keep all the CPU input signals in the same 
state before the clock was stopped. Any change in state 
on an input signal (except for INTR) before the CPU 
has returned to the Stop Grant state will result in un- 
predictable behavior. The CPU will not be able to rec- 
ognize any interrupt request while the CPU clock 1S 
stopped. 


3.2.1.1 Dynamic Clock Switching 


For a CPU clock with a 1X clock input, the CPU clock 
cannot be changed “‘on-the-fly’. For power manage- 
ment as well as implementation of features such as de- 
turbo mode, it is advantageous to run the CPU clock at 
~ alower frequency. This can be accomplished by putting 
the CPU into Stop Clock state and change the CPU 
clock to a lower speed. After the CPU clock input fre- 
quency is changed, the clock control logic must ensure 
that the clock input has been running at a constant 
frequency for the time period necessary for the PLL to 
stabilize before de-asserting the STPCLK # signal. The 
lowest CPU clock rate for a 1X part is 8 MHz (see 
Figure 7). | 


25 MHz Restart 


Stop 


Figure 7. Clock Switching on IX Clock Input 


CLK 
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The Stop Grant and Stop Clock states are designed to 
save power. While the processor is in Stop Grant state, 
the input/output signals on the CPU remain at the 
same state when entering the Stop Grant state, floated 
(data and parity signals), or driven to a different state. 
If some of the signals are driven improperly, the system 
can end up consuming more power. : 


3.2.1.2 Power Consumption 


To achieve the lowest power consumption, all the possi- 
ble current leakage must be eliminated. The system log- 
ic should never drive the input signals with pull-up re- 
sistors LOW and input signals with pull-down resistors 
HIGH. While in the Stop Grant state, the pull-up resis- 
tors on STPCLK# and UP# are disabled internally. 
The system must continue to drive these inputs to the 
state they were in immediately before the CPU entered 
the Stop Grant state. For minimum CPU power con- 
sumption, all other input pins should be driven to their 
inactive level while the CPU is in the Stop Grant state. 


3.2.2 GENERAL DESIGN CONSIDERATIONS 


The STPCLK # input is an asynchronous signal. The 
system can crash if the interface to the STPCLK # is 
not designed properly. Special care must be taken to — 
ensure that all the timing requirements are met and the 
proper protocol is used. Listed below are some design 
considerations that should be considered when sa 
ing the STPCLK # interface. 


© The CPU cannot empty the write buffer during an 
HLDA cycle. Therefore, the CPU will not acknowl- 
edge any STPCLK # request during an HLDA cy- 
cle. 


Stop Restart 25 MHz 
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After the STPCLK # is asserted, the CPU does not 
generate a Stop Grant cycle until it completes the 
current instruction. The latency between a 
STPCLK# request and the Stop Grant bus cycle 
depends on the current instruction, the amount of 


data in the CPU write buffers, and the system mem- 


ory performance. 


The CPU will not enter the Stop Grant state until 
either RDY # or BRDY # has been returned. 


In response to HOLD being driven active during the 
Stop Grant state (when the CLK input is running), 
the CPU will generate HLDA and three-state all 
output and input/output signals that are three-stat- 
ed during the HOLD/HLDA state. After HOLD is 
de-asserted all signals will return to the state they 
were in prior to the HOLD/HLDA sequence. 


When the CPU enters the Stop Grant state, the in- 
ternal pull-up resistor is disabled so that the CPU 
power consumption is reduced. The STPCLK # in- 
put must be driven high (not floated) in order to exit 
the Stop Grant state. 


It is the responsibility of the system designer to en- 
sure that the CPU is in the correct state prior to 
asserting cache invalidation or interrupt signals to 


the CPU. 


KBD OR 
SOFTWARE CPU 
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4.0 RESET IMPLEMENTATION 


On a standard PC, the CPU can be reset by either hard- 
ware or software. On the SL Enhanced Intel486 CPU, 
asserting the RESET input to the CPU will also set the 
SMBASE register to the default value of 30000H. In 
other words, the SMRAM base address will reset to 
30000H whenever the operating system asserts the 
CPU RESET signal. For some older software, a CPU 
RESET is generated by the software to return the CPU 
to real mode from protected mode. 


Normally, this is not a problem if SMBASE relocation 
is not used. If the SMRAM base address has been relo- 
cated, the CPU could be executing invalid SMM code 
from the address. The SRESET pin has the same func- 
tions as RESET except that it does not reset the 
SMBASE register. 


For a system which uses SMBASE relocation, the logic 
which generates the software CPU RESET must be tied 
to the SRESET input and not the RESET input on the 
CPU. All hardware resets should be implemented 
through the RESET pin (see Figure 8), and all software 
resets should be implemented through the SRESET 
pin. 


While inside SMM, the CPU should be protected from 
being reset by a software CPU RESET. SRESET 
should be blocked whenever the SMIACT # is active. 
Any request for a CPU RESET when the CPU is in 
SMM should be latched so it can be serviced after the 
CPU exits SMM. To ensure the execution of the RSM 


instruction does not get interrupted by the SRESET, 


the SRESET must be blocked until at least 20 CPU 
clock cycles after SMIACT # has been driven inactive. 


- 241810-8 


Figure 8. SRESET Interface Logic 
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4.1 General Design Considerations 


The system designer should consider the following re- 
strictions while implementing the CPU Reset logic: | 
® For SRESET to be recognized by the CPU, the 
SRESET input must be driven active (high) for at 
least 15 CPU clock cycles. 
@ The SRESET i is not intended to be used for flushing 
the on-chip cache. For compatibility with future 
_ generation Intel CPUs, the SRESET input pin 
should not be used for flushing the on-chip cache. 


e Hardware resets should not be blocked when the 
CPU is in SMM so that the system can recover from 
a fatal system failure. 
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CONCLUSION 


While taking advantage of the benefits of the SL Archi- 
tecture, the SL Enhanced Intel486 CPU family pro- 
vides a whole new world of opportunity for system de- 
signers to develop innovative, energy-efficient mobile 
and desktop designs. The SL Enhanced Intel486 micro- 
processor family combines power management, com- 
patibility and performance, allowing system designers 
to build a wide variety of machines to meet the diverse — 
needs of a broad range of users. 
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1.0 INTRODUCTION 


Intel’s System Management Mode (SMM), introduced 
as part of the Intel SL technology, has become an in- 
dustry standard for portable computing. Through the 
utilization of SMM, system designers have a new meth- 
od of adding software controlled features that operate 
transparently to the operating system and applications 
software. In portable computer systems, SMM is often 
used to implement the power control on various system 
components to conserve power consumption. Flexible 
clock control has also become essential to the design 
of power-saving computers. The new SL Enhanced 
Intel486™ microprocessor family incorporates all of 
the best power management features which first ap- 
peared in Intel SL technology, bringing Intel486 CPU 
performance to portable computers and energy-efficient 
desktop systems. The purpose of this application note is 
to provide system designers with a better understanding 
of the theory and implementation of power manage- 
ment with the new SL Enhanced Intel486 CPUs. 


2.0 POWER MANAGEMENT 
FEATURES 


e System Management Mode—This mode is com- 
posed of a special purpose interrupt that serves as 
the hardware interface and a secure memory address 
space that stores processor status and special soft- 
ware routines. It can be used to implement power 
management for portable systems. 


e Flexible Clocking Options—The clock input to the 
CPU can either be a 1X clock or a 2X clock. For 1X 
clock option, the internal clock of the CPU runs at 
the same speed as the input clock (CLK input). In 
the 2X clock case, the clock input (CLK2 input) 
needs to be twice the frequency of the internal clock. 


¢ Different Low Power States—Different low power 
processor states are available for various operating 
conditions. This feature enables the conservation of 
processor power consumption without sacrificing 
performance. 


¢ Low Voltage Power Supply Option—The SL En- 
hanced Intel486 CPUs can be powered by either a 
SV or a 3.3V supply, with the 3.3V supply providing 
a 50% power savings. 


2.1 System Management Mode 


The System Management Interrupt (SMI#) input pin 
on the processor provides the hardware interface for 
the computer system to invoke SMM. An exclusive 
memory address space, SMRAM, is only available for 
the CPU to access while in SMM. The size of SMRAM 
can be between 32 Kbytes and 4 Gbytes. It is used to 
store processor state and SMI handlers. SMI handlers 
are special software routines that can be designed to 
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control the power states of system components. Intel’s 
SMM has a special instruction, RSM, that is responsi- 
ble for exiting SMM. When executed, RSM instruction 
restores the processor state from SMRAM and returns 
control to the application that was interrupted by 
SMI#. 


The servicing of SMI # is different from that of a regu- 
lar interrupt. The system invokes SMM by generating 
an SMI# to the SL Enhanced Intel486 CPU. Normal 
program execution will be interrupted as a result and 
the CPU will respond to the interrupt by asserting 
SMIACT #. This signal is used by the system to enable 
SMRAM space. The CPU will then save its state into 


_ the SMRAM< $§area, starting at address location 


3FFFFH and proceeding downward in a stack-like 
fashion. After completing a state save, the CPU will be 
in a pseudo real-mode processor environment. The mi- 
crocode will then direct the CPU to begin executing 
instructions at the absolute address of 38000H 
(SMRAM), at which point it will begin executing SMI 
handlers. SMI handlers can perform various system 
management functions, including system power con- 
trol. The last instruction in an SMI handler is always 
the RSM instruction. This instruction will restore the 
CPU state from SMRAM, de-assert SMIACT# and 
return control of the system to the interrupted pro- 
gram. 


The generation of an SMI# to the CPU can be initiated 
by hardware or software for the purpose of power man- 
agement. The actual implementation depends on the 
specific chipset used and how the system is designed. 
Hardware can generate SMI# by pulling the SMI# 
pin low directly or through other chipset pins, i.e., bat- 
tery level control. When the charging level of the main 
battery falls below a certain limit, the chipset monitor- 
ing the battery level can interrupt the normal program 
operation by pulling the SMI# pin low. While in 
SMM, the CPU can execute certain power-down SMI 
handlers to put the entire computer system in a low- 
power mode that can operate out of a different battery 
source. This will enable the system to maintain its cur- 
rent status while allowing the main battery to be 
changed. SMI# can also be initiated through software. 
Different chipsets have various ways of interfacing to 
the CPU in this aspect. Most of them have dedicated 
timers to detect an idled device. Once these timers are 
enabled, the timeout will automatically generate an 
SMI#. 


Exiting SMM is accomplished by the execution of the 
RSM instruction. Besides restoring the CPU state, the 
RSM instruction can also perform three other func- 
tions. The first is called ““Auto HALT Restart”. The 
System Management Interrupt request can interrupt 
the HALT instruction. By setting the appropriate con- 
trol slot in the SMRAM space, the RSM instruction 
can return control to the HALT instruction or the in- 
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struction immediately following the HALT instruction. 
The second special feature is “I/O Trap Restart”. If 
SMI# interrupt is generated on an I/O access, the 
RSM instruction will re-execute that I/O instruction if 
its I/O Trap Restart slot in the SMRAM is set. The 
third function is the relocation of SMBASE, the start- 
ing address of SMRAM. This provides system design- 
ers with the flexibility of placing SMRAM< space into 


an area in which its integrity is best ensured. The start- . 


ing address is controlled by the SMBASE register, 
which has a power-on reset value of 30000H. The de- 
fault SMRAM area starts at 38000H and ends at 
3FFFFH. The SMRAM can be relocated to any 32K 
aligned area, either overlaid on top of the normal sys- 
_ tem address space or placed in a distinct address space. 


2.2 Flexible Clock Control Options 


The standard Intel486 SX and Intel486 DX CPUs are 
driven by 1X clock as opposed to the Intel386 CPUs 


which use a 2X clock input. The SL Enhanced Intel486 _ 


CPUs are available with either the 1X or the 2X clock- 
ing options. 


The 1X clock allows simpler systern design by cutting 
in half the clock speed required in the external system. 
The 1X clock relies on an internal Phase Lock Loop to 
generate the two internal clock phases, “phase one” 


(ph1) and “phase two” (ph2). The rising edge of the 


CLK input corresponds to the start of ph1. All external 
timings are specified with respect to the rising edge of 
the CLK input. The PLL requires a constant frequency 
CLK input (to within 0.1%), and therefore the CLK 
input cannot be changed dynamically. 


The 1X CLK input option is essential for those proces- 
sors with an on-chip clock doubler. The 1X CLK input 
provides the fundamental timing references for the bus 
interface unit. The CLK input is doubled internally so 
that the CPU core will operate at twice the CLK input 
frequency, and hence twice the bus frequency. The in- 
ternal clock doubler enhances all operations using the 
internal cache and/or not blocked by external bus ac- 
cesses. This mode also uses PLL and therefore the CPU 
CLK input must be maintained at a constant frequen- 


cy. 


The SL Enhanced Intel486 CPUs also offer a 2X clock 
option for systems that rely on dynamic frequency scal- 
ing for CPU power management. The frequency of the 
CLK2 input is twice the internal frequency of the CPU. 
The internal clock is comprised of two phases, “PH1” 
and “PH2’’. Each CLK2 period is a phase of the inter- 
nal clock. All timings are referencec! to the rising edge 
of the PH1 of the CLK2 input. It is therefore important 
to synchronize the external circuitry with the PH1 of 
the CLK2 input. Because the 2X clock option does not 
rely on the PLL to generate the internal phase clocks, 
the frequency of the CLK2 input can be changed dy- 
namically or ‘‘on-the-fly”’. 
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2.3 Different Low Power States 


2.3.1 POWER STATES IN 1X MODE 


Several low power modes are available on the 1X mi- 


- croprocessors. These modes make it possible for a pow- 


er-sensitive computer system to optimize power conser- 
vation. Some of the CPU power controls are realized 
through a specially provided interrupt mechanism. 
Each of the following low power states is described in 
detail. a 


© Auto Idle Power Down State 


This low power state is available in normal opera- 
tion for the DX2 CPUs. DX2 CPUs have an. inter- 
nal clock doubler which doubles the 1X clock input 
and therefore enables the CPU core to operate at 
twice the speed of the input clock. When the SL 
Enhanced Intel486 CPU is known to be truly idle 
and waiting for a ready from a memory or an I/O 
bus cycle read or write, the DX2 CPU will reduce 
its core clock rate to 1X from the doubled DX2 
clock. In this state, the CPU only consumes half of 
the normal power. More importantly, this function 
is transparent to software and external hardware © 
and therefore does not cause any performance deg- 
radation. 


© Stop Grant State 


The Stop Grant state is initiated by simply asserting 
the external STPCLK# interrupt pin. The Stop 
Grant state is used to transition to the Stop Clock 
state. | 


The CPU enters the Stop Grant state through the 
following steps: When the CPU recognizes the 
STPCLK# request, it will stop the execution of the 

- normal program on the next instruction boundary, 
stop the pre-fetch unit, empty all internal pipelines 
and write buffers, generate a Stop Grant bus cycle 
and then stop the internal clock. 


This state is exited when STPCLK*#¥ pin is pulled 
high. The rising edge of the STPCLK # will tell the | 
CPU to return control to the interrupted program 
and start to execute the instruction following the 
last executed instruction of the interrupted program. 


e Stop Clock State | 
The Stop Clock state is entered from the Stop Grant 
state by completely stopping the clock input to the 


PLL. In this state, the CPU consumes only 
~ 100 wA-200 pA of current. 


@ Auto HALT Power Down State 


When the HALT instruction is executed, the CPU 
will issue a normal HALT bus cycle. The SL En- 
hanced Intel486 CPU will automatically stop the in- 
ternal CPU clock, therefore causing the CPU to en- 
ter a low power state with a current of ~ 20 mA— 
55 mA. - 
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¢ Stop Clock Snoop State 


Cache snooping is necessary during Stop Grant and 
Auto HALT Power Down states in order to main- 
tain memory coherency. When the system issues a 
request for cache snooping, the CPU will transpar- 
ently enter the Stop Clock Snoop state and will pow- 
er up for 1 full core clock to complete the cache 
snoop cycle. It will then re-freeze the clock to the 
CPU core and either return to the Stop Grant or the 
Auto HALT Power Down state. 


2.3.2 Transition of Power States and Latency 
Associated with 1X Mode 


It is important to understand how the different power 
states are interrelated and how one transitions to anoth- 
er. The latency is different for different state tran- 
sitions. Figure | depicts which transitions are allowed. 
We shall describe how the transitions are made and 
how much latency is associated with each transition. 


1. The Auto Idle Power Down state is entered whenev- 
er the CPU is detected idle and waiting for a RDY # 
from either a memory or I/O read/write. This state 
only applies to SL Enhanced Intel486 DX2 CPUs. 
Both the internal CPU core clock and the current 
drop to half of the doubled frequency. There is no 
latency associated with this transition. The CPU 
will go back to normal operation when a RDY # is 
detected. 


Auto HALT Power Down State | @ | 
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2. The Stop Grant state is entered when the 


STPCLK # interrupt is asserted to the CPU by the 
system. In this state, the clock output of PLL (or the 
clock input to the internal core) is stopped. The 
speed of the clock input to the PLL can be main- 
tained or changed. If the clock frequency is 
changed, the CPU requires the clock to be held at a 
constant frequency for a minimum of 1 ms before 
de-asserting STPCLK#. The 1 ms time period is 
necessary so that the PLL can stabilize the input 
clock period. De-asserting STPCLK # returns the 
CPU to normal operation. The CPU will also return 
to its normal state when RESET or SRESET is as- 
serted. 


. The Stop Clock state can only be entered from Stop 
Grant state when the clock input to the PLL is 


stopped. The CPU must go through Stop Grant 
state to return to normal operation. Before the CPU 
can return to Stop Grant state, the clock input has 
to stabilize for the 1 ms required by the PLL. 


. The Auto HALT Power Down state is entered when 


HALT instruction is executed. The clock input to 
the internal CPU core is automatically stopped 
upon the execution of HALT instruction. The clock 
input to the PLL cannot be changed during this 
state. This state is exited back to normal operation 
whenever any of the following events happen: 
INTR, NMI, SMI#, RESET or SRESET. There is 
no latency associated with this state transition. 


Auto Idle Power Down State 


Normal State 


ef Le] 42 


Stop Clock Snoop State @ | Stop Grant State 


Stop Clock State 
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Figure 1. Power State Transitions for 1X CPUs 
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5. When the CPU is in Auto HALT Power Down 
state, the system can generate STPCLK# to the 
CPU to bring the CPU into a Stop Grant State. 
When the system de-asserts the STPCLK # request, 
the CPU will return to the Auto HALT Power 
Down state. There is no latency associated with this 
transition. HALT bus cycles will. be launched when- 
ever this transition occurs. 


6, 7. Cache snooping can be performed when the cpu 


is either in Stop Grant state or in Auto HALT 
Power Down state. Cache snoop cycles begin 
when the CPU receives an EA.DS# from the sys- 
tem. The CPU will only wake up for 1 complete 
core clock to perform cache invalidation and then 
re-freeze the clock, i.e., either return to the Auto 


HALT Power Down state or to the Stop Grant 


state. 


2.3.3 POWER STATES IN 2X MODE 


There are five operating modes for the 2X SL En- 
hanced Intel486 CPU. The CPU power controls are 
realized through a specially provided interrupt mecha- 
nism, STPCLK #. In the following, we shall describe 
each of the power states in detail. 


e Normal State 


2X CPUs do not have a PLL, and therefore do not. 


require a stabilized clock period. This means the fre- 
quency of the input clock (CLK2) can be changed 
dynamically. Depending on the level of activity, 
CPUs do not always have to operate at full speed. 
Reducing the CPU speed saves power. 


e Stop Grant State 


The Stop Grant state is initiated by simply asserting 
the external STPCLK# interrupt pin. The Stop 
Grant state is used to transition to the Stop Clock 
state. 


HALT State 


intel. 


The CPU enters the Stop Grant state through the 
following steps: when the CPU recognizes the 
STPCLK# request, it will stop the execution of the 
normal program on the next instruction boundary, 
stop the pre-fetch unit, empty all internal pipelines 
and write buffers, generate a Stop Grant bus cycle 
and then stop the internal clock. 


This state is exited when STPCLK# pin is pulled 
high. The rising edge of the STPCLK # will tell the 
CPU to return control to the interrupted program 
and start to execute the instruction following the 
interrupted instruction. 


© Stop Clock State 


The Stop Clock state is entered from the Stop Grant 
state by completely stopping the clock input 
(CLK2). In this state, the CPU consumes only 
~ 100 pA-200 pA of current. 


e HALT State 


When the HALT instruction is executed, the CPU 
will issue a normal HALT bus cycle. For 2X CPUs, 
there is no power savings in this state. 


e Stop Clock Snoop State 


Cache snooping is necessary during Stop Grant state 
in order to maintain memory coherency. When the 
system issues a request for cache snooping, the CPU 
will transparently enter the Stop Clock Snoop state 
and will power up for 1 full core clock to complete 
the cache snoop cycle. 


2.3.4 TRANSITION OF POWER STATES AND 
LATENCY ASSOCIATED WITH 2X MODE 


Figure 2 shows the state transitions of a 2X microproc- 


_essor. We shall describe how the transitions are made 


and how much latency is associated with each tran- 
sition. 


Stop Clock State 
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Figure 2. Power State Transitions for 2X CPUs 


2-488 


| 

intel. 

1. The Stop Grant state is entered when the 
STPCLK # interrupt is asserted to the CPU by the 
system. In this state, the speed of the external clock 
input (CLK2) can be maintained or changed. There 
is no latency associated with changing the CLK2 
frequencies. De-asserting STPCLK# returns the 
CPU to normal operation. The CPU will also return 


to its normal state when RESET or SRESET is as- 
serted. 


2. The Stop Clock state is entered from Stop Grant 
state when the clock input (CLK2) is stopped. The 
CPU must go through Stop Grant state to return to 
normal operation. There is no additional delay asso- 

- clated with returning to normal operation. 


3. This state is entered when the HALT instruction is 
executed. The HALT state consumes the same pow- 
er as the normal state. The HALT state is exited 
back to normal operation whenever any of the fol- 
lowing events happen: INTR, NMI, SMI#, RESET 
or SRESET. There is no latency associated with this 
state transition. 


4. When the CPU is in HALT state, the system can 
generate STPCLK # to the CPU to bring the CPU 
into Stop Grant state. When the system de-asserts 
the STPCLK # request, the CPU will return to the 
HALT state. There is no latency associated with this 
transition. HALT bus cycles will be launched when- 
ever this transition occurs. 


3.0 IMPLEMENTATION OF POWER 
MANAGEMENT FEATURES 


The means of implementing power management fea- 
tures depend on the specific chipset used. Most of the 
chipsets have both hardware and software power man- 
agement. There are always a number of dedicated or 
user-definable timers that monitor the activity of cer- 
tain device(s), such as CPU or peripheral components. 
In a software approach, the timeout of these timers can 
trigger a System Management Interrupt. Upon the de- 
tection of an SMI, the CPU will execute the power 
management SMI handlers in the BIOS which exercise 
device power control through detailed programming. 
For a hardware-based approach, the timers can auto- 
matically be enabled to perform power controls. If the 
status of the specific device or the entire system needs 
to be saved before changing its power state, the soft- 
ware approach must be used. In other words, if the 
original status of a device or the entire system is re- 
quired to return the system to its operational state be- 
fore the power state change, SMM must be invoked and 


power control will be accomplished by the SMI han-— 


dlers. This section summarizes several power control 
schemes that are common conceptually to all major 
chipsets and explains how they interact with the power 
management features offered by the SL Enhanced 
Intel486 CPUs. 
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3.1 CPU Power Control 


Controlling SL Enhanced Intel486 CPU power is 
achieved by reducing the CPU clock speed, stopping 
the CPU clock or shutting off the CPU power. 


All chipsets have the option of pre-programming the 
CPU speed regardless of the level of system activity 
there is and the CPU clock speed can be divided down 
by 2 to 64. Once the CPU is selected: to run in a re- 
duced speed mode and the CPU clock division is select- 
ed, the CPU will always run at a divided speed until the 
CPU is switched into some other mode of operation. 
Speed reduction is done by BIOS through program- 
ming certain register bits immediately after booting. 


The speed of the CPU can also be changed dynamically 
depending on the level of system activity. Most of the 
chipsets provide mechanisms of monitoring the level of 
system activity involving the CPU by detecting the tog- 
gling of certain CPU signal lines. The timers associated 
with these monitoring devices are responsible for deter- 
mining when to reduce the CPU speed by timing out a 
programmable time period. The chipset reduces the 
speed of the CPU clock by dividing its clock input to 
the CPU after STPCLK # is asserted. For 2X SL En- 
hanced Intel486 CPUs, the speed can be changed on- 
the-fly. For 1X SL Enhanced Intel486 CPUs, clock in- 
put has to be stabilized for 1 ms before de-asserting 
STPCLK #. Stopping the CPU clock is accomplished 
in the same fashion. 


The CPU clock control can be achieved either through 
hardware or software approach. If the status of the | 
CPU needs to be saved in order to return the system to 
the original state, CPU clock control should be done 
through the SMI handlers in SMM mode. 


Shutting off the power to CPU can only be done in 
suspend mode. 


3.2 Controlling Power of Peripheral 
Components : 


The power control of peripheral components is accom- 
plished through idle detectors and SMI generators. 


Idle detectors monitor the access to the following devic- 
es: Keyboard, Video RAM, Floppy Disk Drive, Hard 
Disk Drive, Serial Port and Parallel Port. Idle detectors 
can also monitor access to a programmable address 
range. Some chipsets even provide additional pins to 
monitor other user-definable miscellaneous activities. 
There are timers associated with each of the idle detec- 
tors with programmable idle time period and the timers. 
activate power control pins that are directly tied to the 
controlled devices. When a timer times out the pro- 
grammed period, it will activate the power control pin 
to shut down the power to the specific device. For ad- 
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dress range idle detectors, when there is no access to 
the monitored address range for a programmed timeout 
_ period, the timers will either reduce the clock speed or 
shut off the power of the device that has the same ad- 
dress bits. The idle detectors can operate outside of 
SMM mode and can be independent of CPU state. 


SMI event generators generate SMI requests through a 
number of dedicated or user-defined events. Depending 
on the specific chipset used, these events can be the 
activity timeout of individual devices or a collection of 
devices. Upon the timeout, an SMI request will be gen- 
erated to the CPU, which in turn invokes SMM. The 


SMI handlers in the SMRAM contain the software rou- _ 


tine that controls the power state of the device(s) initi- 
ating the SMI request. By executing this routine, the 
CPU will access the power management control regis- 
ters associated with the devices. After proper program- 
ming, these registers will activate the proper power 
- control pins to shut down the power to the proper de- 


vice(s). Controlling the power of peripheral devices © 


through SMI handlers offers complete flexibility to ei- 
ther manage the power individually or collectively. 
_ This is very important for optimum power conserva- 
tion. 
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Suspend state is the deepest level of power conserva- 
tion. There are two types of suspend: normal suspend 
and 0-volt suspend. In normal suspend, only the CPU, 
chipset and memory sub-system are powered. System — 
status is saved into SMRAM. The rest of the system is 
shut down. DRAM is refreshed with a very slow clock 
(64 KHz or 32 KHz). In OV suspend, the entire system, 
including the CPU, is shut down except the part of the 
system logic that is responsible for resume. The resume 
logic is always powered by an RTC battery. The system 
status is saved onto hard disk. Suspend is normally trig- 
gered by the suspend timer, and the timeout of the sus- 
pend timer is also programmable. 


3.3 Suspend and Resume 


Because of the amount of BIOS support required by 
Suspend, SMM must be invoked. Hardware alone can- 
not accomplish the task. 


4.0 SUMMARY 


SL Enhanced Intel486 CPUs provided the best power 
management features that Intel SL technology offers. 
Intel’s System Management Mode has become an in- 
dustry standard for power-saving computing. Through 
Intel’s SMM, the implementation of power manage- 
ment is very flexible, enabling the optimization of pow- 
er conservation for different system designs. The vari- 
ous CPU clock control options available on SL En- 
hanced Intel486 processors provide the basis for run- 
time power management with no performance impact. 
All major chipsets support the Intel power manage- 
ment scheme with easy-to-design software and hard- 
ware interfaces. 
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1.0 INTRODUCTION 


Today’s market for notebook computers demands desk- 
top performance in smaller and smaller form factors. 
Along with the higher performance comes greater pow- 
er consumption, which adds unique challenges for the 
mobile operating environment (battery operating time, 
thermal management, physical dimensions, etc.). In- 
cluded in this application note is a basic description of 
the thermal forces at work in mobile applications, with 
mathematical models that can be used to project system 
thermal parameters and aid the designer in worst-case 
design. 


The first section begins with a review of the basic ther- 
mal definitions which apply to notebook designs. Ther- 
mal data from a notebook experiment is presented to 
show a relationship between the temperature outside 
the notebook and the CPU case temperature. A model 
is given that helps the designer ensure the CPU thermal 
operating specifications are met. 


Next, several power consumption profiles are provided, 
starting with the assumed worst-case model, along with 
more conservative power profiles based on the degree of 
power management implementation. Based on these 
models, designers may forecast the amount of addition- 
al thermal margin their applications need. 


With these thermal models, power consumption pro- 
files, and test measurements, the designers will have the 
necessary tools and techniques to design for the worst 
case, and understand that by applying simple design 
enhancements, they can improve the quality of their 
designs. The designer should ensure the measured CPU 
case temperature (Tcasp) complies with the Tcasr 
specifications published in the SL Enhanced Intel486 
Microprocessor Data Sheet Addendum. 


2.0 THERMAL BACKGROUND 


2.1 Heat Transfer 


Designing high performance CPU notebook systems re- 
quires some knowledge of the three processes by which 
heat is transferred from one point to another, namely: 
conduction, convection, and radiation, which are de- 
scribed in the following three sections. This knowledge 
will help the designer understand the subsequent meth- 
ods of heat transfer and their value in maintaining the 
CPU within its specified Tcasp limits. The formulas 


NOTES: 
1. Physics, Second Edition, Paul A. Tiper. Worth Pub- 
lishers, Inc., 1982, p. 531. 


2. Physics, Second Edition, Paul A. Tiper. Worth Pub- 
lishers, Inc., 1982, p. 531. 


AP-498 


describing heat transfer by conduction, convection and 
radiation can be shown analogous to Ohm’s Law: 


af 
R. 


The thermal current, temperature difference and ther- 
mal resistance are analogous to electrical current (D), 
voltage (V), and electrical resistance (R), respectively. 


2.1.1 CONDUCTION 


Conduction is a process by which heat flows from a 
region of higher temperature to one of lower tempera- 
ture within a medium (solid, liquid, or gas) or between 
mediums in direct physical contact(!). In a one-dimen- 
sional system, conductive heat transfer is governed by 
the following relation: 


Conductive Ohm’s Law 
Heat Transfer 
AT V 
q = | — 
L/kA R 


where: 
q = Heat flow rate (W) 
k = Material thermal conductivity (W/m°C) 
A = Cross-sectional area (m2) 


AT 
ao = Temperature gradient (C/m) 


L = Distance of heat transfer 


In the preceding equations, the thermal current, q, can 
be viewed analogous to electrical current; AT analo- 
gous to voltage; and L/kA analogous to thermal resist- 
ance. To improve thermal conduction in a notebook 
environment, copper and other highly-thermal conduc- 
tive metals can be used in the package design. 


2.1.2 CONVECTION | 


Convection is a process of energy transport by the com- 
bined action of heat conduction, energy storage, and 
mixing motion). Convection is the predominant 
mechanism for transferring energy between a solid sur- 
face and a fluid. In the notebook environment, this is 
equivalent to the heat transfer between the case_surface 
and the ambient environment (air). The basic relation 
that describes heat transfer by convection from a sur- 
face to a fluid presumes a linear dependence on the 
difference between the temperature at the surface and 
deep in the fluid, and is referred to as Newtonian cool- 
ing: 


Convective Ohm’s Law 
Heat Transfer 
Ts —Ta V 
dc = — 
1/hcA R 
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where: 
qc = Convective heat flow rate from a surface to 
ambient (W) 
A = Surface area (m2) 
Ts = Surface temperature (°C) 
Ta = Ambient temperature (C) 


Average convective heat transfer coefficient 


(W/m2°C) 


me 
QO 
I 


In the preceding equations, the thermal current, qc, 
can be viewed analogous to electrical current; Ts — Ta 


analogous to voltage; and 1/hc A analogous to thermal — 


resistance. 


In forced convection, fluid flow is caused by an external 
factor such as a fan, while in free or natural convection, 
fluid motion is induced by density differences resulting 
from temperature gradients in the fluid (liquid or gas). 
Under the influence of gravity or other body forces, 
these density differences give rise to buoyancy forces 
that circulate the affected fluid and convect heat 
toward or away from surfaces wetted by the fluid. Al- 
though fans are often used to increase air convection 
inside desktop computers, they may not be a practical 
solution for notebook systems. 


2.1.3 RADIATION 


Thermal radiation is defined as racliant energy emitted 
by a medium by virtue of its temperature, without the 
aid of any intervening medium(3). The amount of heat 
transferred by radiation between two bodies at temper- 
atures T; and T) is governed by the following expres- 
sion: 


Radiative Ohm’s Law 
Heat Transfer 
1/eo R 
where: 


q = Amount of heat transferred by radiation (W) 
€ = Emissivity 0 < € < 1 
o = Stefan-Boltzmann constant, 
5.67 X 10-8 (W/m2 °K4) 
NOTE: 
3. Physics, Second Edition, Paul A. Tiper. Worth Pub- 
lishers, Inc., 1982, p. 535. 
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A = Area (m2) 
T,, Tz = Surface temperature (°K) 


For radiation to be an effective method of heat transfer, 
compared to natural or forced convection mechanisms, 
a relatively large temperature difference must exist be- 
tween T; and T>. For most low-power electronic appli- 
cations, these temperature differences are relatively 
small, and therefore, radiative effects are normally ne- 
glected. However, for high-power applications, heat 
transfer by radiation factors should be considered. Al- 
though heat radiation is a secondary thermal effect in a 
notebook system, some manufacturers are selecting 
coatings (1.e., black paint) for their notebook designs 
that are absorptive in the infrared to improve upon 
these thermal radiation effects. . 


2.2 Thermal Impedance 


Thermal management of an electronic system encom- 
passes all the thermal processes and technologies which 
must be used to remove and transport heat from indi- 
vidual components to the system thermal sink in a con-— 
trolled manner. The primary heat transfer processes 
(conduction, convection and radiation) can be com- 
bined into a single linear model (see Section 5.1). 


The junction-to-case (@yc) and junction-to-ambient 


(Oya) thermal resistance values are used as measures of 


IC package thermal performance. These parameters are 
defined by the following relations: 


a hy Tig 


0 
JC =) 


= Ty — Ta 
p 


OA 


Oya = 93c + OCA 


where: 
034 = Junction-to-ambient thermal resistance (C/W) 
@3c = Junction-to-case thermal resistance CC/W ) 
O8ca = Case-to-ambient thermal resistance (C/W) 


Ty = Average die temperature (°C) 

Tc = Case temperature at a predefined location (C) — 
P = Device power dissipation (W) 

Ta = Ambient temperature (C) 


Osc is a measure of package internal thermal resistance 
from silicon die to package exterior. This value is high- 
ly dependent upon packaging material, thermal con- 
ductivity, and package geometry. 9yq measures the 
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conductivity and convective thermal resistance from 
package exterior to the ambient, as well as package in- 
ternal thermal resistance. 634 values depend on materi- 
al, thermal conductivity, package geometry, and ambi- 
ent conditions such as flow rates and coolant physical 
properties. 


To improve CPU thermal characteristics in a notebook 
system, heat sinks are sometimes mounted on the top of 
the CPU using highly conductive adhesive materials. 
Adding a heat sink will not change 6c; however, it will 
improve heat conduction and convection due to the in- 
crease in surface area, resulting in a significant reduc- 
tion in the case-to-ambient and, therefore, junction-to- 
ambient thermal resistance. Depending on the product 
and materials used, 0yq4 can be reduced by 10 to 30 
percent. 


To guarantee component functionality and reliability, 


the maximum device operating temperature is defined 
and constrained by the package exterior temperature at 
a predefined location. The guidelines for ambient tem- 
perature specify that measurements should be taken at 
an undisturbed location at a certain distance away from 
the package—traditionally 12 inches horizontally from 
the center of the CPU. Measuring T, in the traditional 
manner is not possible in a notebook system. The case 
temperature, however, is measured at the center surface 
of the package. Depending on the ambient temperature 
and board power in the system’s environment, thermal 
enhancements such as heat fins or forced air cooling 
may be necessary to meet the case temperature require- 
ments. 


3.0 POWER MODELING 


3.1 Power Consumption Model 


In a linear model, power consumption is governed by 
the following equation: 


P = Voc X Icc 
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where: 
P = Power consumed by the component 
Vcc = Supply voltage 
Icc = Current through the component 


The preceding equation indicates that power consump- 
tion is linearly proportional with both supply voltage 
and current flowing through the component. For exam- 
ple, a 3.3V microprocessor will consume less power 
than a 5V microprocessor running the same application 
under equivalent operating conditions. 


A system’s total power consumption is defined as either 
the sum of the power consumed by each individual 
module within the system, or the sum of the products 
of each module’s voltage supply and its current. It is 
equivalent to: 


Psystem = Pcpu + Pmemory + Paispiay + ete 


3.2 Empirical Data 


Several CPU Icc measurements were taken using an 
SL Enhanced Intel486 CPU evaluation board with vari- 
ous CPU modules running under two different operat- 
ing environments: Windows 3.1 and IndeoT™ Video 
software (see Table 3-1). The modules used were SL 
Enhanced Intel486 DX-33 CPUs (1X SQFP, 1X PGA, 
and 2X SQFP), and SL Enhanced Intel486 DX2-50 
CPUs (1X PGA). All of the CPU modules use a chipset 
with Power Management software. 


When comparing the Icc measurements between sever- 
al application environments, the CPUs running Indeo 
Video software consume the most Icc current by ap- 
proximately 10%. Since the Icc value represents the 
number of gates switching inside the CPU, and hence, 
the intensity of the CPU working condition, it is there- 
fore concluded that running Indeo Video software will 
be close to the worst case for thermal measurement 
purposes. 


Table 3-1. Case Analysis of Power Consumption for SL Enhanced Intel486 CPUs 


vf 
3.5 


NOTES: 
1. All thermal values are measured at zero airflow. 


35 [250] 0270 | 0008| 0 
fox [so [a3 | pea | 15|17.0| 0491 [00s | 0 
[—foxe [so] 50 | rca | 15] 170] 0656 [00s | 0 
Pozea | N/A | 0 


2. Measurements taken on an SL Enhanced Intel486 CPU Evaluation Board (no heat spreader, no heat sink). 
3. All measurements were taken using one module of each microprocessor. 
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3.3. Typical System Power 
Consumption Profiles 


Table A-2 in the Appendix examines the power dissi- 
pated for four typical “‘system”’ profiles. These are sys- 


tems in the sense that the power used is assumed con- . 


trolled (except in the first case) by either the operating 
system or the system hardware, aside from the CPU. 


These cases, however, do not attempt to add the effects | 


of other power dissipating components that would exist 
in a complete PC notebook system. The first case gives 
the most conservative power calculation: the maximum 
' power that can be generated by the CPU. The second 
case gives the typical average power. The last two sug- 
gest power consumption possibilities that could occur 
in a given system, or even be guaranteed by power man- 
agement. Many similar combinations would also be rea- 


sonable. Vcc in each case assumed as the standard val- — 


ue (5.0V or 3.3V). 


In Case 1, the average power is calculated as the stan- 
dard Vcc (3.3V or 5V) times the maximum current, 
Icc(max), that can be drawn by a particular CPU. This 
calculation gives the maximum power that can be dissi- 
pated while continuously executing the most power 
consuming instruction sequence. This value should be 
used in a system design if no thermal power manage- 
ment is imposed, and the designer wants to minimize 


potential problems even under worst-case circum- 


stances: a conservative design. 


In Case 2, the average power is calculated as the stan- 
dard Vcc (3.3V or 5V) times the TYPICAL current, as 
_ specified in Intel486 microprocessor data books. This 
_ calculates the average heat from the CPU that would be 
dissipated over time while executing a typical mix of 
software. The designer could use this power value in a 
less conservative design. However, if the CPU case tem- 
perature approaches its maximum specified value and 
no thermal power management is applied, the Tc(max) 
specification could be exceeded. (Section 5.5 discusses 
the time dependency issues in averaging the thermal 
power generated while executing a mix of software.) 


In Case 3, the average power is calculated as the stan- 
dard Vcc (3.3V or 5V) times the Icc(max) for 10% of 
the time, and Icc(typical) for 90% of the time. This is a 
more conservative assumption than Case 2, allowing for 
some intervals in which the CPU runs at full power, 
and an overall thermal guardband over Case 2. 


Case 4 assumes that the current is distributed at the 
maximum for 10%, typical for 80%, and Stop Clock 
for 10% of the time. A mix of this sort is appropriate in 
a design where power management is applied to assert 


Stop Clock for at least 10% of the time. This mix could 


reduce the performance of the CPU. Suppose the sys- 
tem designer devises a theoretical mix of Icc(Max, 
- Typical and Stop Clock) which is exceeded by the sys- 
tem only 1% of the time when running all standard 
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benchmarks and applications. The designer builds the 
system to that specification, with thermal power man- 
agement responding only when the limit is exceeded. 
Then one can safely design the system assuming a sig- 
nificantly lower power than the absolute maximum, 


and experience performance degradation only 1% of 
the time. 


The cases above illustrate that many less than worst- 
case power profiles are possible, depending on the soft- 
ware being run, and power management options being 
used. Intel recommends that systems be tested for ther- 
mal problems under the worst case power usage that 
the customer could contrive, and in an ambient temper- 
ature equal to the maximum specified for the design. 


4.0 6), BASED ON EXTERNAL Ta 


This section provides some experimental thermal data 
which will help the designer better understand some of 
the system level issues affecting the thermal perform- 
ance of a notebook. Section 4.1 describes in one experi- 
ment how test chamber 07, can be used as an approxi- 
mate thermal performance criteria in the early stages of 
a notebook design. Section 4.2 shows how in another 


experiment Oy, is affected by CPU location on the 


motherboard inside the notebook. Section 4.3 presents 
a model showing the relationship between power gener- 
ated by other components on the motherboard and the 
CPU’s 6y,4 requirement. | 


4.1 Measurements from Commercial 
Notebooks 


Since component location differs among notebook de- 
signs causing internal power densities to vary between 
notebooks there is no one place inside a notebook - 
where T, can be defined in order to obtain an accurate 
system. thermal profile. This section presents experi- 
mental thermal data collected for four different 
Intel386 SL CPU notebooks running Indeo Video 
software and shows—as a rough estimate—that the 
ambient temperature (T,) outside a notebook can be 
used with thermal resistance (8;,) to project CPU tem- 
perture (Tj, Tc). 


In this notebook experiment, the CPU case temperature 
of four different Intel386 SL CPU notebooks (PQFP 
packages) were measured using K-type thermocouples 


_a digital multimeter, and an Intel386 DX CPU-based 


system with data acquisition software. To simulate 
maximum Icc consumption, each notebook continu- 
ously executed Indeo Video software for the duration of 


the experiment. Table 4-1 shows the junction tempera- 


ture and @y,a calculated by using the thermal imped- 
ance equations from Section 2.2. The CPU case temper- 
ature, the maximum power consumption of 2.5W 
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running Indeo Video software, and the test chamber 
Osc of 6°C/W as specified in the Intel386™ SL Micro- 
processor Data Book for the 196L PQFP were used to 
calculate the junction temperature of each CPU. With 
this calculated CPU Ty and measured ambient room 
temperature of 25°C, the corresponding notebook’s 6y,4 
was obtained. 6s, can be calculated by combining the 
first two equations in Section 2.2 as follows: 


ictal? 
05a = Ojo + age © 


Although the test chamber 07, for the 196L PQFP of 
23°C/W was collected with only the CPU present on a 
test board, the value obtained approximates Oy, of a 
notebook operating in an environment of 25°C. One 
possible explanation is that the conductive and radia- 
tive effects the other components have on the CPU in- 
side the operating notebook, such as the PC board, con- 
nectors, floppy disk, shielding and plastic enclosure, ac- 
tually cause the temperature inside the notebook to be 
lower than expected. The end result is a lower Oya for 
the CPU inside the operating notebook than the Oy, of 
a lone CPU inside a test chamber where the only con- 
ductive and radiative path is to the surrounding air, as 
shown in Notebooks #1 and #3. Table 4-1 shows the 
large variation in notebook 07, caused by different sys- 
tem designs. Thus, the test chamber 03, can be used as 
a rough guideline in the early stages of a notebook de- 
sign. For a more in-depth analysis of the conditions 
inside an operating notebook, see Section 4.3. For the 
final design, the thermal performance of the system 
should always be verified by measuring the CPU case 
temperature. 


| # 
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4.2 6 j, and 0jc Measurements in an . 
Actual Environment 


A test motherboard with the same form factor as that 
of the original Test Notebook #4 was fabricated in or- 
der to take experimental measurements of Oya and 0y3¢ 
(see Figure 4-1). The only differences between the two 
boards are the following: 


1. The test board had two slots instead of the 70/80 
pin connectors on the motherboard. 


2. The test board only had six thermal test packages 
mounted on it (three on the component side and 
three on the solder side). 


Measurements from the 6 thermal test units yielded a 
Osa range of 20°C/W-—25°C/W in the test chamber 
and 22°C/W-28°C/W in the Test Notebook and a 0yc 
range of 3°C/W-—5°C/W in both the test chamber and 
the Test Notebook (see Table 4-2). 


Table 4-1. 6y, Calculations for Intel386 SL CPU 
Notebooks Running Indeo™ Video Software 


oa ae eso 
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‘Thermal Test Board used in Test Notebook with 6 Thermal Samples 
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Figure 4-1. Test Notebook Boards 


Table 4-2. Thermal Resistance 6), and 9jc 
for the SQFP Package 


Test Motherboard | Test Motherboard 
in Test Chamber in Test Notebook 
(°C/W) 


NOTES: | 

1. Test Notebook #4 was used to collect the experimental 
data. 

2. 208L SQFP test package with heat spreader containing 
thermal test die was used in all experiments to vary and 
measure the power going into package as well as to mea- 
sure the temperature of the die. 

3. All measurements were made with zero airflow, simulat- 
ing a typical notebook environment. Ambient temperature 
is defined as ambient temperature outside the notebook. 


The worst CPU thermal location was on the center of 


the solder side with 03,4 = 28°C/W. The measured 
thermal resistance at this location was unfavorable be- 
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cause of the reduced CPU board area surrounding the 
CPU (due to the two slots) and the reduced convection 
cooling on the bottom side of the board. The best CPU 
thermal location was on the component side at one end 
of the test board with the most PCB area surrounding 
the package with 03,4 = 22°C/W. This shows how 
CPU location and system layout can impact the overall 
thermal performance of a notebook. 


The 6y,4 numbers should only be used as a first order | 
estimate in preliminary notebook designs. Since the lo- 
cation of a CPU inside a notebook impacts thermal 
performance, Ty should always be verified in the final 
design by 9jc and the CPU case temperature. 


4.3 System Impact on CPU 6 ja 


As notebooks evolve into smaller form factors with 
higher component density and smaller PCB sizes, the 
increasing power density inside the notebook has a 
large effect on CPU temperature. The power dissipated 
by components other than the CPU, and the layout of 
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the components, as well as the thermal-mechanical 
characteristics of the enclosure must be taken into ac- 
count in order to ensure a CPU junction temperature 
within specifications. In one model, such effects are ap- 


proximated by the introduction of the factors R and Py 
to the thermal impedence equations in Section 2.2: 


Ty -Ta—PpxXR 
CPU 

R is defined as the thermal coupling factor between the 
CPU and the other components on the PCB. This fac- 
tor takes into account the effects of the power dissipa- 
tion of the other components on the CPU case and 
junction temperatures. P; is defined as PCB power dis- 
sipation. This P, value is the power dissipated by all 
components (except the CPU) inside the notebook. 


A detailed discussion of how R and Py are used to 
calculate Thermal Headroom (thermal margin) for a 
Test Notebook is given in Section 5.3. Figure 4-2 gives 
a graphical representation of the effects of PCB power 
on the 0y,4 requirements. The horizontal line represents 
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the current method of a fixed Oy, requirement of 
23°C/W for this package, over the entire range of PCB 
power dissipated, Pp. The second line represents the 
model used with the factors P}, and R which takes into 
account the temperature rise inside the notebook and is 
obtained by substituting the example values Ty = 
100°C, Ta = 30°C, Pcpy = 2W, and R = 3.9 into 
the preceding equation. This line shows as the board 
power (Pi) increases, the thermal margin inside the 
notebook becomes smaller due to the higher tempera- 
ture environment. In both models, 65,4 must fall below 
the line to ensure a junction temperature below specifi- 
cation for the given conditions. For the Test Notebook 
#4 the cross-over point between the two lines is 6W. 
Beyond this point, the package thermal resistance of 
23°C/W is too high and thermal enhancements are nec- 
essary to reduce that resistance. In summary, this ex- 
ample shows that for the current method, a 0ya value 
obtained from the test chamber leaves margin for P, 
less than 6W. However, for P, greater than 6W, the 
junction temperature will be exceeded. Again, it is em- 
phasized that the designer should always perform a 
thorough system thermal analysis to ensure the speci- 
fied Tcasp (max) is not exceeded. 


Current Model 


R*Pb Model 
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Figure 4-2. Determining Maximum Thermal Resistance (6),) for a Given Amount of Power 
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5.0 CALCULATING THERMAL 
HEADROOM 


Thermal Headroom is the temperature margin between 
the calculated T 4 (max) and the T,, measured outside a 
given system, with a particular CPU and power. This 
section shows how to calculate Thermal Headroom and 
use it.as simple model for a system’s thermal properties. 
Then a term accounting for board power is added to the 
model, and the experimental measurements needed to 
implement this more sophisticated version are de- 
scribed. Finally, the significance of two secondary ef- 
fects is analyzed. | 


5.1 Using Thermal Headroom Graphs 


Figures A-1 and A-2 (in Appendix A) plot the calculat- 
ed T,(max) vs the power being used by the CPU and 
are intended to facilitate quick determination of ther- 
mal headroom. The lines on the graphs indicate the 
(estimated) maximum allowed arnbient temperature 
(T, in degrees Celsius) as a function of power dissipat- 
ed (P in Watts). Maintaining T, below or equal to 
~ Ta(max) indicates that the requirecl Tc(max) is proba- 
bly not exceeded. The graph lines are generated from 
the formula: 


Ta(max) = Tco(max) — @ca xX P 


9ca is the thermal resistance to heat flow between the 
CPU case and the ambient environment, as specified in 
the Intel Packaging Handbook. As discussed in Section 
4, experiments show that these parameters are approxi- 
mately the same for a notebook PC with Ta measured 
in open air outside the notebook case. The tendency of 
the notebook case to increase 9c,a by adding extra lay- 
ers of insulation is approximately offset by its action as 
a heat spreader, since it is thermally connected to the 
CPU board. Different types of CPU packages have dif- 
ferent 9c, values, and thus generate different lines on 
the graphs. For example, Figure A-1 shows the SQFP, 
PGA and PQFP packages for the SL Enhanced In- 
tel486 SX CPU and Figure A-2 shows the SQFP and 
PGA packages for the SL Enhanced Intel486 DX2 
CPU. 


To determine thermal headroom for a given CPU, first 
determine the correct line for the CPU type. (Some of 
the CPUs are marked on the graphs. For a CPU type 
that is not, find its 9ca in Table A-1, and match it toa 
CPU type that is marked on a graph line. Or use 0c4 
as the slope, and Tc = 85°C as the temperature axis 
intercept to match directly to a graph line.) 


Second, determine the power at which the CPU will 
operate. Various average power use scenarios could be 
appropriate for a given design; four of them are detailed 
in Table A-2 for each different CPU (discussed in detail 
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in Section 3.3). One can also calculate a custom power 
usage profile for one’s system using P = Icc X Vcc, 
and Table A-1, which gives Icc under 3 different con- 
ditions (Active, Stop Grant and Stop Clock). 


Third, draw an ordinate (vertical line) at the power 
value (determined above) to intersect the appropriate 
line for the CPU package type chosen. Draw an abscis- 
sa (horizontal line) from the intercept to the Ta axis, 
obtaining the maximum recommended ambient temper- 
ature for this system. If this Ta(max) is greater than 
what is measured in the air outside the actual system, 
the design has a positive thermal headroom of Ta(max) 
— T,a(measured), as long as the effect of “board pow- 


- er’, Pp, is neglected. If however the actual system is 


exceeding this T, (max), the thermal headroom is nega- 
tive even before considering Pp, and thus the thermal 
properties of the design will need improvement (‘‘ther- 
mal mitigation”’), (See Section 4 for Py definition, and 
Section 5.3 for more information about Thermal Head- 
room). 


There are numerous techniques for thermal mitigation, 
or improving the thermal properties of a design (i.e., a 
lower voltage version of the CPU or a CPU package 
with a lower 0c, could be used). Depending on pack- 
age size and material, various 9c, values can be ob- 
tained. Various passive and active thermal management 
strategies are discussed in Section 6. 


5.2 Example of Graph Use 


Consider the 5V PQFP SL Enhanced Intel486 SX-33 
CPU, which is likely to have special thermal needs be- 
cause of its power requirements. The graph line for it is 
indicated by the label (33 MHz PQFP 5V) on the 
Intel486 SX CPU graph, or by the fact that its slope 
value from Table A-1 is 17.0 °C/W. The power shown 
for Figure A-1 is for Case 1: Vcc = 5.0V; Icc = 
0.685 mA (from Table A-1); Active Max for 100% of 
the time; P = 3.43W (from Table A-2, or calculation). 


The ordinate is drawn from the P axis at 3.43W to 
intersect the intermediate slope line, and the abscissa 
from that point intersects the Temp. axis at about 27°C. 
If we assume 40°C is the lowest T, that can be readily 
achieved, we get a negative thermal headroom of 13°C. 
Designing a portable PC with this CPU clearly will 
require thermal mitigation. 


5.3 Adjusting Thermal Headroom for 
Board Power 


Experiments have shown that the term R X Pp pro- 
vides a good way to model the effects on the CPU Tc 
due to other heat sources inside a notebook. (Here R is 
an experimentally determined thermal coupling coeffi- 


a *. 
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cient, and Pp is the board power, as defined in Section 


4.). The term adds to Tg, or reduces the Ta(max) that 
is required to ensure that Tc does not exceed Tc¢(max): 


Ta(max) = Tco(max) — @ca X Popy — R X Pp 


To calculate thermal headroom adjusted for the effect 
of P,, subtract R x P, from the thermal headroom 
calculated as above. This of course makes the head- 
room smaller (worse), but by how much? This depends 
on the size of P,, but also on R, which is highly depen- 
dent on the particular design. In theory, the smallest 
value possible for R is zero: no thermal coupling be- 
tween the CPU and other heat sources inside the note- 
book. From testing one system, the range measured ex- 
perimentally (with no effort made to thermally isolate 
the CPU) has been 3.9 to 4.9. Values closer to zero are 
obtalned by positioning the other high-power devices 
away from the CPU, and thermally grounding them to 
the outer case. (Section 5.4 describes how to measure R 
for a given system.) 


5.4 Experimental Measurements are 
Essential 


The 6c, values given by Intel can be used as a rough 
“rule of thumb” to estimate likely thermal margins. If 
the thermal headroom calculated from the graphs is 
positive for a given design even after ccrrecting for the 
board power (R = 4 would be conservative for a real 
notebook design), the design is most likely satisfactory. 
But even then, Intel recommends that the CPU Tc be 
measured when the complete design can be run at full 
power, with Ta at the maximum allowed by the design- 
er’s specifications, to be really sure that Tc(max) will 
never be exceeded. If a conservative calculation of ther- 
mal headroom (including R xX P,) is negative, it is 
essential that the system be tested, and improvements 
in thermal mitigation be made until Tc(max) is never 
exceeded. 


There are several levels of thermal experiments that can 
be used. The simplest is to just measure the CPU Tc, 
and make adjustments in the design until it never ex- 
ceeds Tc(max). Then the equations and graphs can be 
ignored; if the design meets the Tc(max) specification, 
it does not matter if the (estimated) T,(max) is violat- 
ed, as far as Intel’s CPU is concerned. (Consideration 
should be given, however, to other components, such as 
a disk drive, that might have trouble due to high tem- 
peratures inside the notebook.) 


Suppose, however, the design is expected to be used for 
several variations over time, i.e., an Intel486 DX CPU 
now, and an Intel486 DX2 CPU later, with perhaps 
higher power peripherals which can also increase Py in 
later versions. These later versions with more power 
will likely require more thermal mitigation efforts, but 
simply measuring Tc in the first version of the note- 
book will give little guidance about how much more 
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thermal mitigation will be needed later. In this case, 
more detailed experiments on the first version, in order 
to build an accurate thermal model of the product line, 
can be very cost effective. This can be done using the 
equation from Section 5.3 and solving for Tc: 


To = Ta + 8ca X Popy + R X Pp 


In the preceding equation, T, is the actual air tempera- 
ture outside the notebook during the experiment; Pcpy 
is held fixed, and P,, is varied while the resulting Tc is 
measured. (Ta + @9ca X Pcpy) is the intercept of the 
resulting straight line, and R is the slope. The easiest 
way to measure R is to disconnect the CPU (so Pcpy 
= 0) and measure Tc with Vcc at the upper and lower 
limits of its range (1.e., 4.5V and 5.5V). Py is obtained 
for each Vcc value by Vcc X Icc. A third data point 
requires no measurement; when PL, = 0, Tc = Ta. 


When this semi-empirical model has been constructed 
for a given notebook design, it can be used to accurately 
determine thermal headroom for variations in both 
Pcpyu (plugging in a higher frequency CPU) and Pp, 
(adding higher power peripherals). Of course, if the 
thermal mechanical design is subsequently modified, R 
should be measured again for the new version. 


If the designers expect the P, to roughly track Pcpy, 
and R is small (as it should be in a good design), an 
approximation to the above model may make measure- 
ments easier: Assume Py = C X Pcpy, where C 1s the 
coefficient relating board power to CPU power. The 
preceding equation for Tc then becomes: 


To = Ta + Oca X Popu + R X Pp 
= Ta + 0ca X Popy + R X C X Pecpy 
Ta + (0c, + R X C) Popy 


Then (@cq + R X C) becomes a new coefficient, say 
8c’, which can be measured by varying Pcp and Py 
together by varying Vcc over its functional range. 


5.5 Secondary Effects 


Two kinds of secondary effects will be evaluated. The 
main model used in thermal analysis assumes a linear 
relationship between the temperature gradient and the 
rate of heat transfer, and also assumes a steady state 
(time independent) model. How valid are these assump- 
tions? 


Heat transfer by conduction is governed by a linear 
relationship between the temperature difference and the 
rate of heat transfer, and heat transfer by convection 
can be approximated by a linear relationship, as de- 
scribed in Section 2.1. However, heat transfer by radia- 
tion is proportional to the fourth power of the absolute 
temperatures involved. To demonstrate the contribu- 
tion that heat transfer by radiation makes to cooling the 
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CPU, consider the largest allowed Tc (85°C) and the 
smallest Ta that most designs would find acceptable 
(40°C). One of the larger CPU packages is 4.4 cm 
square. Assume the largest emissivity (let € = 1). The 
temperatures must be converted to degrees Kelvin by 
adding 273°. Using the formula from Section 2.1, q, the 
radiative heat transfer in Watts is: 


q = €oA (Tc4 — Ta4) = 1 * (5.67 10-8 W/(m2K4)) 
(0.044m)2 ((358 K)4 — (313 K<)4) = 0.75W 


To determine the significance of heat transfer by radia- 
tion in this case, one compares the 0.75 W just calculat- 


ed to the total heat transfer predicted by the linear ap- 


proximation using the experimentally determined O0c,. 
By rearranging the equation 


Tc -T 
Oca = a for P, one obtains 


i ees i 
p=-¢ A 
OCA 


Tc—-Ta = 45°C in this case, and Oca _ ranges 
15.5°C/W to 32.0°C/W for the CPUs covered in this 


article. These figures give a total heat power dissipation _ 


ranging from 1.41W to 2.90W. Thus, the radiative 
component varies between one fourth to one half of the 
total. This explains why the addition of black paint on a 
notebook case improved O@jq measurably (see 
Section 6). , 


The experimental determination of 0c, effectively in- 
corporates the radiative component, along with the 
conductive and convective components, in a combined 
linear approximation, which will be accurate for tem- 
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peratures near the values used for the measurement. 


The effect of the radiative, nonlinear component will be 
beneficial in that the actual power radiated away from 
the CPU, when temperatures are higher than those 
used in the 9cq measurements, will be greater than 
predicted by the linear model. This means that Tc will 
not increase as much as predicted by the model, for a 
given increase in power. 


_ The time independent assumption is fine if one is con- 


tent to build a system to tolerate Case 1 maximum pow- 
er, and with only passive thermal management (i.e., 
heat sinks and heat spreaders). However, if one assumes 
some. power averaging over a typical mix of software, as 
in Cases 2, 3 and 4, one must consider the time depen- 
dent effects of bursts of maximum power, alternating 
with lower power periods. Also, if the design uses ac- 
tive thermal management, especially in a closed loop 
design with the system responding to a temperature — 
sensor, the lag time between temperature sensing and 
response must be considered. 


The experimental measurements taken on an Intel386 
SL CPU notebook show how Tcasg varies with time at 
different CPU power levels, and allow an approximate 
determination of thermal time constants (see Figure 
5-1). The time constant (approximately 3 minutes for 
this notebook) is defined here as the time elapsed from 
when power was switched from Full On to Standby, to 
when the temperature has declined toward its Standby 
asymptote by 1/e. This means that it is reasonable to 
average the CPU power over approximately one minute 
when calculating average power generated by a mix of 
software. This is a large interval in CPU cycles (billions 
of CPU clocks). 


4000 5000 6000 7000 8000 
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Figure 5-1. Tcasg of Intel386 SL CPU (PQFP) Running Indeo™ Video Software 
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This time constant also indicates a significant lag be- 
tween a temperature sensor reaching some action value 
(the action could be turning off the CPU clock for an 
interval), and a temperature response to that action. 
This means that the setpoint temperature (the tempera- 
ture that triggers power reduction) should be somewhat 
below Tc(max). Note also that the temperature does 
not come to equilibrium until about an hour after a 
major power change. This shows that when testing Tc 
to assure that it does not exceed Tc(max), one should 
run the notebook under worst case conditions continu- 
ally for at least an hour. 


6.0 IMPROVING THERMAL 
HEADROOM 


By using the thermal management theories that have 
been reviewed here and keeping in mind the notebook 
platform limitation, the designer can apply proven ther- 
mal management techniques in several areas, including 
increasing thermal conduction and convection, opti- 
mum system layout, and power management tech- 
niques. 


6.1 Improving Thermal Convection, 
Conduction and Radiation 


The most obvious method for improving thermal con- 
vection is by adding a fan to circulate the air. Unfortu- 
nately, fans are a compromise in mobile designs because 
of extra power and space requirements, and electrome- 
chanical noise. Intel’s Thermal/Mechanical Tools and 
Analysis Group has collected data using more realistic 
techniques for reducing Tcasp. Data was collected 
from an actual Intel386 SL microprocessor notebook 
computer modified to measure Tcasp. The case tem- 
perature was measured by applying 3W to the CPU 
with all other devices/components off. 
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6.1.1 BLACK PAINT 


The inside of the notebook case was painted Flat black 
using a paint that is highly absorptive in the infra-red as 
well as visible. Figure 6-1 shows that Tcasp was im- 
proved by 2.3°C, or 3%. Table 6-1 shows that heat 
transfer improvement by radiation reduces @0j,q by 
0.5°C/W. The remaining experiments were performed 
with the inside of the notebook case painted black. 


6.1.2 COPPER FOIL 


A copper foil (1” x 3” x 1.5 mil) was attached from the 
CPU to the bottom of the keyboard (constructed of 
aluminum material) and then to the plastic case using 
thermal grease. Figure 6-1 shows a 4.8°C (6%) and a 
2.0°C (3%) improvement when the foil is connected 
from the CPU to the underside of keyboard and from 
the CPU to the case, respectively. After connecting the 
CPU to the bottom of the keyboard, 03, improved be- 
cause of the higher thermal dispersion by the aluminum 
plate. A thicker copper foil (1” x 3” x 10 mil) was then 
connected from the CPU to the keyboard bottom which 
yielded an improvement of 11.4°C or 15%. 


6.1.3 PERFLUOROCARBON FLUID AND 
SILICONE ELASTOMERS 


Liquid heat sinks containing perfluorocarbon fluid can 
offer a reasonable substitute for standard heat sinks. 
The heat transfer coefficient for natural convection in a 
perfluorocarbon fluid is greater than that of natural air 
convection. Measurements were taken using a perfluo- 
rocarbon liquid heat sink connected between the CPU 
and the plastic case. Figure 6-2 shows a 7.5°C (10%) 
improvement in temperature. 


Table 6-1. Thermal Enhancements to 0ja 


1.5 mil 
Copper Foil 
to Case 


sarc] os | oe [| te | 38 


1.5 mil 10 mil 
Copper Foil | Copper Foil 
to Keyboard | to Keyboard 
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Case Temp (°C) 


Enhancements 


Notebook Temperatures with CPU Power = 3.0W (all others off) 


Without Case Painted Black Black Case & 1.5 mil Black Case & 1.5 mil 
CU Foil to Case 


Black Case & 10 mil 


CU Foil to Keyboard CU Foil to Keyboard 
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Figure 6-1. Black Paint and Copper Foil Thermal Enhancements | 


Case Temp (°C) 
~ 
a 


Without Enhancements 


Notebook Temperatures with CPU = 3.0W (ail others off) 


Case Painted Black 


Perflurocarbon 
to Black Case 


Silicone Elastomer 
to Black Case 


241812-9 


Figure 6-2. Silicone Elastomer and Perfluorocarbon Thermal Enhancements 


A type of heat sink that helps blanket uneven surfaces 
is the silicone elastomers heat sink. These soft materials 
fill air gaps between hot components and the metal 
chassis. A piece of silicone elastomer was cut out to the 
same size as the CPU and placed between the CPU and 
the case, yielding a 1.5°C (2%) improvement (see Fig- 
ure 6-2). 


6.2 Optimizing System Layout 


Power must be optimally distributed to ensure the low- 
est Ta. The objective is to reduce power density within 
the system to avoid hot spots at any particular device. 
Keeping the CPU away from batteries and power sup- 
plies is one challenge to the system designer. Thermally 
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connecting the CPU case to the PC case can increase 
thermal area thus greatly improving thermal spreading. 
There are many opportunities for creativity in transfer- 
ring heat away from the CPU. 


6.3 Effective System Power 
Management 


One technique which has become a standard in note- 
book designs is using Intel’s System Management Mode 
(SMM) to effectively monitor system activity and shut 
off devices to slow or control clocks when low activity 
is detected. Another approach could be to monitor the 
CPU activity or temperature and slow or stop the CPU 
clock when a long period of system inactivity or a high 
temperature is detected. 
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7.0 CONCLUSION 


System notebook designers need to pay close attention 
to the special thermal requirements for future notebook 
designs. These include demands for increasing comput- 
ing power, and thus power consumption, in decreasing 
sizes. With proper thermal design, the CPU can be kept 
below its stated maximum case temperature, even un- 
der worst case conditions. 


Experiments have shown that 0c, as measured in open 
air in factory tests can be used for a rough estimate for 
an actual notebook PC design, where T, is the air tem- 
perature outside the notebook. Using this 0c, to esti- 
mate thermal headroom provides the simplest model. 
Experiments have also shown that adding a term to the 
thermal equation that includes the board power (Pp), 
with a coupling coefficient (R), provides a better model. 
This enhanced model takes into account the effect of 
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other heat sources inside the notebook on the CPU Tc, 
and allows accurate prediction of the thermal effects of 
upgrades in a notebook design (adding a higher power 
CPU and/or peripherals on the board). Also, a number 
of suggestions have been made about how to improve 
the thermal characteristics of a design, by various pas- 
sive and active techniques. 


The information within this application note allows the 
designer to use initially a simple analytical model, as 
well as to build a semi-empirical model to correlate 
with the actual design. Techniques have been shown 
not only to increase the average thermal margin, but to 
eliminate thermal margin problems under worst case 
operating conditions. By designing for the worst case, 
and taking actual measurements to guarantee proper 
operation within spec limits, the notebook designer can 
provide the highest quality, most reliable products for 
their customers. 
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Table A-1. Power Consumption and Thermal Specifications for SL Enhanced intel486 CPUs 


OJA 
(calc) 


+ 0.30 | 0.250 0.040 | 0.0001 | 0.001 | 4.0 | 36.0} 32.0 
+ 0.30 | 0.300 0.050 | 0.0001 | 0.001 | 4.0 | 36.0} 32.0 
+ 0.25 | 0.430 0.065 | 0.0002/| 0.002] 1.5; 17.0] 15.5 
+ 0.25 | 0:430 0.065 | 0.0002} 0.002| 3.5 | 20.5} 17.0 
+ 0.25 | 0.590 

| + 0.25 | 0.590 


+ 0.30 | 0.330 0.050 | 0.0001 | 0.001} 3.5} 25.0} 21.5 
+0.25 | 0.500} 0. 0.080} 0.0002} 0.002] 1.5) 17.0} 15.5 


+ 0.25 | 0.500 0.080 | 0.0002} 0.002] 3.5 | 20.5 
+0.25 | 0.775 0.100} 0.0002} 0.002) 1.5)17.0} 15.5 


+ 0.30 | 0.375 0.040 | 0.0001 | 0.001} 3.5] 24.0} 20.5 
+ 0.30 | 0.460 0.065 | 0.0001 | 0.001] 3.5 | 24.0} 20.5 
+0.25}0.775 0.050 | 0.0002/ 0.002} 1.5) 17.0) 15.5 
+0.25|0.975 0.090 | 0.0002} 0.002] 1.5; 17.0} 15.5 


+ 0.30 | 0.250 0.0001 | 0.001| 4.0 | 36.0} 32.0 
+ 0.30 | 0.330 N/A | NVA 0.0001 | 0.001} 4.0 | 36.0} 32.0 
+0.25 | 0.430 0.0002 | 0.002| 3.5 | 20.5] 17.0 
+ 0.25 | 0.590 0.0002 | 0.002] 3.5 | 20.5; 17.0 


+ 0.30 | 0.330 | 0.415 N/A | N/A 0.0001 | 0.001} 3.5 | 25.0) 21.5 
+ 0.25 | 0.500 | 0.630 0.0002 | 0.002); 3.5 | 20.5; 17.0 
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Table A-2. Thermal Headroom based on Typical Power Consumption Profiles of SL Enhanced Intel486 CPUs a 


Tcaset | case? ——S«d|SSCases | asesdC 


Icc | Power | Thermal | Icc | Power | Thermal| Icc | Power | Thermal! Icc | Power | Thermal 
AVG| AVG | Headrm | AVG| AVG | Headrm | AVG| AVG | Headrm | AVG| AVG | Headrm 


0.32 | 1.04 
0.42 | 1.37 
: 0.56 | 2.80 
: 33 85 
2X DX | 3.3 33 | SQFP | 85 
5.0 33 | PQFP | 85 
NOTES: 
Case temperature specifications assume a heat spreader and no heat sink. 
CASE 1: Icc Active (max) = 100% 


100% 
10% 
90% 


CASE 2: = Icc Active (typ) 

CASE 3: = Icc Active (max) 
Icc Active (typ) 

CASE 4: _Icc Active (max) 10% 
Icc Active (typ) 80% 
Icc Stop Clock (max) = 10% 


Hoi wt wu 


86P-dV 
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Figure A-1. Maximum Thermal Headroom for SL Enhanced intel486 SX CPUs 
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Figure A-2. Maximum Thermal Headroom for SL Enhanced Intel486 DX2 CPUs 
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| APPENDIX B 
THERMAL ENHANCEMENT VENDORS 


Chomerics, Inc. 


77 Dragon Court 

Woburn, MA 01888-4014 

(617) 935-4850 

FAX: (617) 933-4318 

Product ID: CHO-THERM A274 
(Silicon Elastomer) 


3M Corporation 

Building 223-6S-04 

3M Center 

St. Paul, MN 55144-1000 

(612) 733-3735 or (800) 833-5045 

Product ID: Fluorinert Liquid FC-77 
(Perfluorocarbon Fluid) 
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1.0 INTRODUCTION 


The SL Enhanced Intel486 microprocessors contain 
new features to enable simple, economical, and robust 
power management. The Energy Star program and a 
general world wide trend towards responsible energy 
consumption have generated interest in providing com- 
puter equipment which can enter a low power state 
when not being used. The SL Enhanced Intel486 micro- 
processors enable a simple design which is capable of 
power managing both the CPU itself, and the system as 
a whole. | 


The SL Enhanced Intel486 microprocessors can be 
placed in a low power state through software or hard- 
ware. Execution of the HALT instruction will cause the 
CPU to automatically enter a ~20 mA-55 mA state 
called the Auto HALT Power Down state. The CPU 
will issue a normal HALT bus cycle when entering this 
state.The CPU will transition to the Normal state on 
the occurrence of INTR, NMI, SMI#, RESET, or 
SRESET. 


The STPCLK*# interrupt allows system hardware to 
control the power consumption of the CPU by stopping 


STPCLK4 


Internal 
CPU CLK 


STOP CLK TIMER 
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the internal clock (output of the PLL) to the CPU core 
in a controlled manner. When STPCLK # is asserted 
the SL Enhanced Intel486 CPU enters a low power . 
state, the Stop Grant state, of approximately 20 mA to 
55 mA at the conclusion of the current bus cycle. Deas- 
serting STPCLK # will enable the processor to resume 
functioning on the next CLK cycle. Periodically assert- 
ing and deasserting STPCLK # can result in significant 
power savings while still keeping a base level of CPU 
activity to ensure that interrupts are not missed, time of 
day lost, network connections dropped, etc. The pro- 
cess of rapidly asserting and deasserting STPCLK # to 
provide power management while maintaining a re- 
duced level of system activity is referred to as clock 
throttling (see Figure 1). Clock throttling can be imple- 
mented with a variety of periods and duty cycles, or 
with an event driven period such as deassertion on 
INTR. 


It has been suggested that clock throttling may cause 
problems and performance degradation with networked 
systems. This paper will discuss the power savings pos- 
sible with clock throttling, and results of network test- 
ing. Included also is an overview of the power require- 
ments of typical LAN cards. 


Clock Clock Clock 
ee EN uiainay 
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Figure 1. Clock Throttling 
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2.0 TEST HARDWARE 


All testing was done with the use of a daughter card 
which was inserted into an Intel436 CPU PGA socket 
on a standard Intel486 CPU motherboard. This daugh- 
ter card consisted of a socket for an SL Enhanced In- 
tel486™ CPU, an input for a signal from a pulse gener- 
ator from which STPCLK# was derived, and two 
PLDs to perform the following functions. nneate 2 
shows the hardware setup. 


e STPCLK # specification requires that if STPCLK # 
is asserted, it must be held at least until the Stop 
Grant Bus Cycle is returned (see Figure 3). The 
PLD circuitry ensured that this specification was 
met regardless of the period and duty cycle of the 
signal from the pulse generator. 


Pulse Generator 


INTR Jumper. 


PLD Circuit 
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e At the conclusion of the STPCLK# period, once 
STPCLK # is deasserted it must remain deasserted 
for a minimum of 5 clocks before being asserted 
again (see Figure 3). Again the PLD circuitry en- 
sured that this specification was met. 

® Deassertion of STPCLK# on INTR. The PLD cir- 
cuitry was designed so that STPCLK# could be 
deasserted on INTR if desired. A jumper controlled 
whether or not this occurred (see Figure 4). The 
logic was designed such that INTR could only deas- 
sert STPCLK # after the Stop Grant Bus Cycle had 
been returned. 


Appendix A contains a schematic of the daughter card 
and the PLD equations. 


Ethernet Cable 
(Coax or twisted pair) 


1486™ CPU Motherboard 


Network 
Interface 
Card 


Daughter Card 


SL Enhanced 1486™ CPU 
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Figure 2. Test Hardware Setup 
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Hold STPCLK# at Least Until STPCLK# Must be Deasserted 
Stop Grant Bus Cycle a Minimum of 5 clocks After RDY 
) a See ® ‘ 


CLK 


STPCLK# 


RDY# ? 
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Figure 3. Stop Grant Bus Cycle 


STPCLK# (No Deassertion on INTR) 


i ees 0 ee 


STPLCK# (Deassertion on INTR) 
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Figure 4. STPCLK # and INTR 
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3.0 POWER SAVINGS WITH CLOCK 
THROTTLING 


Asserting STPCLK # saves power on the SL Enhanced | 


Intel486 CPU motherboard by reducing the current to 
the CPU, and by reducing the current to DRAM and 
external cache memory. As the CPU is effectively halt- 
ed no memory cycles will occur. In the case of clock 
throttling, as STPCLK # is asserted and deasserted, the 


motherboard will rapidly toggle between a low power . 


and normal power state. The average of these two states 
will be the effective “sleeping state”. It is obvious, 
therefore, that the duty cycle of STPCLK # will have a 
profound effect on the sleeping energy consumption. 


STPCLK Perlod ~ 10 Milliseconds 


LILI 


STPCLK Period ~ 30 Milliseconds 
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In the implementation where STPCLK # is deasserted 
for the remainder of the current period with INTR (see 
Figure 4) then the period of STPCLK # will also have 
an effect on the sleeping energy consumption. An un- 
used sleeping system (ATT type PC) will experience a 
real time clock interrupt every 55 ms. As the period 
increases, the effect of deasserting STPCLK# on 


-INTR reduces the effective duty cycle (see Figure 5). 


- STPCLK Period ~ 10 M liliseconds with Deasseftion on INTR 


/ 


Power consumption data was collected for a variety of 
duty cycles and periods, and with/without deassertion 


of STPCLK# on INTR. Please see Figures 6 meee 


8. Data for charts is in Appendix B. 
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Figure 5. Effect of Deasserting STPCLK # on INTR 
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System Power (% FullOn) 


STPCLK # Period (ns) ee 


Figure 7. Power vs STPCLK Period, Duty Cycle = 50%, Interrupts Deassert STPCLK # 


2517 


AP-504 — 


System Power (% Full On) 


STPCLK# Period (ms) 
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Figure 8. Power vs STPCLK Period, Duty Cycle = 90%, Interrupts Deassert STPCLK # 


Looking at the graphs it is clear that the duty cycle of 
the STPCLK # pulse has the most effect on the overall 
system power. The period of the pulse also affects the 
power, however not as significantly as the duty cycle 
variation. The period of the pulse has the greatest effect 
when the duty cycle is set at 90%. 


4.0 OVERVIEW OF NETWORK TEST 
PLAN 


Given the almost infinite matrix of network operating 
systems, network topologies, network interface cards, 
host system bus architectures, and _ potential 
STPCLK # periods and duty cycles, an exhaustive test 
of all permutations is not feasible. Therefore, testing 
centered on the Novell network operating system, the 
Ethernet™ network topology, and the ISA bus archi- 
tecture due to their overwhelming market share. An 
understanding of the test philosophy and test condi- 
tions can allow some extrapolation of results to other 
environments. 


Maintaining the network connection while in a low. 


power state (sleeping) is mandatory for any energy effi- 
cient computer. A variety of network cards and 
STPCLK # periods and duty cycles were tested in a PC 
connected to a Novell NetWare? 3.11 network. Section 
5 discusses the procedures and results of the testing of 
the ability of clock throttled systems to maintain a net- 
work connection. 
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Peer-to-peer networks have an added complication for 
sleeping PCs in that any user’s workstation may be con- 
figured as a server as well as a client. A user’s sleeping 
PC may be accessed by another user. This access may 
or may not be considered a wake-up event, depending 
on the particular power management scheme. Section 6 
covers the performance implications of transferring 
files from a sleeping PC in a peer-to-peer network (No- 
vell Lite). 


A similar problem to the previous one involving peer to 
peer networks, can occur in classical client-server net- 
works where user’s PCs are configured as network print 
servers. Section 7 discusses test results of printing to a 
remote sleeping print server in a Netware 3.11 environ- 
ment. 


Intel’s Compatibility Validation Lab continually tests 
new Intel CPUs in a variety of machines to guarantee 
all new microprocessors are completely Intel compati- 
ble. These workstation tests are regularly performed 
over a network to facilitate the process. While an ener- 
gy efficient computer would not normally be sleeping 
during these tests (as they emulate user activity), for 
research purposes these tests were completed with an 
aggressively clock-throttled machine. The details of 
these tests are in Section 7. 


intel. | 
5.0 NETWARE 3.11 TEST SUMMARY 


5.1 Test Environment 


The test environment can be summarized with the fol- 

lowing list: 

® Novell 3.11 File Server running on an Intel486 DX2 
66-MHz CPU 


e Traffic Generating Station using Intel NetSight Pro- 
fessional 


® One Traffic Monitoring Station using Intel LanSight 
¢ Two dummy clients 


® One test station with an SL Enhanced Intel486 DX2 
66-MHz CPU 


e Twisted Pair Ethernet Cable 
e Hub 


5.2 Test Description 


This section of the documents describes the tests per- 
formed and provides explanations for some of the inde- 
pendent variables used in the test. As mentioned in sec- 
tion 4, the primary motive of the test suites was to 
demonstrate that the network connection is not lost 
when the processor enters into the power down or stop 
clock state. The variables initially considered to be a 
factor included: the network utilization, the period of 
the STPCLK # pulse, the duty cycle of the STPCLK # 
pulse, whether interrupts deasserted the STPCLK # 
signal or not, and several others. To perform an exhaus- 
tive test of all possible values of all possible parameters 


Server 


Traffic Traffic 
Generator Monitor 


AP-504 


was not feasible for the scope of this paper, (and not 
particularly useful as shown later), so several parame- 
ters were fixed at chosen values. 


Network Utilization 


The network utilization was fixed at 22.8%. [The traf- 
fic generator was set at 25% but measured values dem- 
onstrate that the actual utilization was 22.8%.] This 
was fixed because preliminary tests demonstrated that 
network load did not have a perceptible effect on the 
outcome of the test. Only broadcast packets, or individ- 
ually addressed packets are actually loaded into the net- 
work interface card’s receive buffer. Given the assump- 
tion that the client is in a low power state because of 
minimal system activity, then it follows that only a 
minute percentage of the network traffic will corre- 
spond to the sleeping station. 22.8% was chosen as the | 
utilization of a fairly busy network. 


interrupts will not Deassert STPCLK # 


In the real world environment, this parameter will be 
implemented by the power management logic of the 
particular design. If interrupts deassert STPCLK #, 
then all interrupts (like the real time clock, and many 
others) will bring the processor out of low power mode. 
This scenario will provide improved performance over 
the case where interrupts do not deassert STPCLK #. 
For our tests, the worst case implementation was cho- 
sen. That is, interrupts will not deassert the STPCLK # 
signal. If the tests succeed in this environment, they 
will succeed if interrupts deassert STPCLK #. 


Test Station 


Daughter 
Card 
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Figure 9. Test Environment 
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Test Station Idle 


The test station is not performing any operations for 
the duration of the test. This parameter is based on our 
initial assumption that the client is asleep because there 
is no activity. 


Time 


| 

intel. 
time that the server drops the connection when no ac- 
tivity is occurring. If the cable is actually disconnected 
from the network interface card, the server will drop 
the connection in less than fifteen minutes. A reason- 
able length of time to test would be 4x this value. So, 
each of the tests was run for at least one hour. As a 


sanity check, the STPCLK # pin was grounded (made 
active) and the length of time for the server to drop the 


connection was measured. 
To determine the length of time to test some of the 
cards, it was first necessary to determine the length of 


Table 1. Network Test Loading (Fixed 25% Loading, Interrupts Disabled) 


Network STPCLK# a . eardkee 
Card Period Fail Time 
3Com Etherlink II 1 ms ee 11 min | ; 
"Ansel 2100 1 ms [som | | so 13 min 
Ansel 2200 1 ms a 12m 

8 ms a ee 
Kingston KNE 2121 1 ms ee [an 11 min 


NOTES: 


90% is percentage time that the STPCLK# is asserted (low). 

Time is the time that each card was tested under 25% network load. 

Pass/Fail: A card is considered to have passed if it maintains connection with the network for 1 hour or more. 

Server Disconnect time is time that it takes for the network to drop a station that does not respond. This time was computed 
by grounding the STPCLK# ( completely halting the CPU). 
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5.3 Test Results 


All of the LAN cards tested passed all of the tests with 
the initial conditions as described in Section 5.1..As all 
the tests passed, it is interesting to consider some of the 


factors at work. One factor is the packet size. The de-. 


fault Novell packet size is approximately 1.5 Kbytes. 
On a card with a receive buffer of 32K, twenty-one full 
size packets can be put into the buffer without dropping 
any. When little or no network traffic is being generat- 
ed by a particular station, the number of packets ad- 
dressed to the station is small. Generally speaking, the 
server will send “hello?” packets every few minutes to 
ensure that the clients are still maintaining the connec- 
tion on the network. It has been shown that the file 
server actually terminates the connection before the 
buffer would be filled with unopened messages. So, in 
the fifteen minutes that a server takes to confirm a lost 
transmission, twenty packets are not sent to the client. 
The client need respond to only one of these “hello” 
packets for the connection to be maintained. Another 
related factor is the protocol itself. Normal Ethernet 
protocol specifies that if a packet is sent without a re- 
sponse, the packet is resent. So, in the event the client 
receive buffer does fill up and a packet is lost because 
the receive buffer is full, the initiating station will re- 
send the message. The chance of dropping a package is 
slim, and the chance of not recovering from a dropped 
package is even less. 


One scenario that has not been mentioned is the scenar- 
io whereby the client goes to sleep in the midst of a 
large network operation. Even if this unlikely situation 
should occur, it will not affect the network connection. 
Protocol calls for a handshaking mechanism for all 
packets. A request is sent and a reply is received. In the 
event that the client enters into a low power state, the 
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CPU will request the data less frequently, and thus re- 
ceive data less frequently than if operating in a fully 
awake state. So as the effective frequency of the proces- 
sor decreases so will the network bandwidth required 
by the operation in progress. 


Another key consideration in the success of the tests 
was a simple matter of processor performance. Even 
when operating with a 90% duty cycle on STPCLK #, 
the SL Enhanced Intel486 processor compares very fa- 
vorably with older processors still connected to many 
of today’s networks. 


In summary, it has been shown that the STPCLK fea- 
tures of Intel’s SL Enhanced processors will not cor- 
rupt the Novell 3.11 Client Server LAN or lose net- 
work connections under normal circumstances. 


6.0 NOVELL LITE TEST SUMMARY 


6.1 Test Environment 


The test environment can be summarized with the fol- 

lowing list: 

@ Novell Lite 1.1 server and client software running 
on SL Enhanced Intel486 DX 33-MHz CPU 


e Traffic generating station using Intel NetSight Pro- 
fessional 


® One traffic monitoring station using Intel NetSight 
Professional 


® One test station with SL Enhanced Intel486 DX2 
66-MHz CPU configured as Novell Lite 1.1 server 
and client 


® Coaxial cable connecting the four stations 


Traffic 
Generator 
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Figure 10. Test Environment 
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6.2 Test Description 


This section describes the tests performed and provides 
explanations for some of the independent variables in 


the test. As mentioned in Section 4, the primary motive 


of the Novell Lite testing was to determine the perform- 
ance implications of clock throttling PCs in a peer-to- 
peer networking environment. : 


The performance measurements were done by running 
two different tests while varying the period of 
STPCLK # assertion from 1 to 55 ms, the duty cycle of 
STPCLK # assertion from 50% to over 90%, and al- 
lowing and not allowing interrupts to deassert 
STPCLK#. The tests consisted of running two batch 
files which transferred files to and from the host ma- 
chine. The first batch file transferred 2 files 10 times 
each. These files had files sizes of 41 Kbytes and 65 
Kbytes. The second batch file transferred two large files 
having sizes of 130 Kbytes and 333 Kbytes twice to and 
from the host machine. Each of these tests were run on 
the seven Ethernet cards in the test suite. The results 
can be seen in Appendix C. 


The range of the period from 1 to 55 ms was chosen 
because this is the minimum and maximum periods of 
typical implementations. The period was varied from 
50% to 90% for maximum power savings. Data was 


also taken in the cases of allowing and not allowing © 


interrupts to deassert STPCLK#, as obtaining per- 


formance results in both situations is necessary for a | 


complete analysis. 


Network Utilization 


The network utilization was fixed at 22.8%. [The traf- 
fic generator was set at 25% but measured values dem- 
onstrate that the actual utilization was 22.8%.] The 
utilization was fixed because preliminary tests indicated 
that network utilization did not have a perceivable ef- 
~ fect on network connection failures or on network per- 


formance. This percentage is considered a heavy traffic ° 
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load but makes the test more representative of an actual 
network. The packets generated by the traffic generator 
were sent to random addresses, and had no effect on the 
performance of the individual cards other than using up 
network bandwidth. The bogus packets broadcasted on 


the network were 64 bytes long, and approximately 
1,500 packets were sent per second. 


Test Station Idle 


Our test station was idle except when responding to the 
network requests generated by the test programs. This 
parameter is based on the initial assumption that the 
client is asleep because there is no activity. 


6.3 Test Results 


Performance 


Performance slowdown is defined as the length of time © 
the test took with clock throttling compared to the 
length of time the test took without clock throttling 
expressed as a percentage. A slowdown of 100% would 
mean that the tests took twice as long. — 


The highest performance levels were obtained by allow- 
ing interrupts to deassert STPCLK #. Using this meth- 
od, the performance slowdown was never more than 
60% of normal operating conditions. The best perform- 
ance using STPCLK # at a 50% duty cycle was seen at 
a very short period (1 ms). The best performance seen 
in the 90% duty cycle occurs at the highest period test- 

ed, 50 ms. See Figures 11 and 12. . 


When interrupts were not allowed to _ deassert 
STPCLK#, the highest performance at a 90% duty 
cycle was on the shortest period tested. While the per- 
formance degradation (120%) is significant, this may 
be acceptable in environments where “sleeping” PCs 
are rarely accessed, or, of course, in non peer-to-peer 
networks where this situation will not occur. 
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Figure 11. Performance Slowdown vs STPCLK # Period Relative to Peak Performance at 50% Duty 
Cycle (with and without STPCLK # Deassertion on INTR) 
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Figure 12. Performance Slowdown vs STPCLK# Period Relative to Peak Performance at 90% Duty 
Cycle (with and without STPCLK # Deassertion) 
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The knee in the graph at approximately 30 ms could be 
a function of the operating system’s interaction with the 
I/O subsystem. As the period increases with the same 
duty cycle, the performance diminishes because the 
waiting time for the disk drive or LAN increases. For 
example many small periods would be more efficient 
than fewer bigger periods. At some point, however, the 
system reaches the point where it can complete an en- 
tire job (i.e. one file copy request) in the period (around 
30 ms). At this point, the efficiency suddenly improves. 
Then once again, the efficiency diminishes as the period 
increases. The system can do 1.1 job, 1.2 job, etc. Theo- 
_retically the system would see another increase in per- 
formance at the two job interval (around 60 ms) 


A connection test was also performed. The systems 
were all run for at least one hour with STPCLK # as- 
serted over 92% of the time with a period of 55 ms. 
During this time, interrupts did not  deassert 
STPCLK #. No systems failed and no connections were 
lost. A period of 55 ms was chosen because this is the 
maximum period that a system can be asleep without 
losing time from the real time clock. All the cards were 
tested in this environment and all the cards passed. An 


explanation of the Novell Lite protocol shows this to be 


a reasonable result. 


If a Novell Lite machine attempts to access another 
machine and its first message is not acknowledged, it 
will retry 15 times at approximately 220 ms intervals. A 
retry is only counted towards the 15 if the request is 
successfully transmitted onto the network media (no 
collision occurred). If there is heavy traffic on the net- 


work, it may transmit at longer intervals than 220 ms. 


due to collisions. 


Novell Lite also uses a stop and wait protocol. The stop 
and wait protocol mandates that after one packet is 
transmitted, the next one will not be transmitted until 
an acknowledgment is received. If a packet is lost, the 
protocol will retransmit the packet (or request) after a 
time-out period. Standard Ethernet was never assumed 
to be an error-free transmission media, and procedures 
are already in place to handle lost packets. Therefore, 
even if the packet arrives successfully, and for some 
reason was lost in the destination machine (which did 
not appear to happen), the sending machine will re- 
transmit the package since an acknowledgment was not 
received. 
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7.0 NOVELL PRINT SERVER TEST 
SUMMARY 


7.1 Test Environment 


The test environment can be summarized with the fol- 


lowing list: 


© Novell 3.11 File Server running on an Intel486 DX2 
66-MHz CPU 


¢ Traffic Generating Station using Intel Netsight Pro- 
fessional 


® One Traffic Monitoring Station using Intel LanSight 
© Two dummy clients 


® One test station with an SL Enhanced Intel486 DX2 
66-MHz CPU 


— Configured as a Novell 3.1 network print server 
— Intel EtherExpress LAN card 

© Twisted Pair Ethernet Cable 

© Hub | 

e HP DeskJet} 550C Printer 


7.2 Test Description 


The primary motive of this test is to show that there 
will be no significant loss in performance from a net- 
work workstation configured as a print server if it goes 
into power down mode. Testing was performed to com- 
pare normal network printing (STPCLK# disabled) 
with a worst case scenario in which STPCLK # is as- 
serted and interrupts are disabled. The test consisted of 
printing a bitmap file and a text file from a client sta- 
tion to the test station configured as a print server. 
Testing was performed with the test station fully 
“awake” and “asleep”. The “asleep” or worst case test 
consisted of a 25% network load (as did awake case), a 
STPCLK # duty cycle of 90%, and a STPCLK ¥ peri- 
od of 1 ms. The tests were performed using an Intel 
EtherExpress LAN card. The two file types used for 
testing were a 13 page 36K text file and a 308K bitmap 
file. 
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7.3 Test Results 
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Table 2. Test Results for Novell Print Server 


NOTES: 
STPCLK# disabled: CPU fully “awake’”’. 


STPCLK# enabled: CPU ‘“‘asleep’’, 90% duty cycle, 1ms period, interrupts disabled. 
Duty Cycle: 90% is the percentage time that the STPCLK# is asserted (low). 


Period: STPCLK# period. 


Network Load: actual network load as recorded by network monitor was 22.8%. 


File Print Time: time to print entire file beginning at client. 


7.4 Test Conclusions 


As expected, there was no significant difference be- 
tween file print times for a fully “awake” CPU and a 
“sleeping”? CPU. The slight variations in time can be 
due to the Novell protocol itself. First, the file to be 
printed is transferred over the network to the network 
server; it is sent to the print server where it is stored in 
a print queue. From there it is spooled into the printer 
itself. Although a “sleeping” CPU transfers data much 
slower from the LAN card to memory, and from mem- 
ory to the printer, it is sufficiently fast enough to keep 
up with the printer. The limiting factor here was not 
the CPU speed, but actually the printer speed. Also, 
note that this is a “worst” case, in which interrupts do 
NOT disable the STPCLK #. In summary, STPCLK # 
features do not have a significant impact on print server 
performance. 


8.0 INTEL COMPATIBILITY 
VALIDATION LAB TEST 
SUMMARY 


Intel’s Compatibility Validation Lab is chartered with 
ensuring all Intel microprocessors are Intel compatible. 
To this end, they extensively test all new Intel CPUs in 
a variety of environments. Passing their tests is an indi- 
cation that the device under test is Intel compatible. A 
PC equipped with the daughter card to enable clock 
throttling was submitted to the CV Lab for testing. In 
effect, the testing was to determine whether or not a 
clock throttled CPU was Intel compatible. 


For logistic reasons, the CV Lab performs its worksta- 
tion tests over a network. CV Lab’s workstation tests 
consist of running various industry application pack- 
ages. The test suites emulate typical user activity in 
applications such as Microsoft Excelt , Word? , etc. 
An Energy Star system would normally be in a full on 
state during such activity; however, for study purposes 
this testing was done with a clock throttled system to 
investigate any issues involved with interaction between 
a clock throttled CPU and heavy network traffic. 


Nine standard workstation tests were performed on a 
PC equipped with the daughter card to enable clock 
throttling. The following “typical” STPCLK # imple- 
mentation was chosen. 


1. Period equal to 8 ms 
2. Duty Cycle equal to 90% 


3. STPCLK # was deasserted for remainder of period 
on INTR 


All tests were run successfully, and no network failures 
were observed. In conclusion, the CV Lab testing did 
not indicate any incompatibility between the 
STPCLK # clock throttled PC and the network. 
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9.0 RECOMMENDATIONS FOR 
CLOCK THROTTLING 


Clock throttling provides a clean solution for hardware 
power management of the CPU and memory subsys- 
tem. Clock throttling offers an easy way to design a 
system which can enter a low power state (less than 
30W) during periods of inactivity, while still providing 
adequate functionality to maintain a network connec- 
tion and even respond to printing or data requests. 
Hardware power management through clock throttling 
means that the power managed system will not have to 
depend on software drivers or a particular operating 
system to meet Energy Star compliance. This offers the 
OEM greater flexibility in the system configuration, 
and frees the end user from concerns over software up- 
grades. — 


It is important in an ATT type personal computer that 
the real time interrupt (once every 55 ms) not be 
missed. This can be guaranteed by either choosing a 
clock throttling period of less than 55 ms, or by mo- 
mentarily deasserting STPCLK# on the real time in- 
terrupt. 


Clock throttling enables the system designer to choose 
the level of power savings necessary for a particular 


implementation. It has been demonstrated that increas- | 
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ing the STPCLK # duty cycle dramatically affects the 
power requirements of the system, with little impact on 
performance of “sleeping systems”. If the performance 
of “sleeping systems” is a major concern, then momen- 
tarily waking for system interrupts can obviate this deg- 
radation to a great extent. If the system will be awak- 
ened for the remainder of the STPCLK# period, as 


was done in the test daughter card, then the period of 
STPCLK # clock throttling will affect energy savings. 


An alternative method of STPCLK # clock throttling 
is with a variable period driven by external events. Basi- 
cally STPCLK# is asserted until an external event 
(such as interrupt) occurs. At this ttme STPCLK # is 
deasserted for a programmable period (2 ms—4 ms), 
and subsequently reasserted. This dynamic clock throt- 
tling scenario comes very close to always operating the 
CPU in a low power state, except for when there is 
actually work to do (such as service the interrupt). 


Finally, it should be noted that STPCLK # testing re- 
vealed no inconsistencies with clock throttling an 
Intel486 SL Enhanced CPU in a network environment, 
and therefore its proven network compatibility should 
be considered when judging the merits of using clock 
throttling as part of a power management program. 
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APPENDIX A 
STPCLK# TEST CIRCUITRY 


STPCLK# Daughter Card Schematic 


INTR PLO STPCLK PLD 
CPUCLK 
A4 


| STP_OUT# 


- © ON OM S&S WN — 


(fo) 


CPUCLK | 


PULSE 
GENERATOR 


STP_IN& 
OUTPUT 


INTR 


INTRA JUUMPER 
SL ENHANCED i486™ DX CPU 


TO MOTHERBOARD 
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Figure A-1. STPCLK # Test Circuitry 
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INTR PLD 
(PLDasm format) 


Pattern 1 
Revision Cc 


CHIP INTR 16R4 


; inputs 

pin 1 CLOCK 

pin 2 /SYSRDY 

pin 3 INTR 

pin 4 /STPCLK_OUT 
pin 5 /STP_GNT 
pin 10 GND 

pin 11 /0OE 


pin 20 vec 
; outputs 


pin 12 /CPU_RDY 
pin 13 /GNTRDY | 
pin 14 /INTLATCH 
pin 15 /RDYDONE 
pin 16 /RDYKILL 
pin 17 /GNTLATCH 
pin 18 /CPU_STPCLK 
pin 19 /STOPGATE 
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Title — INTR and RDY | 


Company Intel Corporation 


REALE LAE ELSA EAA ARRAS EE EAA EERE ERE ERE RRA RALE RRR H RE ERE Sf 


;* Works with STPCLK.PLD to deassert STPCLK on INTR after */ 


7* making sure that the Stop Grant cycle has occurred */ 
EEX AERA EASE OLAS AA ESS RAL AAA EAA SEERA EEE AAAS ORS A RRR 


CPU Clock 

RDY# from board 

INTR signal to CPU 

STP_OUT from STPCLK. PLD 

Stop Grant Detect from STPCLK PLD 


; Ground for PLD 


Output Enable 
Power 


RDY# output to CPU 
Internal signal 


STPCLK# output to CPU 
Internal signal 
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EQUATIONS 
CPU_RDY.TRST 
STOPGATE 
GNTLATCH 
RDYKILL 
GNTRDY 
CPU_RDY 


RDYDONE 
INTLATCH 


CPU_STPCLK 


;end of equations 
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STPCLK_OUT* /INTLATCH + STOPGATE* STPCLK_OUT 
+ STOPGATE* /RDYDONE 


STP_GNT ; GENERATE RDY AFTER STOP 
GNTLATCH ; GRANT FOR ONE BUS CYCLE 


GNTLATCH* /RDYKILL 
SYSRDY 


RDYKILL + STOPGATE*RDYDONE ; HOLD RDYDONE TILL 
; STPCLK GOES AWAY 


INTR + STOPGATE* INTLATCH ; HOLD INTLATCH TILL 
; STPCLK GOES AWAY 


STOPGATE* /INTLATCH + STOPGATE* /RDYDONE 
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STPCLK PLD 


Name 

Partno 

Revision . 

Company 

Assembly 

Location 
. Device 


/* Hardware 


< Inputs > 
pin 1 

pin 2. 
pin 3 
pin 4 
pin 5 
pin 6 
pin 7 
pin 11 
pin 14 
pin 16 


< Outputs > 
pin 8 

pin 9 

pin 10 

pin 15 

pin 19 
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{CUPL Format) 


STPCLK. PLD; 


85Cc060) ; 


OS; 

Intel Corporation, Inc.; 
CUPL; 

U00; 


- EP600; /* PLDXXXXX Intel 85cC060 */ 


[BERR RRR EKER ERE ER EE ERE RERERERER ER EKER EERREREEEKEREREREKE / 


implementat.ion assert minimum STPCLK# duration */ 


/* until CPU acknowledges with STOP GRANT CYCLE. */ 


[RRR IRR IRR RRR RIOR KR RR IRR KOR RK TORK RIOR RRR RIOR RR OR RR RR KKK / 


= CPUCLK =; <pin #1 and #13 connected together> 
= A4- ; 

= Me IO ; 

= De ;: 

= W_R ; . : 

= /STP_IN ; < asynchronous input from system > 
= RESET ; ° 

/BOFF : < omit if not used > 

HLDA ; 

= !ADS; 


= /STP_OUT ; < STPCLK# output to CPU > 
= STP_LATCH ; 
= /STP_GNT ; 
= GNT_DETECT; 
FLOATED ; 
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< Logic Equations 

STP_LATCH.D = STP_IN; 

STP_LATCH.AR = RESET; 

STP_GRANT_STATE ( /M_IO * /D_C * WLR * A4 ); 

STP_OUT STP_LATCH * !STP_GRANT_STATE <pass thru IF not STP_GNT> 


STP_IN * GNT_DETECT <pass while in 
STP_GNT> 


/STP_GNT * STP_OUT ) * /RESET; <hold before STP_GNT> 


STP_GNT.D STP_GRANT_STATE * STP_OUT * /HLDA * /FLOATED * ADS 
STP_GNT * STP_OUT; . 
<STP_GNT remains active until STP_OUT# is inactive> | 


GNT_DETECT STP_GRANT_STATE * STP_OUT * STP_IN * /HLDA * /FLOATED 
+ GNT_DETECT * STP_IN; 
< wait for /STP_LATCH to deassert AFTER STP_GNT active > 


FLOATED .D BOFF; <first latch sequence> 
FLOATED.ar RESET; 
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_ APPENDIX B 
SYSTEM POWER MEASUREMENTS 
WITH CLOCK THROTTLING 


Period = 40 ms 50% Duty Cycle 
STPCLK # Not Deasserted on interrupts STPCLK # Deasserted on Interrupts 


STPCLK# ‘| Duty Current Power Period DC Current Power 
(ms) (mAmps) (Watts) 


Asserted (ms) | Cycle | (mAmps) | (Watts) 
376% | 214 | 1068 | | 15 | 206 | 10.81 
| | 
4 


oO; 


15 


62.5% 1.84 


90% STPCLK# Asserted 
STPCLK # Deasserted on interrupts 


(ms) (mAmps) (Watts) 
ee ee 
[ive | eos 


Gq 
io) 


75% 1.69 _ 8.45 
87.5% | 1.55 7.75 


ND 
io) 
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APPENDIX C 
NOVELL LITE 
COMPLETE TEST RESULTS 


Network Card: Ansel 2000 
Type: AUI/BNC/TPI 
Extended Test: Passed 


Stop Clock Dut STPCLK STPCLK nisrrunte 
Period y Asserted Deasserted p Test 1 Test 2 
Cycle Enabled 
(ms) (ms) — 


ee ae a a oe ae TS 
[+ J soo% [050 | 050 | yes | tase | ere 
a a 
8 | 
a a 
ae 


[8 [0.00% [400400 
[20 [50.00% | 1000 | 1000 
[20 | 50.00% | 1000 | 1000 | yes | tara | 708 
[20 [80.00% [1600 [400 
[20 [80.00% | 1600 | 400 | yes (| taa5 | 798 
[eo [ 867% | 2600 [400 no ~+Y aes? | eens 
[s0|saae% | te00 | 1400 | no +) zaae | tasa 
[so | 500% | 1600 | 1400 | yes | tase | 7a 
(«| 50.00% | 2000 | 2000 | no | tase | 11s 
[#0 | 50.00% | 2000 | 2000 | yes | taa | 798 
[10 | soson | os00 | aco [nee [na 
[| 80.00% | e600 [400 | yes | 1408 | 790 | 
2 
yes 
ee 
a 
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Network Card: . Ansel 2100 
| Type: AUI/BNC/TPI 
Extended Test: Passed 


STPCLK STPCLK ners 
Asserted | Deasserted P Test 1 Test 2 
Enabled | : 

(ms) (ms) 

| 50.00% | 050 | 060 | mo 18.12 
| | 50.00% | 10.00 10.00 | no | 1642 | 12.027 
80.00% 16.00 400 | no | 2488 16.64 
| 20, 80.00% 16.00 4.00 13.07 
| 30s |= 50.00% 15.00 15.00 17.13 10.32 
50.00% 15.00 15.00 13.18 7.36 


Stop Clock 
Period 


, 


86.67% 26.00 4.00 34.21 24.27 


pn 
r 

| no | 8 
86.67% 26.00 400 | yes | 1296 | 7.41 
53.33% 16.00 1400 | no | 1829 | 10,76 
53.33% 16.00 1400 | yes | 1296 | 7.36 
50.00% 20.00 2000 | no | 1834 | 10.98 
50.00% 20.00 20.00 7.25 
90.00% 36.00 400 | no | 4564 | 
90.00% 36.00 4.00 7.57 
| 55 | 50.00% 27.50 2750 | mo | 2065 | 12.85— 
pe 


92.73% 51.00 400 | no | 77.06 | 5206 | 
92.73% | 51.00 4.00 12.85 
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Network Card: Eagle NE2000 Plus 3 
Type: AUI/BNC/TPI 
Extended Test: Passed Eagle 


STPCLK STPCLK iAteiubte 
Period Asserted Deasserted P 

Enabled 
(ms) (ms) (ms) 


| 


Stop Clock 


Duty 
Cycle 


Test 1 


10.71 


[so | s687% | 2600 | 400 ~| yes tare 


f 
oO 
oO 


yes 
yes 
yes 

es 


y 
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Period Asserted Deasserted Interrupts 


_ Network Card: Intel EtherExpress 16C 
| Enabled Test 1 Test 2 
(ms) (ms) 


Type: AUI/BNC/TPI 
| Extended Test: Passed 

Stop Clock - STPCLK STPCLK 
oso | 050 | mo | 194 


| [mo | t7aa_| 10.48 
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[00% [wa ANA tae 
[000% [080 | oso | eine | 78 
[1 [so00% [oso | oso | yes | 1208 | 757 
8 | s000% [400 | 400 | no | 625 | 10.16 
[8 _ | s000% [400 | 400 | yes | tar | 774 
[20 [80.00% [1000 | 1000 | no | 1708 | 1087 
[20 | s000% [1600 | 400 | no | 200g | 1048 
[30 | s000% | 1600 | 1800 | no | 1708 | 1054 
[30 | s000% [1500 | 1500 | yes | tate | 703 
Ts [ 88.67% [2600 | 400 | no | sear | 2000 
[| eee7% [2600 | 400 | yes | 1200 | 752 
[| sas0% [1800 | 1400 | no | tres | tia 
[so [sase% | 1600 | 1400 | yes | 1208 | 708 
[40 | s0.00% | 2000 | 2000 | no | 1763 | 1104 
Tasot | ee2e 

[yes [1328 | 807 

[no | 10.48 | 218 

[yes [128s [748 

es 


Network Card: Kingston KNE2121 
Type: BNC/TPI 
Extended Test: Passed 


STPCLK 
Asserted 
(ms) 


STPCLK 
Deasserted 
(ms) 


Stop Clock 
Period 
(ms) 


Interrupts 
Enabled 


[40 | 80.00% | e600 [400 | no 
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Netword Card: SMC Elite 16C Ultra 
Type: AUI/BNC/TPI 
Extended Test: Passed 


@ 


Stop Clock STPCLK STPCLK ee 
Period Asserted Deasserted P Test 1 Test 2 

Enabled 

(ms) (ms) 

| 80 | 83.33% 1400 | no | 129 | 101 | 
50.00% | 2000 =| ~—no_—s«Ts t8.01_-(| 11.36 
| 55s | 50.00% 27.50 27.50 yes 13.34 | 8.29 | 
| 5 | 92.73% 51.00 | 400 | no | 59.75 | 46.13 
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Netword Card: 3Com Etherlink II 
Type: AUI/BNC/TPI 
Extended Test: Passed 


Stop Clock 
Period 
(ms) 


STPCLK 
Asserted 


STPCLK 
Deasserted 


Interrupts 
Enabled 


ne 
pe [er | os 
[yes [1962 [796 
[no [178 | 7008 
[yes t78 [807 
re 
[9 sasox | “oso | nso | yee ines [or 
eae 
ae 
ae 
a 
ae et 
a 


[20 | e0.00% [1600 | 400 
[20 | a000% | 1600 | 400 
[30 [s0.00% [1800 | 1500 
[80 | s000% [1800 | 150 
[30 | e667% [2800 | 400 oe | ass 
sw | eser% | 2600 | 400 
30 | sasa% | te00 | 1400 | mo | 1787 | 1082 
so | sa80% | 1600 | t400 | yes | tas | 78 
[#0 | s000% [2000 | 2000 | no | 1703 | 1100 
[a0 [50.00% [2000 | 20007 | yes | tate | 752 
[a0 [80.00% | ae00 | 400 | mo | se20 | area 
[ss | se7ex [5x00 | 400 | mo (| sero | area 
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Averages 


Stop Clock STPCLK STPCLK | 
Period Asserted Deasserted 
(ms) 


N/A 
0.50 
0.50 
4.00 
4.00 

10.00 
10.00 
16.00 
16.00 | 
15.00 
15.00 


Interrupts 
Enabled 
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APPENDIX D 


TYPICAL NETWORK INTERFACE CARD 


POWER REQUIREMENTS 


Card Current Power 
oo gaa Loaded) ae <a 


| Ansel20000 2000 0. | 0.33 | 1 | 1.65 | 


Teagiene2000 | 010 | os 
Tkingsonknezt2i | oat | 205 
Tswceitetec | ove | 20 
acométherineit [| 008 | 04s 


NOTE: 

It is important to properly understand the results in the 
above table. The network cards with a power consumption 
of approximately 2.5W represent a class of cards with older 
generation technology. The NICs with power consumption 
below iW represent the newer integrated single chip lan 
controllers. For Energy Star compliant systems, 2.5W could 
represent as much as 17% of the power budget (depending 
on the efficiency of the power supply). So highly optimized 
systems with sufficient power budget could easily accommo- 
date any network card. A system with a smaller marginal 
power budget may wish to consider a NIC with the newer 
technology. 
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1.0 INTRODUCTION 


The IntelDX4™ processor is the newest and highest 
performing member of the Intel4&6™ processor fami- 
ly. At internal speeds of up to 100 MHz, the IntelDX4 
Processor is the fastest 486, designed for users who 
want the best value in 486 desktop computing today. 
With its larger 16K internal cache size and improved 
core speed, the 100 MHz IntelDX4 processor outper- 
_ forms the 66 MHz IntelDX2™ processor by as much 
as 50%. Intel’s unique 5V tolerant input buffers make 
this performance improvement achievable with mini- 
mal modifications to existing IntellIDX2 processor based 
desktop designs. This document provides a straightfor- 
ward process for updating your IntelDX2 processor 
based desktop system design to match the potential of 
the IntelDX4 processor, while maintaining system de- 
sign compatibility with previous generations of the 
Intel486 processor family. 


The IntelDX4 processor is based on proven Intel486 
technology and is compatible with the huge installed 
base of over 50,000 applications written for the Intel 
Architecture. To ensure end-user investment protec- 
tion, IntelDX4 processor based systems should be veri- 
fied for upgradability to a future Pentium™ 
OverDrive™ Processor. This application note provides 
clear guidance on how to prepare your system for Pen- 
tium OverDrive Processor verification. 


Intel appreciates your interest in the IntelDX4 proces- 
sor. This document has been developed to allow you to 
minimize your investment in development time and 
bring a reliable design to market quickly. 


am 
intel. 
NOTE: 


Important recommendations that should be carefully 
addressed for a reliable design are highlighted in bold. 


These recommendations must be followed precisely to. 


help ensure that your design will be ready for manu- 
facture with minimal redesign. 


The checklist and flowchart in Appendices A and B 
will help you quickly confirm that you have incorporat- 
ed all of the critical design recommendations. By using 
these tools you can be confident that your system is 
ready to meet the new standard for 486 computing set 
by the 100 MHz IntelDX4 processor. 


2.0 HARDWARE 
RECOMMENDATIONS 


The following design recommendations cover upgrad- 
ing an existing IntelDX2 processor based desktop sys- 
tem design to support the IntelDX4 processor. It is 
assumed that the existing IntelDX2 processor based de- 
sign supports both the processor and its corresponding 


Pentium OverDrive processor using a single socket. 


There are two categories of recommendations included: 
those covering features which are new to the IntelDX4 
processor, and those which apply to the IntelDX2 proc- 
essor as well as the IntelDX4 processor, but have re- 
newed importance for the IntelDX4 processor. 


Table 1. Hardware Differences between the IntelDX4™ Processor and the 


intelDX2™ Processor and Their Respective Pentium™ OverDrive™ Processors 


IntelDX4™ 
Processor 


Max. Internal Speed 


100 MHz 


Supply Voltage 


16 Kbytes 
CLKMUL 
VOLDET 


IN 
N 
V 


Pentium OverDrive Processor 
Socket Pins J1, K1 and L1 


NOTES: 
* Contact your local Intel representative for details. 
** Pentium OverDrive Processor socket pin number. 
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intelDX2T™ 


verDrive™ Processor for 
Processor e ticearnda 


the IntelDX4™ Processor 


Teme pO 
i 


NA. ~ 5.0V for Fan/Heatsink 


Pentium™ OverDrive™ 
Processor for the 
IntelDX2T™ Processor 


Future Pentium™ 


intel. 


Table 1 summarizes the differences between the 
IntelDX4 processor and the IntelDX2 processor. Note 
that the operation of SMM and STPCLK are identical 
to the SL Enhanced IntelDX2 processor. Also note that 
the future Pentium OverDrive processor for the 
IntelDX4 processor is different from the Pentium Over- 
Drive processor for the IntelDX2 processor. 


2.1 Power Supply 


The new features of the IntelDX4 processor bring with 
them some new requirements on the processor power 
supply. Since the IntelDX2 processor is a 5V part, ex- 
isting IntelDX2 processor based designs typically run 
all system logic at SV. The IntelDX4 processor is a 
3.3V part, so it is necessary to modify the processor 
power supply to provide this voltage. Care was taken in 
the design of the IntelDX4 processor to ensure that a 
single system board could be designed which would 
function with any Intel486 processor, while getting 
maximum performance from the IntelDX4 processor. 


2.1.1 PROVIDING 3.3V IN A 5V SYSTEM 
In most system board designs, the 5V system power 


supply is routed to the components on the board 
through a dedicated board layer. With the requirement 
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of a new 3.3V supply for the IntelDX4 processor, it is 
not necessary to add a completely new power supply 
layer to the circuit board, as it is possible to create a 
3.3V “island” around the processor in the existing pow- 
er supply plane. Figure 1 shows a recommended “‘is- 
land” layout. Note the connection from the 5V plane to 
the Vccsp pins, which will power the integrated fan/ 
heatsink in the future Pentium OverDrive processor. 
The IntelDX4 processor’s 5V tolerant input buffers and 
TTL compatible outputs allow the processor to inter- 
face with existing TTL compatible external logic with- 
out requiring extra components Thus, the processor 
can run at 3.3V while the system logic runs at SV. The 
“island” needs to be large enough to include the proces- 
sor, the required power supply decoupling capacitance 
(see section 2.2), and the necessary connection to the 
3.3V source. To minimize signal degradation, the gap — 
between the 3.3V “island” and the 5V plane should be 
kept small. A typical gap size is about 0.02 inches. Min- 
imize the number of traces routed across the power 
plane gap, since each crossing introduces signal degra- 
dation due to the impedance discontinuity that occurs 
at the gap. For traces that must cross the gap, route 
them on the side of the board next to the ground plane 
to reduce or eliminate the signal degradation caused by 
crossing the gap. If this is not possible, route the trace 
to cross the gap at a right angle (90 degrees). 
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3.3 Volt 
"island" 


~ 
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Figure 1. Creating a Power “Island” 
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| 3.3 Volt regulator (upright) and heatsink 


242034-6 
3.3V Supply Using Linear Regulator 
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Use a wide trace to power supply connector 
242034-7 
3.3V Supply Using System Power Supply 


Figure 2. Recommended Power Supply Connection Layout 


2.1.2 CHOOSING A POWER SOURCE 


The three principle concerns which must be addressed 
when selecting a power source are maximum and mini- 
mum load current requirements, and response time. 
The processor power supply must be able to maintain 
correct voltage regulation at current levels below 
0.5 mA for the IntelDX4 processor in the Stop Clock 
State, and up to the maximum current of 3.0A for the 
future Pentium OverDrive processor. The power saving 
technology in Intel SL Enhanced processors, including 
the IntelDX4 processor, will cause the processor to 
switch to very low power levels during normal opera- 
-tion, even if external power management is not used. 
For example, executing a HALT instruction will cause 
the IntelDX4 processor to enter the Auto HALT Pow- 
er Down State, which will cause a significant reduction 
in the current consumption of the processor in as little 
as 100 ns. The transition. from HALT to the Normal 
State will cause current consumption to return to the 
normal levels in a similarly short period of time. The 


processor power supply must be able to maintain cor- 


rect voltage regulation during these transitions. 


There are basically two options for supplying 3.3V to 
the processor, either adding a 3.3V tap to the primary 
system power supply or using on-board secondary regu- 
lation to derive 3.3V from the 5V system power supply. 


For on-board secondary regulation, a linear voltage 
regulator will perform adequately for most desktop and 
server designs. If low heat or power dissipation is a 
design goal, the higher complexity and cost of a switch- 
ing regulator may be warranted. Switching regulators 
offer better efficiency, thereby lowering regulator pow- 
er consumption and heat. See section 2.4 for related 
thermal considerations. 


Figure 2 shows recommended layouts for power supply 
or linear regulator connection to the 3.3V “‘island.” 


Appendix C includes a list of possible vendors for Dow 
er supplies and voltage regulators. 
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2.1.3 POWER SUPPLY SELECTION FOR 
FLEXIBLE MOTHERBOARDS 


Using the voltage detect sense feature of the IntelDX4 


processor, you may design a flexible system mother- 
board which will automatically use the proper proces- 
sor voltage for an IntelDX4 processor or a different 
Intel486 processor. It is also possible to make the selec- 
tion of processor voltage an option during system board 
_ assembly. 


2.1.3.1 VOLDET Automatic Voltage Select 
Circuit Option 


By sampling the VOLDET pin at power up, system 
boards can automatically select the processor power 
supply voltage, enabling a design that may use the 
IntelDX4 processor or a 5V Intel486 processor without 
jumpers or assembly time changes. The VOLDET pin 


is only present in the PGA package version of the - 


IntelDX4 processor. This pin, which is an NC (No 
Connect) on previous Intel486 family processors, is 
connected internally to Vsg on the IntelDX4 processor. 
This pin should be left unconnected in designs that do 
not use the voltage detect feature. Figure 3 shows an 
example of the use of the VOLDET pin with a linear 
regulator circuit to automatically select the correct 
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power supply voltage. If the VOLDET pin is not con- 


_ nected inside the processor, indicating a 5V part, the 


gate of MOSFET Q1 is pulled high, which causes it to 
bypass the 3.3V regulator, supplying 5V directly to the 
processor. Shorting the input of the regulator to the 
output in this way is harmless for most linear regula- 
tors, due to regulator feedback circuitry which shuts 
the regulator off (contact regulator manufacturers for 
specifics). Note that in this case, most regulators will 


require Q1 to handle all the processors current require- 


ments, and so it should be a high-current, low on-state- 
resistance MOSFET. If the VOLDET pin is connected 
to Vss, indicating a 3.3V part, the Q1 transistor is 
turned off, allowing the regulator to function normally. 
Figure 4 shows a suggested placement and layout for 
MOSFET QI. 


2.1.3.2 Other Voltage Selection Options 


It is also possible to design a flexible system board > 
where the processor supply voltage is selected by an 
assembly time option. There are several methods to 
achieve this; the key requirement being that the design 
must handle the maximum current of 3.0A for the fu- 
ture Pentium OverDrive processor. Note that normal 
jumpers may not be adequate, and it may be necessary 
to use several in parallel. 
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Open Drain/ 
Collector 


intel4aa6 ™ 
processor 


3.3V, 5.0V 


NOTES: 242034-8 
* D1 internal on all MOSFETs 
** D2 internal on some MOSFETs 


Figure 3. Example Voltage Auto-Select Circuit Topology 
(Courtesy of Linear Technology Corporation) 


Outline of Socket 3 
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Figure 4. Suggested Placement and Layout for MOSFET Used in Optional Voltage Auto-Select Circuit 
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2.1.4 Vocs PIN REQUIREMENT 


For mixed voltage systems where the processor inter- 
faces with 5V components, the Vc‘c5 pin must be con- 
nected to 5V for proper 5V tolerant buffer operation. 
The Vcocs input should not exceed Vcc by more than 
2.25V during power-up, power-down or during opera- 
tion. If this requirement is not met, current flow 
through the pin may exceed 55 mA, and damage to the 
component may begin to occur. _ 


To meet this requirement, one of two things must be 
done: either the power supply must be designed to turn 
on and off such that the difference between the Vocs 


. | 
intel. 
and Vcc voltages never exceeds 2.25V, or a 1002 resis- _ 
tor must be put in series with the Vccs pin to limit the 
current through this path (Figure 5 shows a possible 
layout for this connection). The 1002 series resistor is 
required for power supplies which do not meet the volt- 
age difference specification, and also provides protec- 


tion in the case of a power supply failure (where the 5V 
supply remains on, but the 3.3V supply goes to zero). 


Note that the Vccsp pins for the future Pentium Over- 
Drive processor fan/heatsink unit should be connected 
directly to the 5V supply, and not through a series re- 
sistor. 


100 Ohm resistor (1/2 W | 


3.3 Volt "island" 


NOTE: 


+ & 


8: 1 wees 3 ee eee ! 


ee @eese 
tied ~-e2eee 
oe ,e0een 
H oo "tees aie eee 
ee 18000 
ee eevee 
coe: » Sey: KR eee 7006 
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The future Pentium™ OverDrive™ processor requires a 5V supply through the Vccsp pins. The Vccsp pins must be 
connected directly to the 5V supply, and not through the Vcocs pin protection resistor. 


Figure 5. Possible Layout for Vccsp Pin Connection 
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VCC5 + 0.3 {Max. input voltage} 
5.0 


Accepts input 
voltages in the 
0 to 5 V range 


2.0 TTL Vih 


0.8 TTL Vil 


Inputs 
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2.4 Voh (min) 
™ 2.0 TTLVih 
IintelDX4 


rocessor Outputs are 
E TTL compatible 


0.8 TTL Vil 
0.45 Vol (max) 


Outputs 
242034-19 


Figure 6. Voitage Relationships in an IntelDX4 Processor Based System with 5V External Logic 


2.1.5 EXPLANATION OF 5V TOLERANT 
INPUTS AND TTL COMPATIBLE OUTPUTS 


The IntelDX4 processor and future Pentium Over- 
Drive processor include 5V tolerant input buffers 
and TTL compatible outputs. This feature enables 
the processor to interface with existing 5V logic 
even though the processor is running at 3.3V inter- 
nally. 


In a system with the 3.3V IntelDX4 processor inter- 
faced to 5V components, the Vccsp pin on the 
IntelDX4 processor must be attached to the 5V sys- 
tem power supply. The Vccsp pin provides a volt- 
age reference for the processor’s input buffers. With 


Vccsp connected to 5V, the processor can accept 
‘input signals up to 5.3V, making its inputs compati- 
ble with 5V TTL or CMOS level outputs. 


Output voltages from the processor are guaranteed 
to be at or above 2.4V for a logic “1” and at or 
below 0.45V for a logic “0.” This allows the 
IntelDX4 processor to drive TTL compatible input 
levels (2.0V and 0.8V), but not 5V CMOS levels 
(“rail to rail’). Figure 6 shows the input and output 
voltage relationships for the IntelDX4 processor in 
a 5V system. 


In a 3.3V only system, the Vccsp pin should be 
connected to the 3.3V supply, as 5V tolerant opera- 
tion is not required. ' 
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2.2 Processor Power Supply 
Decoupling 


Processor power supply decoupling is critical for reli- 
able operation. With the IntelDX4 processor, there are 
two areas of concern: high frequency decoupling, neces- 
sitated by the high speed operation of the processor, 
and low frequency decoupling, necessitated by the pow- 
er saving features of the processor. 


2.2.1 HIGH FREQUENCY POWER SUPPLY 
DECOUPLING 


High frequency decoupling is critical on the IntelDX4 
processor because of its high speed external bus, and 


- 


NOTE: 
All values in microFarads 


: | 

intel. 
because of its very fast 100 MHz internal operation. A 
reliable design will include a minimum of nine 0.1 uF | 
capacitors and nine 0.01 ,.F surface mount capacitors 
between power and ground, evenly distributed, close to . 
the processor. The capacitors must be placed as close to 
the processor as possible, attached directly to the pow- 
er and ground planes, or circuit board inductance will 
significantly reduce their effectiveness. A typical fail- 
ure mode caused by inadequate high frequency decou- 
pling is unreliable or inconsistent program behavior. 
These failures are often intermittent, and are very hard 
to debug. Figure 7 shows a recommended layout for the 
high frequency capacitors, with values as shown. 


Appendix C includes a list of possible vendors for pow- 
er supply decoupling capacitors. 


@ 


fetes we te ne oe oe 
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Figure 7. Recommended High Frequency Capacitor Values and Layout 
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2.2.2 BULK POWER SUPPLY DECOUPLING 


Bulk, or low frequency, decoupling is needed on all SL 
Enhanced Intel486 processors, including the IntelDX4 
processor, since the processor may switch between nor- 
mal and low power states very quickly, causing large 
instantaneous current changes. To properly handle 
these instantaneous current changes, all designs must 
have adequate bulk decoupling. In 5V only systems, the 
processor can use the bulk decoupling capacitance all 
over the system board, but with the processor on a sep- 
arate power plane “island,” it is necessary to place ade- 
quate bulk capacitance on the processor “island.” For 
bulk decoupling, multiple capacitors each in the range 
of 10 »F to 100 pF are typically used in parallel to 
achieve the required capacitance while maintaining a 
low effective series resistance (ESR). You can deter- 
mine the amount of bulk decoupling required with the 
following formula: 


C = (Al* AT) / AV 


where AI is the maximum change in current, AT is the 
time it takes the power supply to adjust to the current 
change, AV is the allowable voltage change to remain 
within specification. 


The effective series resistance (ESR) must also be taken 
into account. You can find the maximum allowable 
ESR with this formula: 

ESR = AV/ Al 


where AV and AI are the same as in the first equation. 


For example, for the future Pentium OverDrive proces- — 


sor, the maximum change in current is about 2.8A. 
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The response time of a linear regulator may be around 
15 ys (contact regulator manufacturer for precise val- 
ue). With no guard band, the maximum allowable sup- 
ply voltage deviation from 3.3V is 0.3V, yielding the 
following: 


C = (2.8A * 15 ys) / 0.3V = 140 pF 
with a maximum allowable ESR: 
ESR = 0.3V / 2.8A = 0.110 


Placing four 47 .F tantalum surface mount capacitors 
in parallel, directly between the power and ground 
planes, will reduce the ESR below this limit and pro- 
vide adequate capacitance. Figure 8 shows a recom- 
mended layout for this example. 


For an example program which exercises the power 
saving features, Appendix D includes a program that 
alternates between HALT and normal operating modes 
with a keypress. This represents a typical load change 
for bulk capacitance testing with the IntelDX4 proces- 
sor, but does not cover the future Pentium OverDrive 
processor. The Intel Verification Program provides a 
Voltage Regulator Transient Tester in the IVP Pretest 
Kit. This tool simulates worst case load changes with 
the future Pentium OverDrive processor. Another tool 
available from Intel for power transient testing is the 
Power Validator, which includes on board limit testers 
and failure indicators. To order the Power Validator or 
the IVP Pretest Kit, contact your local Intel represent- 
ative. 


Appendix C includes a list of possible vendors for pow- 
er supply decoupling capacitors. 
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NOTE: 
All values in microFarads 


Figure 8. Recommended Bulk Decoupling Capacitor Values and Locations 
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2.2.3 WHY IS DECOUPLING NECESSARY? 


CMOS logic only consumes significant current 
when switching. This allows great power savings by 
shutting down elements of a processor not being 
used. With the IntelDX4 processor, virtually the 
entire chip may be shut down and quickly restarted. 
Switching this amount of current on and off in very 
short periods of time may cause serious power sup- 
ply voltage surges and droops in systems with inad- 
equate bulk decoupling. Adequate power supply 
bulk decoupling capacitance, located near the proc- 
essor, is necessary to filter these surges and droops 
(see Figure 8). The delayed response of voltage reg- 
ulators to load increases is the principle cause of 
power supply droops, with the amount of droop de- 
pending on the response time of the regulator as 
well as power supply inductance. Adequate bulk ca- 
pacitance is necessary to provide a current reservoir 
until the power supply or regulator can respond to 
the load increase. Surges are caused by inductance 
in the power supply, their severity being determined 
by the value of the inductance and the speed with 
which the current load drops. Most regulators. only 
boost the regulated supply if it falls below the speci- 
fied voltage, with the regulator turning off if the 
output voltage rises above the specified voltage. 
This means that a fast regulator will not lessen the 
effect of voltage surges. Another factor concerning 
the filtering ability of the bulk capacitance is the 
effective series resistance of the capacitor(s), which 
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is an element of the non-ideal behavior of real com- 
ponents. The effect of this resistance must be low 
enough to not offset the desired filtering effect of 
the capacitance. Figure 9 shows an oscilloscope 
measurement of a surge in the power supply at the 
processor in an IntelDX4 processor based system 
with poor low frequency decoupling capacitance as 
the processor enters the Auto HALT Power Down 
State. 


In addition, the increased internal speed of the In- 
telDX4 processor over the IntelDX2 processor 
means higher frequency noise components in the 
processor power supply. Traffic on the external bus 
causes high frequency power supply current spikes 


due to the large number of external outputs switch- 
ing. High frequency (low inductance) capacitors, 
connected between the power and ground planes, 
near the processor, are required to filter these high 
frequency components of the noise. The inductive 
effects of circuit board traces and component leads 
become critical at these frequencies. For this reason, 
it is critical that the high frequency capacitors are 
placed as near as possible to the processor, using 
short traces to minimize inductance. Surface mount 
capacitors placed directly next to the processor and 
inside the socket cavity are recommended (see Fig- 
ure 7). Figure 10 shows an oscilloscope measure- 
ment of the power supply at the processor in an 
IntelDX4 processor based system with poor high 
frequency decoupling capacitance. 
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Figure 9. Oscilloscope Measurement Showing Power Supply Surge with 
Processor Entering HALT Mode in System with Poor Bulk Decoupling 


Tek RH 2.00GS/s 1244 Acqs 


Senna: eee nee 


,:... © 2 Ss oF .. . . . . * . . 2 SS 
i 3 


a Sc 


a ee i cs | i i, i i ee 


i yy : " | | }. 384mV 


i. Cr eo) 


ee 17) 


es ¢ 2 § 5 5 6 » § 5» 6 6 6 § 4 5 6 6 § «og gw fg Fw wg gw gw Fg gw og gw Fw wg og Sw cw ew ww gw 


Moo ors Cat? SITY 25 Mar 1994 
15:25:03 


242034-—14 


Figure 10. Oscilloscope Measurement Showing Power Supply Noise 
in System with Poor High Frequency Decoupling 
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2.3 Clock Multiplier Selection 


The clock multiplier on the IntelDX4 processor may be | 


selected in the system using the CLKMUL pin. This 
pin is an INC (Internal No Connect) on the previous 
Intel486 processors. The CLKMUL pin is sampled 
during cold (power on) processor resets. The clock mul- 
tiplier cannot be changed during warm resets, and 
SRESET cannot be used to select the clock multiplier 
ratio. Typically, CLKMUL will be connected via a 
jumper to the proper setting. In systems with 33 MHz 
bus speeds, the IntelDX4 processor will operate with an 
internal clock frequency of 100 MHz if the CLKMUL 
pin is left unconnected, or is connected to Vcc. Exist- 
ing IntelDX2 processor based systems running at 
66 MHz internally with a 33 MHz bus speed require no 
modification to use the IntelDX4 processor at 
100 MHz internally and 33 MHz externally if the 
CLKMUL pin is left unconnected. In 50 MHz bus sys- 
tems, it is necessary to connect the CLKMUL pin to 
Vsg to achieve 100 MHz internal frequency operation. 
These relationships are shown in Table 2, and an exam- 
ple circuit is shown in Figure 11. 


Table 2. Clock Multiplier Selection 


CLKMUL External Internal 
Clock Clock 


at 
RESET Frequency | Frequency 
25 MHz 75 MHz 


Voc or 3 
Not Driven 


33 MHz 100 MHz 


[ves | 2 | someiz | 100M 


Clock 
Multiplier 


IntelDX4™ 
Processor 


CLKMUL 


242034-1 


Figure 11. Jumpers for Clock 
Multiplier Selections 


AP-505 


2.4 Thermal Considerations 


In desktop systems, proper thermal management is crit- 
ical to prevent system reliability problems caused by 
excess heat. The principle concern is with the 3.3V 
power supply when implemented with secondary on- 
board regulation. 


2.4.1 Voltage Regulator Considerations 


Special thermal consideration is required for systems 
implementing the 3.3V processor supply with a linear 
voltage regulator. The 10W dissipation of the future 
Pentium OverDrive processor requires a 3.0A power 
supply current. A linear voltage regulator will dissipate 
approximately 5W of power to provide 3.0A. To keep 
the voltage regulator within thermal specification, a 
heatsink will be necessary to dissipate the heat generat- 
ed by the regulator, If the temperature of the regulator 
is not maintained within the manufacturer’s specifica- 
tion, improper regulator operation may occur, jeopar- 
dizing regulator and processor reliability. 


The size and performance of the voltage regulator heat- 
sink are dependent upon the regulator specifications 
and system air flow. If the regulator is located adjacent 
to the upgrade processor socket, the ambient tempera- 
ture should not exceed 55°C; the limit for the future 
Pentium OverDrive processor. 


The following formula may be used to calculate the 
performance of a heatsink in a particular application: 


Oca = (Ty — Ta) / Pp — 95c 


where Oc, is the maximum allowable thermal resist- 
ance of the heatsink and insulator, Ty is the maximum 
regulator junction temperature, T, is the maximum al- 
lowable ambient temperature, Pp is the maximum 
power dissipated in the regulator, and @yc is the ther- 
mal resistance from the regulator junction to its case. 


For example, for the Linear Technology LT1085CT 
regulator, the values would be 125°C for Ty, 55°C for 
Ta, 3°C/W for 8y¢ so: 


(125°C — 55°C) / 5W — 3°C/W 
11°C/W (for the LT1085CT) 


OCA 


In this example, the thermal resistance of the heatsink 
and insulator (@cq4) cannot exceed 11°C/W to meet 
specifications in still air. 


Appendix C includes a list of possible vendors for heat- 
sinks. 
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2.4.2 PROCESSOR CONSIDERATIONS _ 


The power consumption of the IntelDX4 processor is 
comparable to that of the IntelDX2 processor, so exist- 
ing thermal solutions that adhere to the published spec- 
ifications should be adequate for the IntelDX4 proces- 
sor. 


The future Pentium OverDrive processor upgrade for 
the IntelDX4 processor will have an on-package fan/ 
' heatsink and is thermally equivalent to the Pentium 
OverDrive processor for IntelDX2 processor based sys- 
tems. The on-package fan/heatsink unit is powered by 
the Vocsp pins, which should be connected directly to 
the 5V system power supply. 


2.5 Placement and Layout 
Suggestions 


Figure 12 shows a complete suggested layout for the 
processor including the 3.3V power supply “island,” 
adequate power supply decoupling, a linear voltage reg- 
ulator, and a possible placement for the FET used with 
the VOLDET automatic voltage select circuit option. 
Note the placement of the capacitors close to the proc- 
essor, and the wide connection from the voltage regula- 
tor to the “island.” 


Figure 13 shows the clearance required for the 
IntelDX4 processor and the future Pentium OverDrive 
_ processor. The future Pentium OverDrive processor is 
physically larger than the IntelDX4 processor, so it is 
not sufficient to merely provide clearance for the 
IntelDX4 processor if the system is to be upgradeable 
with the future Pentium OverDrive processor. 
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2.6 Intel Verification Program 


The Intel Verification Program establishes mini- 
mum system design criteria for reliable and 
straightforward CPU upgradability with the Penti- 
um OverDrive processors. The criteria encompass 
physical, functional, electrical, thermal, and instal- 
lation attributes of the Pentium OverDrive proces- 
sors. Upon successful system design verification by 
Intel, licensed OEMs will be able to advertise and 
promote their branded systems as “Intel Verified.” 
An Intel published list of verified systems will pro- 
mote consumer awareness providing greater confi- 
dence in both system and upgrade buying decisions. 


Converting An “Intel Verified: For the 
Pentium OverDrive Processor” System 
Design 


The Intel Verification program for IntelDX4 proc- 
essor based system designs is similar to the recently 
introduced program for the IntelDX2 processor 
based systems. The criteria are basically the same 
with additional electrical and thermal tests. For 
IntelDX2 processor based designs that meet the 
criteria for the Pentium OverDrive processor, 
the additional criteria for the future Pentium 
OverDrive processor for the IntelDX4 processor 
pertain to the 3.3V supply specifications and voltage | 
supply thermal requirements. This application note 
addresses both issues. Contact your local Intel rep- 
resentative for further information on the Intel Ver- 
ification Program. 
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Figure 12. Complete Suggested Layout 
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Figure 13. Clearance for Future Pentium™ OverDrive™ Processor 


2.7 Cache and Memory 
| Considerations 


The IntelDX4 processor, because of its higher internal 
speed, will benefit even more from external perform- 
ance enhancing features than the IntelDX2 processor. 
In particular, a second level cache can provide signifi- 
cant performance gains compared to systems without a 
second level cache. In addition, the future Pentium 
OverDrive processor for the IntelDX4 processor will 
allow write back operation of the processor’s internal 
cache, which must be supported for maximum perform- 
ance. 


2.7.1 SECOND LEVEL CACHE 


For maximum performance, a second level cache 
should be used. A simple 128 Kbyte second level cache 
will improve performance 10%-20% over systems 
without a second level cache. Increasing the cache size 
to 256 Kbytes will improve performance by roughly an 
additional 5%. Using a write-back algorithm will pro- 
vide. a 10%-15% improvement over a write-through 
system. If a write-back algorithm is used, a two-way 
cache architecture will yield about 10% performance 
gain over a direct-mapped cache, but this improvement 
-is much less if a write-through algorithm is used. The 
greatest benefit is achieved when no wait states are re- 
quired, but if wait states are necessary, it is better to 
have one wait state on the leadoff cycle and no wait 
states in the burst cycles, than to have no wait state on 
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the leadoff and one wait state on the bursts. It may be 
necessary to use interleaving to achieve zero wait state 
burst access. Memory system performance also has a 
significant effect on system performance. The effect of 
these system design parameters will vary between appli- 
cations; the performance figures given above are for _ 
typical applications. See application note AP-469: 
Cache and Memory Design Considerations for the 
Intel486™ DX2 Microprocessor (Order Number: 
241261-001) for more information on cache and memo- 
ry system design. 


2.7.2 WRITE-BACK SUPPORT FOR THE 
FUTURE PENTIUM OVERDRIVE 
PROCESSOR 


For optimal performance with the future Pentium 
OverDrive processor, your system design should be 
able to support processors with write-back internal 
caches. Contact your local Intel representative for more 
information. 


3.0 SOFTWARE VISIBLE 
DIFFERENCES 


Table 3 summarizes the system programmer visible dif- 
ferences between the IntelDX4 processor and the 
IntelDX2 processor and their respective Pentium 
OverDrive processors. Note that the future Pentium 
OverDrive processor for IntelDX4 processor based de- 
signs is different from the Pentium OverDrive proces- 
sor for IntelDX2 processor based designs. 


AP-505 


Table 3. Programmer Visible Differences between the InteiDX4™ Processor, 
the IntelDX2T™ Processor and Their Corresponding Pentium™ OverDrive Processors 


intelIDX4™ 
Processor 


IntelDX2T™ 
Processor 


Component 
Identifier in DX 
Register at 
Reset 


WT Unified 
16 Kbytes, 
4-way 


Internal Cache 


NOTES: 
* Not available in older non-SL Enhanced versions 
** Contact your local Intel representative for details 


The following sections highlight the key points to con- 
sider when converting an IntelDX2 processor based 
design to an IntelDX4 processor based design, particu- 
larly processor identification and cache test register dif- 
ferences. 


Information concerning the IntelDX4™ processor Vir- 
tual Mode Extensions is not provided in this document. 
This information is specifically targeted at writers of 
operating system kernels and virtual memory manag- 
ers, and is available with the appropriate non-disclosure 
agreements in place. Please contact your local Intel rep- 
resentative for details. 


3.1 Processor Identification 


Processor identification is needed principally for testing 
of the processor during the power-on-self-test (POST) 
typically performed by the BIOS on RESET. Improper 
processor identification may result in improper cache 
testing, or an attempt to use a non-implemented in- 
struction. The correct way to identify the IntelDX4 
processor is by examining the contents of the DX regis- 
ter immediately after RESET, or by executing the 
CPUID instruction. All Intel SL Enhanced processors 


Future Pentium™ 
OverDrive™ Processor for 
the IntelDX4T™ Processor 


WT Unified WT/WB ** 
8 Kbytes, 
4-way 


Pentium™ OverDrive™ 
Processor for the 
intelDX2T™ Processor 


WT/WB ** 


implement the CPUID instruction. See application note 
AP-485: Intel Processor Identification with the CPUID 
Instruction (Order Number: 241618-001), for the prop- 
er use of the CPUID instruction. Alternately, the BIOS 
may read the DX register at startup and write the proc- 
essor information into a location in memory for later 
use. 


The IntelDX4 processor can be distinguished by its 
component identifier of 048x (hex), where the last nyb- 
ble is a version number. Since the system processor may 
be a Pentium OverDrive processor, the BIOS should be 
written to correctly identify and be compatible with the 
Pentium OverDrive processor. 


3.2 Cache Test Register Difference 


The cache test registers of the IntelDX4 processor are 
slightly different from those of the IntelDX2 processor, 
as shown in Figure 14. To avoid false indications of 
failure, BIOS or other software routines which test the 
cache should be modified to account for this difference. 
It should also be remembered that the future Pentium 
OverDrive processor for the IntelDX4 processor may 
be different from the IntelDX4 processor and the 
IntelDX2 processor in this respect. Contact your local 
Intel representative for details. 
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Cache Test Registers in IntelDX2T™ Processor 


Valid Bits 


(used only 
during reads)| during reads) 


Valid Bits 
(used only 


TR3 
Cache Data 
Test Register 


YTR4 
Cache Status 
4 Test Register 


TR5 
Cache Control 
Test Register 
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TR3 
Cache Data 
Test Register 


y TR4 
| Cache Status 
4 Test Register 


TRS 
Cache Control 
Test Register 
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Figure 14. Cache Test Register Difference between 
the InteIDX4™ Processor and the IntelDX2T™ Processor 
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4.0 SUMMARY 


As you plan your design using the IntelDX4 processor, 
please keep in mind the points addressed in this appli- 
cation note. Some of the key points are: 


© The IntelDX4 processor is a 3.3V component 


e Adequate high frequency power supply decoupling 
must be used 


e Adequate low frequency power supply decoupling 
must be used 


@ When using on-board regulation for processor pow- 
er supply, a heatsink will be needed to dissipate reg- 
ulator heat when supplying 3.0A for OverDrive 
processor 


Other points to remember are: 


¢ Meet the voltage difference requirement on Vccs 
pin, or use a 1002 resistor in series with the 5V 
supply 

e The IntelDX4 processor CLKMUL pin is used to 
select internal clock multiplier 


e The IntelDX4 processor provides 5V tolerant in- 
puts, and TTL compatible outputs 


e The future Pentium OverDrive processor for 
IntelDX4 processor requires 5V supply through 
Vccsp pins for fan/heatsink unit 


e The VOLDET pin may be used to automatically se- 
lect the power supply voltage 


® Correct processor identification is necessary for cor- 
rect testing by the BIOS 


Intel appreciates your interest in the IntelDX4 proces- 
sor. This document has been developed to allow you to 
minimize your investment in development time and 
bring a reliable design to market quickly. By keeping 
the points addressed in the application note in mind, 
you can be confident that your system will be ready to 
meet the new standard for 486 computing set by the 
100 MHz IntelDX4 processor. 
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APPENDIX .A 
-DESIGN CHECKLIST 


Reliable system designs must meet the requirements given in the IntelDX4 processor data book. The following 
design elements should be checkeci against the specifications in the IntelDX4 processor data book: . 


1) 3.3V processor power supply (Section 2.1) 


Type: System power supply or Regulator (linear or switching) 

Tolerance: +0.3V at processor Vcc pins 

Max current: At least 3.0A for OverDrive processor 

VCCS: | Less than 2.25V difference between Vcc pins and Vccs at all times (Use a 1002 resistor in 


series with Vocs pin if this requirement cannot be met) 


2) High frequency decoupling capacitance on board (Section 2.2.1) 


_ Type: High quality (low inductance, low resistance) 
Quantity: Nine 0.01 pF, nine 0.1 wF 
Location: As near as possible to the processor 
Test: . Unreliable or inconsistent operation may be caused by inadequate high frequency decoupling 


3) Bulk decoupling capacitance on board (Section 2.2.2) 

Value: Use formulas given in Section 2.2.2 

Test: Using available test programs and tools, does the processor supply voltage stay in specification? 
(Note: the OverDriveT™ processor may switch by 2.8A) 


4) Clock multiplier selection (Section 2.3) 
Note: Use the CLKMUL pin to select the desired multiplier 
(May be left unconnected to achieve 100 MHz internal with 33 MHz external operation) 


5) Thermal requirements (Section 2.4) 
Test: Is measured processor Tyase Within specification? 


Check: Does voltage regulator have an adequate heatsink to provide 3A for the OverDrive processor? (If 
using on-board regulation) 


6)-Physical requirements (Section 2.5) 
Check: Is clearance sufficient for OverDrive processor? 


7) Software (Section 3.0) 


Test: . Will the processor be properly identified (if applicable)? 
Check: Is the cache tested correctly? (If BIOS tests the cache) 
Check: Will an OverDrive processor be properly identified? 
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APPENDIX B 
DESIGN FLOWCHART 


Is design 3.3 V 
ready? 


Modify layout for a 
3.3 V "island" around 
the processor 
(Sect. 2.1.1) 


Use on-board 
voltage regulator? 
(Sect 2.1.2) 


Yes 


You need 3.0A 
(for OverDrive™ 
processor compatability) 
and adequate clearance 
and heatsinking. 


Use system power 
supply - add 
necessary connector 
and change layout. 


Implement 
flexible 
motherboard? 


Use VOLDET auto- 
voltage select or 
some other method 
to select processor 
voltage. 
(Sect. 2.1.3) 


Use recommended 
decoupling 
Capacitance with 


recommended 
layout. 
(Sect. 2.2) 


Observe Vccs pin 


requirement. 
(Sect. 2.1.4) 


v 


Use CLKMUL pin to 
select clock 
multiplier. 
(Sect. 2.3) 


v 


Check thermals and 


physical clearance. 
(Sect. 2.4-2.5) 


Make necessary 
circuit board 
changes. 


Does design meet 
thermal and power 
supply specifications 


. Fix BIOS. 


Does BIOS use the 
correct CPU 
identification routines 
and testing procedures? 
(Sect. 3.1-3.2) 


intelDX2™ processor to 
IntelDX4™ processor 


design conversion flowchart 
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APPENDIX C 
SUPPORT COMPONENT 
VENDOR LIST 


NOTE: 


This list is meant to be representative only, and may not include all vendors of a particular type. Intel has not 
tested all of the components listed below and cannot guarantee that these components will meet every PC manu- 


facturers specific requirements. 


Voltage Regulators: 


Linear Technology 

1630 McCarthy Blvd 

Milpitas, CA 95035 
Tel. (408) 432-1900 


Maxim Integrated Products 
120 San Gabriel Drive 
Sunnyvale, CA 94086 

Tel. (408) 737-7600 


Micrel Semiconductor 
- 1849 Fortune Dr. 
- San Jose, CA 95131 
Tel. (408) 944-0800 


National Semiconductor Corp. 
2900 Semiconductor Drive 
Santa Clara, CA 95052 

Tel. (800) 272-9959 


3.3V System Power Supplies: 


Astec America, Inc. 
401 Jones Road 
Oceanside, CA 92054 
Tel. (619) 757-1880 


Golden Systems, Inc. 
2125B Madera Road 
Simi Valley, CA 93065 
Tel. (805) 582-4400 


Teapo Electronic Corporation 


No. 3 Lane 89. Sec. 3 Chung-Yang Rd 
Tu-Cheng Hsiang, Taipei Hsein, Tiawan, R.O.C. 


Tel. 886 2 260-4151 
In USA: (404) 449-6220 
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Heatsinks for Linear Regulators: 


Aavid Engineering, Inc. 
P.O. Box 400 

Laconia, NH 03247 
Tel. (603) 528-3400 


-Thermalloy, Inc. 


2021 W.Valley View Lane 
Dallas, TX 75234 
Tel. (214) 243-4321 


Decoupling capacitors: 


Kemet Electronics Corporation 
P.O. Box 5928 

Greenville, SC 29606 

Tel. (803) 963-6348 


Mallory/North American Capacitor Company 
P.O. Box 1284 

Indianapolis, IN 46206 

Tel. (317) 273-0090 


Sanyo Video Components (USA) Corp. 


2001 Sanyo Ave. 


San Diego, CA 92073 
Tel. (619) 661-6835 


Power MOSFETS: 


Harris Semiconductor 
1301 Woody Burke Road 
Melbourne, FL 32902 
Tel. (800) 442-7747 
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International Rectifier Corp. Augat Inc. 
233 Kansas St. . 452 John Dietsch Blvd. 
El] Segundo, CA 90245 Attleboro Falls, MA 02763 
Tel. (310) 322-3331 , Tel. (800) 999-7646 
National Semiconductor Corp. Foxconn / Hon Hai Precision Industry Co., Ltd. 
2900 Semiconductor Drive . 2, Tzu Yu St., Tu-Chen 
Santa Clara, CA 95052 Taipei Hsien, Taiwan, R.O.C. 
Tel. (800) 272-9959 Tel. 02-268-3466 

In USA: (800) 727-3699 
Siliconix ji 
2201 Laurelwood Rd. Loranger International Corp. 
Santa Clara, CA 96056 | 817 Fourth Ave. 
Tel. (800) 554-5565 Warren, PA. 16365 

Tel. (814) 723-2250 
ZIF Sockets: Yamaichi Electronics Co., Ltd. 
Note: 3-28-7, Nakamagomi, Ohta-Ku 
Contact your local Intel representative for a list of qualified Tokyo 143 
sockets for the Intel Verification Program. Japan 

Tel. 03-3778-6160 
AMP Ince. In USA: (800) 769-0797 


219 American Avenue 
Greensboro, NC 27409-1806 
Tel. (800) 522-6752 


2-567 


AP-505 


=e Se Se Ne 


MEnter 
MExit 
Storage 
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; Test Program 


This program 
between Halt 


. MODEL 
- STACK 
. DATA 


DB 
DB 


APPENDIX D 


TEST PROGRAM FOR BULK | 
POWER SUPPLY DECOUPLING 


small 
100h 


‘Out of halt. 


DB: 0; 0, 0,0 


. CODE 


mov 
mov 
in 

mov 


BeginLoop: 


mov 
mov 
int 
MOV 
out 
iseha 
Hit 


mov 
MOV 
Mov 
int 
mov 
mov 
int 
mov 
mov 
mov 
int 
cmp 
je 

jmp 


ExitLoop: 


mov 
out 
mov 
int 
END 


ax, @data 
ds,ax 
al,21h 
Storage, al 


ah,9 

dx,OFFSET MEnter 
21h 

al,Ofdh 

21h, al 


al1,0OFFh 

ah, 0Ch 

al,6 — 

21h 

ah,9 

ax,OFFSET MExit 


ExitLoop 
BeginLoop 


al, Storage 
2ih,al 
ah,4ch 

21h 


for Bulk Power Supply Decoupling 


allows the user to cycle the processor 
and Normal modes by pressing a key. 


;Set up the stack and data segments. 


‘Entering halt, press a key to exit halt.',13,10,'$' 
Press ESC to quit, any other key to re-enter halt.',13,10,'$' 


;Beginning of the code segment. 
;Set up DS for data segment. 


;We only want keyboard interrupts, 

; so save the old state and we'll mask 

; the other ones in the loop. 

;Loop, alternating between halt and 

; waiting for keypress modes, until user 
; presses ESC key. 

;Print entering halt message using 

; string print function. 


;Mask all interrupts except keyboard. 


;There is an interrupt for keydown, and 

; another for keyup. Using two 'hlt' instructions 
; gives the expected action, although with a 
;.momentary spike in-between. 

;Get the last key pressed and discard. 


;Otherwise, print exiting halt message. 


;Clear keyboard buffer and 
; wait for a keypress. 


;If key is ESC, exit loop. 
;Otherwise, loop forever. 


;If the ESC key is pressed, 
;Restore normal interrupts. 


jump here. 


;Terminate program execution normally. 
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intel : PRELIMINARY 
INTEL OverDrive™ PROCESSORS 


m Powerful Processor Upgrades for 
Intel486™ Microprocessor-Based 
Systems 
— Significantly Accelerates All 

Software Applications 


m Intel OverDrive™ Processor Family 
Includes: 
— IntelSX2T™ OverDrive™ Processor 
— IntelDX2™ OverDrive™ Processor 
— IntelDX4™ OverDrive™ Processor 


m Will Upgrade Systems Based on: 
— Intel486™ SX Processors 
— Intel486™™ DX Processors 


m Large Installed Base of Thousands of 
Applications 


m Based on Advanced Intel486™ 
Processor Technology for 100% 
Compatibility 


Backed by intel’s Lifetime Warranty 


m Incorporates SMM Power Saving 
Features 
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Intel OverDrive processors are powerful, processor upgrades for Intel486 microprocessor-based systems. The 
IntelSX2 OverDrive processor is the most affordable processor upgrade designed for Intel486 SX processor- 
based systems. It features the Intel speed doubling technology, thereby accelerating all non-floating point 
intensive software. This provides home and small office users increased performance across a wide variety of 
applications. The IntelDX2 OverDrive processor is a powerful processor upgrade designed for Intel486 SX and 
Intel486 DX processor-based systems. Utilizing the Intel speed doubling technology, the IntelDX2 OverDrive 
processor accelerates both integer and floating point software, providing business users significantly in- 
creased performance across a wide variety of applications. The IntelDX4 OverDrive processor is a high-end 
processor upgrade designed for Intel486 SX and Intel486 DX processor-based systems. Utilizing the Intel 
speed tripling technology, the IntelDX4 OverDrive processor accelerates both integer and floating point soft- 
ware, providing power users with significant software performance improvements across a wide variety of 
applications. 


*Other brands and names are the property of their respective owners. 
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1.0 INTRODUCTION 


This data sheet describes the Intel OverDrive proc- 
essors, a family of CPU upgrades for Intel486 proc- 
essor-based systems. This family includes the In- 
telSX2 OverDrive processor, the IntelDX2 OverDrive 
processor and the IntelDX4 OverDrive processor. 
These processor upgrades significantly accelerate 
all software applications run on your existing system, 
thereby increasing the overall performance of your 
PC. 


It is important to note that this data sheet is intended 
to be used in conjunction with the data sheet for the 
original processor in the system—the Intel486 SX or 
Intel486 DX processor—which describes the Intel 
Family Architecture and functionality (Order # 
241731-001). All enhancements or differences be- 
tween the OverDrive processor and the original 
processor (i.e., IntelSX2 OverDrive vs. Intel486 SX2 
processor, IntelDX2 or IntelDX4 OverDrive vs. In- 
tel486 DX processor) are described in this data 
sheet. 


Intel486 SX or Intel486 DX processor-based sys- 
tems that are compatible to the Intel OverDrive proc- 
essor(s) must be designed to both the original proc- 
essor specifications and the Intel OverDrive proces- 
sor(s) specifications. 
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1.1 Product Overview 


The following sections provide an overview of each 
of the OverDrive processors. Refer to the specific 


_ product section(s) for more detailed information. 


Figure 1-1 lists some of the key features of each 
OverDrive processor. Figure 1-2 describes the up- 
grade choices available for an existing intel486 SX 
or DX system. 


1.1.1 IntelSX2T OverDrive™’ PROCESSOR 


The IntelSX2 OverDrive processor is the most af- 
fordable upgrade designed for Intel486 SX proces- 
sor-based systems. It features the Intel speed dou- 
bling technology, thereby accelerating all non-float- 
ing point intensive software. This provides home and 
small office users increased performance across a 
wide variety of software applications. 


Based on the Intel486 SX2 processor technology, 
the IntelSX2 OverDrive processor integrates an inte- 
ger unit, a memory management unit and an 8 KByte 
cache on a single chip. The speed doubling technol- 
ogy allows the processor to operate internally at 
twice the speed of the system bus; up to a maximum 
of 50 MHz for a 25 MHz system. 


eFor Up To 25 MHz Intel486 SX Processor-Based Systems 


IntelSx2/™ 
OverDrive 


Processor e Intel487SX Pinout 


e Based on Intel486 SX2 
Speed Doubling Technology 


e Hardware Floating Point Math Unit Not Supported 
e8 KByte Integrated Instruction and Data Cache 


eFor Up To 33 MHz Intel486 SX or DX Processor-Based Systems 


IntelDX2/™ 
OverDrive 
Processor 


e Based on Intel486 DX2 Technology, which Includes =pere 
Doubling and Floating Point Math Unit 
eintel486 DX and Intel487 Pinouts 


e 8 KByte Integrated Instruction and Data Cache 


intelDx4'M 
OverDrive 
Processor 


eFor Up To 33 MHz Intel486 SX or DX Processor-Based Systems 
@ Based on Intel486 DX4 Speed Tripling Technology, 

which Includes Floating Point Math Unit 
e Intel486 DX and Intel487 Pinouts 


© 16 KByte Integrated Instruction and Data Cache 
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Figure 1-1. Key Features 


PRELIMINARY 


3-3 


INTEL OverDrive™ PROCESSORS 


Current 
System 


intel486/Msx 
System 


Intel486'Mpx 
System 


Upgrades To 


OverDrive! ™ 


Processor 


IntelSx2'™ 
OverDrive 
Processor 


IntelDx2/™ 
OverDrive 
Processor 


IntelDx4™M 
OverDrive 
Processor 


290436-28 


Figure 1-2. Upgrade Choices 


The IntelSX2 OverDrive processor is designed to be 
installed into the OverDrive processor socket of In- 
tel486 SX processor-based systems. It can also re- 
place the existing Intel486 SX processor in single- 
socket systems. 


1.1.2 IntelDX2T™ OverDrive™’ PROCESSOR 


The IntelDX2 OverDrive processor is a powerful 
processor upgrade designed for Intel486 SX and In- 
tel486 DX processor-based systems. Based on the 
Intel486 DX2 processor, it contains the Intel speed 


doubling technology. This accelerates both integer 


and floating point software, provicing business users 
significantly increased performarice across a wide 
variety of software applications. 


The IntelDX2 OverDrive processor integrates an in- 


teger unit, a floating point math coprocessor unit, a 
memory management unit and an 8 KByte cache on 
a single chip. The speed doubling technology allows 
the processor to operate internally at twice the 
speed of the system bus; up to a maximum of 
66 MHz for a 33 MHz system. 


The IntelDX2 OverDrive processor comes in two 
package offerings; 168-lead Pin Grid Array (PGA) 
and 169-lead PGA. It is designed to be installed into 
the OverDrive processor socket cf Intel486 SX and 
DX processor-based systems. It can also replace 
the existing processor in single-socket systems. 


1.1.3 IntelDX4™ OverDrive™’ PROCESSOR 


The IntelDX4 OverDrive processor is a high-end 


processor upgrade designed for Intel486 SX and 
Intel486 DX processor-based systems. Utilizing the 
Intel speed tripling technology, the IntelDX4 
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OverDrive processor accelerates both integer and 
floating point software, providing power users with 
significant software performance improvements 
across a wide variety of applications. 


The IntelDX4 OverDrive processor integrates an in- 
teger unit, a floating point math coprocessor unit, a 
memory management unit and a 16-KByte cache on 
a single chip. The speed tripling technology allows 
the processor to operate internally at three times the 
speed of the system bus; up to a maximum of 
100 MHz for a 33 MHz system. 


The IntelDX4 OverDrive processor comes in two 


- package offerings; 168-lead Pin Grid Array (PGA) 
_ and 169-lead PGA. It is designed to be installed into 


the OverDrive processor socket of Intel486 SX and 
DX processor-based systems. It can also replace 
the existing processor in single-socket systems. 


1.2 Pinouts 


Refer to Figures 1-3 and 1-4 for an illustration of 
each of the two PGA packages. Figure 1-3 shows 
the 169-lead PGA package, while Figure 1-4 illus- 
trates the 168-lead PGA package. 


Table 1-1 cross-references the pin number to pin 
function for the 169-lead PGA package. Table 1-2 is 
a cross-reference for the 168-lead package. 


Table 5-1 in Section 5 gives a brief description of the 
function of each pin. 


Refer to each specific OverDrive processor section 


for a description of any differences from the pinouts 
described in this section. 
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Figure 1-3. 169-Lead PGA Bottom View Pinout (ODP) 
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Figure 1-4. 168-Lead PGA Bottom View Pinout (ODPR) 
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Table 1-1. 169-Lead PGA Pin Cross Reference by Pin Name (ODP) 


A20M # PLOCK # 
ADS # PWT 
AHOLD RDY # 
BEO# RESET 
BE1# SMI # 
BE2# SMIACT # 
BES# SRESET 
BLAST # STPCLK# 
BOFF # UP# 
BRDY # W/R# 
BREQ# 

BS8 # 

BS16# 

CLK 

D/C# 

DPO 

DP1 

DP2 

DP3 

EADS # 

FERR# 

FLUSH # 

HLDA 

HOLD 

IGNNE # 

INTR 

KEN # 

LOCK # 

M/lIO# 

NMI 

PCD 

PCHK # 


NOTES: 

1. All NC pins must remain unconnected. 

2. Refer to each specific OverDrive section for differences in pin functions. 
3. NC = No Connection. 

4. INC = internal No Connect. 
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Table 1-2. 168-Lead PGA Pin Cross Reference by Pin Name (ODPR) 


PLOCK# 
PWT 
RDY # 
RESET 
SMI # 
SMIACT # 
SRESET 
STPCLK # 
UP# 
W/R# 


' NOTES: 

1. All NC pins must remain unconnected. 

' 2. Refer to each specific OverDrive section for differences in pin functions. 
3. NC = No Connection. 

4. INC = Internal No Connect. 
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2.0 IntelSX2T™ OverDrive™ PROCESSOR FOR Intel486™ SX 
MICROPROCESSOR-BASED SYSTEMS 


m Processor Upgrade for Intel486™ SX m Utilizes Intel486™ SX2 Speed-Doubling 
Processor-Based Systems Technology 
— Single-Chip Upgrade — Processor Core Runs at Twice the 
: , : Frequency of the System Bus | 
m 169-Lead Pin Grid Array Package é : 
— Pin Compatible With Intel487 SX ne ompal ve WN 22 Mire ao. rane ane 
Math CoProcessor 16 MHz Systems 
— 169th Alignment Pin Ensures Proper = Binary Compatible with Large Installed 
Chip Orientation Software and Operating System Base 
‘ : , — MS-DOS, OS/2™, Windows — 
mg High Integration Enables On-Chip ; : 
— 8 KByte Code and Data Cache — UNIX System V/386 


— IRMX, IRMK™ Kernals 
m@ High Performance Design 


— Paged, Virtual Memory Management 


m= Complete 32-Bit Architecture 7 
== ieee and Data Busses — Core Clock Speed up to 50 MHz 
— Registers — 80 Mbyte/sec Burst Bus 


— 8-, 16-, 32-Bit Data Types — CHMOS V Process Technology 


m SL Enhanced Intel486™ 
Microprocessor Features Included 
On-Chip 


The IntelSX2 OverDrive processor for Intel486 SX microprocessor-based personal computers is a processor 
upgrade that offers excellent price/performance. Based on the new Intel486 SX2 technology, the IntelSX2 
OverDrive processor integrates an integer unit, a memory management unit and an 8 KByte cache on a single 
chip. 


Using the Intel486 SX2’s speed doubling technology, the IntelSX2 OverDrive processor operates internally at 
twice the speed of the system bus. This allows users of Intel486 SX microprocessor-based systems to double 
the frequency of their computer’s processor by adding a single chip, without upgrading or modifying any other 
system components. For example, adding an IntelSX2 OverDrive processor to an Intel486 SX 25 MHz system 
will double the processors internal operating speed to 50 MHz. 


The IntelSX2 OverDrive processor is designed to either replace the existing processor or be installed into the 
Math CoProcessor socket of Intel486 SX microprocessor-based systems. 
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The IntelSX2 OverDrive processor is based on the 
Intel486 SX2 microprocessor technology. This tech- 
nology doubles the clock speed of the internal proc- 
essor core, while interfacing with the system at the 
same external clock speed. When installed in a 
25 MHz Intel486 SX microprocessor-based system, 
the internal processor core, integer unit and cache 
operate at 50 MHz, while the speed of the external 
bus remains at 25 MHz. This provides increased 
processor performance while maintaining compati- 
bility with the existing system design. 


The IntelSX2 OverDrive processor is currently avail- 
_ able in one product version. The 50 MHz IntelSX2 
OverDrive processor is designed to upgrade 
25 MHz, 20 MHz and 16 MHz Intel486 SX micro- 
processor-based systems. The speed doubling tech- 
nology wiil thereby double the internal speed of the 
processor to 50 MHz, 40 MHz and 32 MHz, respec- 
tively. 


This product comes with a (0.25” high) heat sink 
attached to the standard 169-lead PGA package to 
aid in heat dissipation. ; 


intel. 


The IntelSX2 OverDrive processor includes the pow- 
er management features of the SL Enhanced In- 
tel486 SX2 microprocessor and is binary compatible 
with a large base of software based on DOS, OS/2, 
Windows and Unix operating systems. 


For more detailed information about the operation of 
the IntelSX2 OverDrive processor, refer to the Intel 
SX2 microprocessor data book (Order #241731- 
001). : | 


2.1 Socket Configurations 


The IntelSX2 OverDrive processor canbe used to 
upgrade a single-socket Intel486 SX microproces- 
sor-based system by replacing the original proces- 
sor with the OverDrive processor. In a two-socket 
system, the IntelSX2 OverDrive processor can sim- 
ply be placed into the empty OverDrive processor 
socket, while the original processor remains in its 
socket. Figure 2-1 illustrates this. 


Figure 1-3 shows the bottom-view (pin-side) pinout 
diagram of the 169-lead Pin Grid Array (PGA) pack- 
age. Table 1-1 cross references the device’s pin 
numbers to the pin names. | 


IntelSX2T™ OverDrive™ Processor (50 MHz) 


intelSx2™™ 

OverDrive 

Processor 
(SX20DP50) 


Intel 
OverDrive 
Processor 

Socket 


IntelSx2™™ 

OverDrive 

Processor 
(SX20DP50) 


16 MHz, 20 MHz 
or 25 MHz 
intel486 SX 

Processor 


Single Socket 
System 


16 MHz, 20 MHz 
or 25 MHz 
Intel486 SX 

Processor 


Dual Socket 
System 
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Figure 2-1. InteliSX2T™ OverDrive™ Processor Sockets 
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2.2 Differences between the 
IntelSX2T’ OverDrive™ 
Processor and the Intel486T™ SX 
Processor 


A complete pinout diagram for the 169-lead PGA 
package can be seen in Figure 1-3. This diagram 
shows the standard pinout for the package, which is 
utilized by all three of the OverDrive processors. Ta- 
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ble 1-1 contains a complete pin description for the 
same package. Again, this is the standard pin de- 
scription for the package which is used by all three 
OverDrive processors. 


Table 2-1 identifies the differences between the 
standard ODPR pinout and the pinout used on the 
IntelSX2 OverDrive processor. 


Table 2-1. IntelSX2T Pinout Differences 


ODP (IntelDX2, IntelDX4) 


ODP (IntelSX2) 


: PRELIMINARY 


[Package OverDrive Processor) | Pinata | Pina | Pats 
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3.0 IntelDX2T™ OverDrive™’ PROCESSOR FOR intel486™ SX AND DX 
MICROPROCESSOR-BASED SYSTEMS 


a Processor Upgrade for |intel486™ SX mw Floating Point Math Unit Included On- 
and DX Processor-Based Systems Chip 
— Single-Chip Upgrade : : : pre 
— Increases Both Integer and Floating s ds aps pace aoe 
Point Performance — Paged, Virtual Memory Management 
m Two Package Variations to Support , : , | 
Systems with and without an - a a al we age netalee 
OverDrive™ Processor Socket | — MS-DOS. 0S/2™. Windows 
m 169-Lead Pin Grid Array Package | — UNIX System V/386 
— Pin Compatible with Intel487™ Sx — IRMX, IRMKT Kernals 
Math CoProcessor , ‘ 
— 169th Alignment Pin Ensures Proper . ie ieee eaeccan ie 66 MHz 
Sip Onentaten — 106 Mbyte/sec Burst Bus 
m 168-Lead Pin Grid Array Package — CHMOS V Process Technology 
— Pin Compatible with Intel486™ DX = Complete 32-Bit Architecture 
Processor | — Address and Data Busses - 
@ Utilizes Intel486™ DX2 Speed-Doubling — Registers 
Technology — §-, 16-, 32-Bit Data Types 
ee re ee eee aS = Compatible with Intel SL Enhanced 
Frequency of the System Bus Features 


- —Compatible with 33, 25, 20 and 
16 MHz Systems 


The Intel486 DX2 OverDrive processor for Intel486SX and DX microprocessor-based systems is a powerful, 
processor upgrade that offers excellent price/performance. Based on Intel’s Intel486 DX2 technology, the 
IntelDX2 OverDrive processor integrates an integer unit, a floating point unit, a memory management unit, SL 
Enhanced features and an 8 KByte cache on a single chip. 


Using the Intel486 DX2’s speed doubling technology, the IntelDX2 OverDrive processor operates internally at 
twice the speed of the system bus. This allows users of Intel486 SX and DX microprocessor-based systems to 
double the frequency of their cornputer’s processor by adding a single chip, without upgrading or modifying 
any other system components. For example, adding an IntelDX2 OverDrive pleceesor to an Intel486 DX 33 
MHz system will double the processors internal operating speed to 66 MHz. 
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The IntelDX2 OverDrive processor is based on the 
Intel 486 DX2 microprocessor technology. This tech- 
nology doubles the clock speed of the internal proc- 
essor core, while interfacing with the system at the 
same external clock speed. When installed in a 


33 MHz Intel486 SX or DX microprocessor-based 


system, the internal processor core, integer unit, 
floating point unit and cache operate at 66 MHz, 
while the speed of the external bus remains at 
33 MHz. This provides increased processor perform- 
ance while maintaining compatibility with the existing 
system design. 


The IntelDX2 OverDrive processor is currently avail- 
able in four product versions, which consist of two 
speed options (50 MHz and 66 MHz) and two pack- 
age options (168-lead Pin Grid Array (PGA) and 169- 
lead PGA). 


The 50 MHz IntelDX2 OverDrive processor is de- 
signed to upgrade 25 MHz Intel486 DX microproces- 


INTEL OverDrive™ PROCESSORS 


sor-based systems and 16 MHz, 20 MHz and 


~ 25 MHz Intel486 SX microprocessor-based systems. 


The 66 MHz IntelDX2 OverDrive processor is de- 
signed to upgrade 33 MHz Intel486 SX and DX 
microprocessor-based systems. Figures 3-1 and 3-2 
illustrate this. The speed doubling technology 
doubles the internal speed of the processor to twice 
the bus speed of the existing system. 


These products come with a (0.25” high) heat sink 
attached to the standard 169-lead PGA or 168-lead 
PGA package to aid in heat dissipation. All IntelDX2 
OverDrive processors are binary compatible with a 
large base of software based on DOS, OS/2, Win- 
dows and Unix operating systems. 


For more detailed information about the operation of 
the IntelDX2 OverDrive processor, refer to the In- 
tel486 DX2 microprocessor data. book (Order 
#241731-001). 


intelIDX2™ OverDriveT™ Processor (50 MHz) 


IntelDX2'™™ 
OverDrive 


Processor 
(DX20DPR50) 


IntelDx2/™ Intel 

OverDrive OverDrive 

Processor 
Socket 


Processor 


(DX20DP50) 


ns 


25 MHz Intel486 DX Processor 


or Single Socket 
16,20 or 25MHz System 


Intel486 SX Processor 


25 MHz Intel486 DX Processor 


Figure 3-1. 50 MHz intelDX2T™ OverDrive™ Processor Sockets 
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IntelDX2'™ 

OverDrive 

Processor 
(DX20DPR66) 


IntelDx2™ | intel. 

OverDrive OverDrive 

Processor 
Socket 


Processor 
(DX20DP66) 


Figure 3-2. 66 MHz IntelDX2™ QOverDrive™ Processor Sockets 


3.1 Socket Configurations 


Both single-socket and two-socket system configu- 
rations can be upgraded with the IntelDX2 OverDrive 
processor. In a single-socket Intel486 microproces- 
sor-based system, this is done by replacing the origi- 
nal processor with the OverDrive processor. In a 
two-socket system, the IntelDX2 OverDrive proces- 
sor can simply be placed into the empty OverDrive 
processor socket. Figures 3-1 and 3-2 illustrate this. 


3.2 169-Lead PGA Device (DX20DP) 


The 169-lead version of the IntelDX2 OverDrive 
processor is currently available in two speeds; 
50 MHz (DX2ODP50) and 66 MHz (DX2ODP66). 
The 169-lead versions are designed to be used in 
two-socket systems and contain a key pin to assure 
proper orientation of the device (refer to Figures 3-1 
and 3-2). The OverDrive processor is simply inserted 
into the OverDrive processor socket, while the origi- 
nal processor remains in its socket. 


IntelDX2™ OverDrive™ Processor (66 MHz) 


33 MHz Intel486 DX Processor 


2, OF System 
| 33 MHz Intel486 SX Processor 
33 MHz Intel486 DX Processor |. Dual Socket 
| or System 


33 MHz Intel486 SX Processor 


Single Socket 


290436-33 


Figure 1-3 shows the bottom-view (pin-side) pinout 
diagram of the 169-lead Pin Grid Array (PGA) pack- 
age. Table 1-1 cross references the device’s pin 
numbers to the pin names. 


3.3 168-Lead PGA Device (DX20DPR) 


The 168-lead version of the IntelDX2 OverDrive 
processor is currently available in two speeds; 
50 MHz (DX2ODPR50) and 66 MHz (DX2ODPR66). 
The 168-lead versions are designed to be used ina 
single-socket system (refer to Figures 3-1 and 3-2). 
The existing processor is removed and the upgrade 
processor is simply inserted into the same socket. 


Figure 1-4 shows the bottom-view (pin-side) pinout 
diagram of the 168-lead Pin Grid Array (PGA) pack- 
age. Table 1-2 cross references the device’s pin 


~ numbers to the pin names. 
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4.0 IntelDX4™ QOverDrive™ PROCESSOR FOR iIntel486™ SX AND DX 
MICROPROCESSOR-BASED SYSTEMS 


The IntelDX4 OverDrive processor is a high performance, processor upgrade for Intel486 SX and DX micro- 
processor-based systems. It operates at a maximum internal core frequency of 100 MHz and is available in 
two package versions. When installed in a system, the IntelDX4 OverDrive processor significantly increases 


Processor Upgrade for Intel486™ SX 

and DX Processor-Based Systems 

— Single-Chip Upgrade 

— Increases Both Integer and Floating 
Point Performance 


Two Package Variations to Support 
Systems with and without an 
OverDrive™ Processor Socket 


169-Lead Pin Grid Array Package 

— Pin Compatible with Intel487™ Sx 
Math CoProcessor 

— 169th Alignment Pin Ensures Proper 
Chip Orientation 


168-Lead Pin Grid Array Package 
— Pin Compatible with Intel486™ DX 
Processor 


High Integration Enables On-Chip 
— 16 KByte Code and Data Cache 
— Paged, Virtual Memory Management 


Floating Point Math Unit Included 
On-Chip 


both the integer and floating point performance. 


The IntelDX4 OverDrive processor offers several new features not found in the previous Intel OverDrive 
processors for Intel486 SX and DX microprocessor-based systems. It has 16 KByte on-chip cache and the 
internal core operates at 3x (speed tripled) the external clock frequency. The IntelDX4 OverDrive processor 
supports System Management Mode (SMM) and Stop Clock Mode. The SMM and Stop Clock Mode, identical 
to those implemented in SL Enhanced Intel486 SX and DX microprocessors, make the IntelDX4 OverDrive 
processor compatible with the advanced power management, system security and device emulation features 


of SL Enhanced systems. 


The underlying technology behind the IntelDX4 OverDrive processor is the IntelDX4 microprocessor core with 
on-package voltage regulation. This allows the OverDrive processor to plug directly into existing 5V systems. 
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Utilizes Intel486™ DX4 Speed-Tripling 
Technology 


— Processor Core Runs at Three Times 


the Frequency of the System Bus 
-— Compatible with 33, 25, 20 and 
16 MHz Systems 


Binary Compatible with Large Installed 
Software and Operating System Base 
— MS-DOS, OS/2™, Windows 

— UNIX System V/386 

— IRMX, IRMK™ Kernals 


High Performance Design 
— Core Clock Speed up to 100 MHz 
— CHMOS V Process Technology 


Complete 32-Bit Architecture | 
— Address and Data Busses 
— Registers 

— 8-, 16-, 32-Bit Data Types 


SL Enhanced Intel486™ 
Microprocessor Features Included 
On-Chip 
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The IntelDX4 OverDrive processor is based on the 


Intel486 DX4 microprocessor technology. This tech- 
nology triples the clock speed of the internal proces- 
sor core, while interfacing with the system at the 
same external clock speed. When installed in a 
33 MHz Intel486 SX or DX microprocessor-based 
system, the internal processor core, integer unit, 
floating point unit and cache operate at 100 MHz, 
while the speed of the external bus remains at 
33 MHz. This provides increased processor perform- 
ance while maintaining compatibility with the existing 
system design. In addition, the internal cache has 
been doubled to 16 KBytes. 


The IntelDX4 OverDrive processor is currently avail- 
able in four product versions, which consist of two 
speed options (75 MHz and 100 MHz) and two pack- 
age options (168-lead Pin Grie Array (PGA) and 169- 
lead PGA). 


- The 100 MHz OverDrive processors are designed to 
upgrade 33 MHz Intel486 SX and DX microproces- 
sor-based systems. The 75 MHz OverDrive proces- 


intel. 


- sors are designed to upgrade 25 MHz Intel486 DX 


microprocessor-based systems and 16 MHz, 
20 MHz and 25 MHz Intel486 SX microprocessor- 
based systems. Figures 3-1 and 3-2 illustrate this. 
The speed tripling technology will triple the internal 
speed of the processor to three times the bus speed 
of the existing system. 


These products come with a (0.6” high) heat sink 
attached to the standard 169-lead PGA or 168-lead 
PGA package to aid in heat dissipation. Refer to 
Sections 10.0 and 11.0 for clearance and thermal 
requirements. All IntelDX4 OverDrive processors are 
binary compatible with a large base of software 
based on DOS, OS/2, Windows and Unix operating 
systems. 


For more detailed information about the operation of 
the IntelDX4 OverDrive processor, refer to the Intel 
DX4 microprocessor data book (Order #241944- 
001). 
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Figure 4- 
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1. 75 MHz IntelDX4T™ OverDrive™ Processor Sockets 
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IntelDX4™ OverDrive™ Processor (100 MHz) 


IntelDx4™™ 

OverDrive 

Processor 
(DX40DPR100) 


IntelDX2™ intel 
OverDrive OverDrive 


Processor Processor 
(DX40DP 100) Socket 


33 MHz Intel486 DX Processor 


33 MHz Intel486 DX Processor 


Single Socket 


oF System | 
33 MHz Intel486 SX Processor 


Dual Socket 
or System 


33 MHz Intel486 SX Processor 
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Figure 4-2. 100 MHz IntelDX4™ OverDrive™ Processor Sockets 


4.1 Socket Configurations 


Both single-socket and two-socket system configu- 
rations can be upgraded with the IntelDX4 OverDrive 
processor. In a single-socket Intel486 microproces- 
sor-based system, this is done by replacing the origi- 
nal processor with the OverDrive processor. In a 
two-socket system, the IntelDX4 OverDrive proces- 
sor can simply be placed into the empty OverDrive 
processor socket. Figures 4-1 and 4-2 illustrate this. 


4.2 169-Lead PGA Device (DX40ODP) 


The 169-lead version of the IntelDX4 OverDrive 
processor is currently available in two speeds; 
75 MHz (DX4ODP75) and 100 MHz (DX4ODP100). 
The 169-lead versions are designed to be used ina 
two-socket system and contain a key pin to assure 
proper orientation of the device (refer to Figures 4-1 
and 4-2). The processor is simply inserted into the 
OverDrive processor socket, while the original proc- 
essor remains in its socket. 
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_Figure 1-3 shows the bottom-view (pin-side) pinout 


diagram of the 169-iead Pin Grid Array (PGA) pack- 
age. Table 1-1 cross references the device’s pin 
numbers to the pin names. 


4.3 168-Lead PGA Device 


The 168-lead version of the IntelDX4 OverDrive 
processor is currently available in two speeds; 
75 MHz (DX4ODPR75) and 100 £=MHz 
(DX4ODPR100). The 168-lead versions are de- 
signed to be used in a single-socket system (refer to 
rigures 4-1 and 4-2). The existing processor is re- 
moved and the upgrade processor is simply inserted 
into the same socket. | 


Figure 1-4 shows the bottom-view (pin-side) pinout 
diagram of the 168-lead Pin Grid Array (PGA) pack- . 
age. Table 1-2 cross references the device’s pin 
numbers to the pin names. 
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5.0 PIN DESCRIPTIONS 


Tables 5-1 through 5-4 list pin descriptions of the 
signals present on the IntelSX2, Intel DX2 and In- 
telDX4 OverDrive processors. 


DATA BUS 


D31-D0 1/O 


DATA PARITY 


DPO-DP3 


PCHK # 


3-18 


Table 5-1. Pin Descriptions 
Name and Function 


Clock provides the fundamental timing for the bus interface unit and is multiplied by 
two (2x) for the IntelSX2 and IntelDX2 OverDrive Processors or three (3x) for the 
IntelDX4 OverDrive Processor to provide the internal frequency for the Intel 
OverDrive processor. All external timing parameters are specified with respect to 
the rising edge of CLK. | | 


A31-Az2 are the address lines of the processor. A31-A2, together with the byte 
enables BEO # -BE3#, define the physical area of memory or input/output space 
accessed. Address lines A31—-A4 are used to drive addresses into the processor to 
perform cache line invalidations. Input signals must meet setup and hold times too 
and to3. A31-—A2 are not driven during bus or address hold. 


The byte enable signals indicate active bytes during read and write cycles. During 
the first cycle of a cache fill, the external system should assume that all byte 
enables are active. BE3# applies to D24—D31, BE2# applies to D16-D23, BE1# 
applies to D8-D15 and BEO# applies to DO-D7. BEO #-BE3# are active LOW and 
are not driven during bus hold. 


These are the data lines for the Intel OverDrive processor. Lines DO—D7 define the 
least significant byte of the data bus while lines D24-—D31 define the most 
significant byte of the data bus. These signals must meet setup and hold times too 
and to3 for proper operation on reads. These pins are driven during the second and 
subsequent clocks of write cycles. 


There is one data parity pin for each byte of the data bus. Data parity is generated 
on all write data cycles with the same timing as the data driven by the Intel 
OverDrive processor. Even parity information must be driven back into the 
microprocessor on the data parity pins with the same timing as read information to 
insure that the correct parity check status is indicated by the Intel OverDrive 
processor. The signals read on these pins do not affect program execution. _ 
Input signals must meet setup and hold times too and to3. DPO—DP3 should be 
connected to Vcc through a pullup resistor in systems which do not use parity. 
DPO-DP*9 are active HIGH and are driven during the second and subsequent clocks 
of write cycles. 


Parity Status is driven on the PCHK # pin the clock after ready for read operations. 
The parity status is for data sampled at the end of the previous clock. A parity error 
is indicated by PCHK # being LOW. Parity status is only checked for enabled bytes 
as indicated by the byte enable and bus size signals. PCHK # is valid only in the 
clock immediately after read data is returned to the microprocessor. At all other 
times PCHK # is inactive (HIGH). PCHK # is never floated. 
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Table 5-1. Pin Descriptions (Continued) 


| Symbol | Type | Name and Function 


BUS CYCLE DEFINITION 


The memory/input-output, data/control and write/read lines are the primary bus 


definition signals. These signals are driven valid as the ADS # signal is asserted. 
M/iO# D/C# W/R# Bus Cycle Initiated 


Interrupt Acknowledge 
Halt/Special Cycle 
1/O Read 


I/O Write 
Code Read 
Reserved 
Memory Read 
Memory Write 


—-O-- O- Oo =| © 


The bus definition signals are not driven during bus hold and follow the timing of the 
address bus. Refer to Section 7.2.11 for a description of the special bus cycles. 


LOCK # The bus /ock pin indicates that the current bus cycle is locked. The Intel OverDrive 
processor will not allow a bus hold when LOCK # is asserted (but address holds are 
allowed). LOCK # goes active in the first clock of the first locked bus cycle and goes 
inactive after the last clock of the last locked bus cycle. The last locked cycle ends 
when RDY # is returned. LOCK # is active LOW and is not driven during bus hold. 


Locked read cycles will not be transformed into cache fill cycles if KEN # is returned 
active. 


PLOCK # The pseudo-lock pin indicates that the current bus transaction requires more than 
one bus cycle to complete. Exampies of such operations are floating point long 
reads and writes (64 bits), segment table descriptor reads (64 bits), in addition to 
cache line fills (128 bits). The Intel OverDrive processor will drive PLOCK # active 
until the addresses for the last bus cycle of the transaction have been driven 
regardless of whether RDY # or BRDY # have been returned. 

Normally PLOCK # and BLAST # are inverse of each other. However during the first 
bus cycle of a 64-bit floating point write, both PLOCK # and BLAST # will be 
asserted. - 

PLOCK # is a function of the BS8#, BS16# and KEN# inputs. PLOCK# should be 


sampled only in the clock RDY # is returned. PLOCK # is active LOW and is not 
driven during bus hold. 


BUS CONTROL | | 

ADS# The address staius output indicates that a valid bus cycle definition and address are 
available on the cycle definition lines and address bus. ADS # is driven active in the 
same clock as the addresses are driven. ADS # is active LOW and is not driven 
during bus hold. 


must satisfy setup and hold times t;¢ and t17 for proper chip operation. 


RDY # The non-burst ready input indicates that the current bus cycle is complete. RDY # 
indicates that the external system has presented valid data on the data pins in 
response to a read or that the external system has accepted data from the Intel 
OverDrive processor in response to a write. RDY # is ignored when the bus is idle 
and at the end of the first clock of the bus cycle. 

RDY # is active during address hold. Data can be returned to the processor while 
AHOLD is active. 
RDY # is active LOW, and is not provided with an internal pullup resistor. RDY # 
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BURST CONTROL 


BRDY # : 
BLAST# _ 


INTERRUPTS 
RESET 


SRESET 7 
SMI# - 


SMIACT # 


Table 5-1. Pin Descriptions (Continued) 


Name and Function 


The burst ready input performs the same function during a burst cycle that RDY # 
performs during a non-burst cycle. BRDY # indicates that the external system has 
presented valid data in response to a read or that the external system has accepted 
data in response to a write. BRDY # is ignored when the bus is idle and at the end of 
the first clock in a bus cycle. . 

BRDY # is sampled in the second and subsequent clocks of a burst cycle. The data 
presented on the data bus will be strobed into the microprocessor when BRDY # is 
sampled active. If RDY # is returned simultaneously with BRDY #, BRDY # is ignored 
and the burst cycle is prematurely interrupted 
BRDY # is active LOW and is provided with a small pullup resistor. BRDY # must 
satisfy the setup and hold times ty, and t47. 


The burst /ast signal indicates that the next time BRDY # is returned the burst bus cycle 
is complete. BLAST # is active for both burst and non-burst bus cycles. BLAST # is 
active LOW and is not driven during bus hold. 


The RESET input forces the processor to begin execution at a known state. Reset is 
asynchronous, but must meet setup and hold times t20 and t21 for recognition in any 
specific clock. The processor cannot begin execution of instructions until at least 1 ms 
after Voc and CLK have reached their proper AC and DC specifications. However, for 
soft resets, RESET should remain active for at least 15 CLK periods. The RESET pin 
should remain active during this time to ensure proper processor operation. RESET is 
active HIGH. | | 

RESET sets the SMBASE descriptor to a default address of 30000H. If the system 
uses SMBASE relocation, then the SRESET pin should be used for soft resets. 


The SRESE‘T pin duplicates all the functionality of the RESET pin with the following two 
exceptions: 

1. The SMBASE register will retain its previous value. 

2. If UP # (l} is asserted, SRESET will not have an effect on the host microprocessor. 
For soft resets, SRESET should remain active for at least 15 CLK periods. SRESET is 
active HIGH. SRESET is asynchronous but must meet setup and hold times tag and to, 
for recognition in any specific clock. 


The System Management interrupt input is used to invoke the System Management 
Mode (SMM). SMI # is.a falling edge triggered signal which forces the processor into 
SMM at the completion of the current instruction. SMI # is recognized on an instruction 
boundary and at each iteration for repeat string instructions. SMI # does not break 
LOCKed bus cycles and cannot interrupt a currently executing SMM. The processor will 
latch the falling edge of one pending SMI# signal while the processor is executing an 
existing SMI. The nested SMI will not be recognized until after the execution of a 
Resume (RSM) instruction. . 


The System Management Interrupt ACTive is an active low output, indicating that 
the processor is operating in SMM. It is asserted when the processor begins to execute 
the SMI state save sequence and will remain active LOW until the processor executes 
the last state restore cycle out of SMRAM. | | 
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Table 5-1. Pin Descriptions (Continued) 


INTERRUPTS (Continued) 


STPCLK# The SToP CLockK request input signal indicates a request has been made to turn off 
the CLK input. When the processor recognizes a STPCLK #, the processor will stop 
execution on the next instruction boundary, unless superseded by a higher priority 
interrupt, empty all internal pipelines and the write buffers and generate a Stop Grant 
acknowledge bus cycle. STPCLK # is active LOW and is provided with an internal pull- 


up resistor. STPCLK# is asynchronous but setup and hold times tag and to; must be 
met to ensure recognition in any specific clock. 


INTR The maskable interrupt indicates that an external interrupt has been generated. If the 
internal interrupt flag is set in EFLAGS, active interrupt processing will be initiated. The 
Intel OverDrive processor will generate two locked interrupt acknowledge bus cycles in 
response to the INTR pin going active. INTR must remain active until the interrupt 
acknowledges have been performed to assure that the interrupt is recognized. 


INTR is active HIGH and is not provided with an internal pulldown resistor. INTR is 
asynchronous, but must meet setup and hold times tao and te4 for recognition in any 
specific clock. 


The non-maskable interrupt request signal indicates that an external non-maskable 
interrupt has been generated. NMI is rising edge sensitive. NMI must be held LOW for 
at least four CLK periods before this rising edge. NMI is not provided with an internal 
pulldown resistor. NMI is asynchronous, but must meet setup and hold times tap and 


to, for recognition in any specific clock. 
BUS ARBITRATION 


BREQ The internal cycle pending signal indicates that the Intel OverDrive processor has 
internally generated a bus request. BREQ is generated whether or not the Intel 


OverDrive processor is driving the bus. BREQ is active HIGH and is never floated. 


HOLD The bus hold request allows another bus master complete control of the Intel 
OverDrive processor bus. In response to HOLD going active the Intel OverDrive 
processor will float most of its output and input/output pins. HLDA will be asserted 
after completing the current bus cycle, burst cycle or sequence of locked cycles. The 
Intel OverDrive processor will remain in this state until HOLD is deasserted. HOLD is 


active high and is not provided with an internal pulldown resistor. HOLD must satisfy 
setup and hold times t;g and t1g for proper operation. 


HLDA Hold acknowledge goes active in response to a hold request presented on the HOLD 
pin. HLDA indicates that the Intel OverDrive processor has given the bus to another 
local bus master. HLDA is driven active in the same clock that the Intel OverDrive 
processor floats its bus. HLDA is driven inactive when leaving bus hold. HLDA is active 


HIGH and remains driven during bus hold. 
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Table 5-1. Pin Descriptions (Continued) — 


| Symbol | Type. = Name and Function 


BUS ARBITRATION (Continued) 


The backoff input forces the Intel OverDrive processor to float its bus in the next clock. 
The microprocessor will float all pins normally floated during bus hold but HLDA will not 
be asserted in response to BOFF #. BOFF # has higher priority than RDY # or BRDY#; 
if both are returned in the same clock, BOFF # takes effect. The microprocessor 
remains in bus hold until BOFF # is negated. If a bus cycle was in progress when 
BOFF # was asserted the cycle will be restarted. BOFF # is active LOW and must meet 
setup and hold times t;g and t;g for proper operation. 


| CACHE INVALIDATION 


AHOLD The adaress hold request allows another bus master access to the Intel OverDrive 
processor’s address bus for a cache invalidation cycle. The Intel OverDrive processor 
will stop driving its address bus in the clock following AHOLD going active. Only the 
address bus will be floated during address hold, the remainder of the bus will remain 


active. AHOLD is active HIGH and is provided with a small internal pulldown resistor. For 
proper operation AHOLD must meet setup and hold times tig and ty9. 


| EADS # This signal indicates that a valid external address has been driven onto the Intel 
OverDrive processor address pins. This address wili be used to perform an internal 
: cache invalidation cycle. EADS # is active LOW and is provided with an internal pullup 


resistor. EADS# must satisfy setup and hold times ty2 and t13 for proper operation. 
CACHE CONTROL | | 


KEN# The cache enable pin is used to determine whether the current cycle is cacheable. 
When the Intel OverDrive processor generates a cycle that can be cached and KEN # is 
active, the cycle will become a cache line fill cycle. Returning KEN # active one clock 
before RDY # during the last read in the cache line fill will cause the line to be placed in 


the on-chip cache. KEN # is active LOW and is provided with a small internal pullup 
resistor. KEN# must satisfy setup and hold times t;4 and t15 for proper operation. 


FLUSH # The cache flush input forces the Intel OverDrive processor to flush its entire internal 
cache. FLUSH # is active low and need.only be asserted for one clock. FLUSH # is 
asynchronous but setup and hold times tap and to; must be met for recognition in any 


specific clock. FLUSH # being sampled low in the clock before the falling edge of 
RESET causes the Intel OverDrive processor to enter the tri-state test mode. 


PAGE CACHEABILITY 


PWT The page write-through and page cache disable pins reflect the state of the page 

PCD attribute bits, PWT and PCD, in the page table entry or page directory entry. If paging is 
disabled or for cycles that are not paged, PWT and PCD reflect the state of the PWT and 
PCD bits in control register 3. PWT and PCD have the same timing as the cycle definition 
pins (M/IO #, D/C# and W/R#). PWT and PCD are active HIGH and are not driven | 
during bus hold. PCD is masked by the cache disable bit Py! in Control ——_ 0. 


NUMERIC ERROR REPORTING 


FERR# O | The floating point error pin is driven active when a floating point error occurs. FERR # is 
| similar to the ERROR ¥ pin on the Intel387™ math coprocessor. FERR # is included for 
compatibility with systems using DOS type floating point error reporting. FERR# will not 
go active if FP errors are masked in FPU register. FERR # is active LOW, and is not 
floated during bus hold. 
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Table 5-1. Pin Descriptions (Continued) 


NUMERIC ERROR REPORTING (Continued) 
IGNNE # 


When the /gnore numeric error pin is asserted the Intel OverDrive processor will ignore a 
numeric error and continue executing non-control floating point instructions, but FERR # 
‘will still be activated by the Intel OverDrive processor. When IGNNE # is deasserted the 
Intel OverDrive processor will freeze on a non-control floating point instruction, if a 
previous floating point instruction caused an error. IGNNE# has no effect when the NE 
bit in control register 0 is set. IGNNE # is active LOW and is provided with a small 
internal pullup resistor. IGNNE # is asynchronous but setup and hold times tao and to, 
must be met to insure recognition on any specific clock. © 


BUS SIZE CONTROL 


The bus size 16 and bus size 8 pins (bus sizing pins) cause the Intel OverDrive 
processor to run multiple bus cycles to complete a request from devices that cannot 
provide or accept 32 bits of data in a single cycle. The bus sizing pins are sampled every 
clock. The state of these pins in the clock before ready is used by the Intel OverDrive 
processor to determine the bus size. These signals are active LOW and are provided 
with internal pullup resistors. These inputs must satisfy setup and hold times t;4 and ty5 
for proper operation. 


ADDRESS MASK 
A20M# 


When the aaaress bit 20 mask pin is asserted, the Intel OverDrive processor masks 
physical address bit 20 (A20) before performing a lookup to the internal cache or driving 
a memory cycle on the bus. A20M# emulates the address wraparound at one Mbyte 
which occurs on the 8086. A20M # is active LOW and should be asserted only when the 
processor is in real mode. This pin is asynchronous but should meet setup and hold 
times top and to; for recognition in any specific clock. For proper operation, A20M # 
should be sampled high at the falling edge of RESET. 


i486 DX AND i486 SX PROCESSOR INTERFACE 
UP #(1,2) 


The upgrade present pin is used to signal the Intel486 Microprocessor to float its outputs 
and get off the bus. It is active low and is never floated. UP # is driven low at power-up 
and remains active for the entire duration of the Upgrade Processor operation. 


KEY PIN 
KEY (2) The KEY pin is an electrically non-functional pin which is used to ensure correct Upgrade 
Processor orientation in a 169-pin socket. . 


NOTE: 

1. The UP # pin was previously named the MP# pin in the i486 SX Microprocessor/i487 SX Math CoProcessor data book. 
The functionality is the same, only the name has changed. 

2. The UP# input pin and KEY pin are not defined on the OverDrive processor for replacement of PGA Intel486 DX Micro- — 
processor (ODPR). 
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Table 5-2. Output Pins 


| Name level 
Name 
Level 


When 
Floated 


BREQ 
~HLDA 
| BEO#-—BE3# Bus Hold 
PWT, PCD Bus Hold 
W/R#, D/C#, M/IO# Bus Hold 
LOCK # Bus Hold 
PLOCK# Bus Hold 
ADS # Bus Hold 
BLAST # Bus Hold 
PCHK # 
FERR# 
SMIACT # 
UP# 
A2-A3 HIGH | Bus, Address Hold 


Table 5-3. Input Pins 


Synchronous/ 
Asynchronous 
CLK — 
RESET Asynchronous 
HOLD Synchronous 
AHOLD Synchronous 
EADS # Synchronous 
BOFF # Synchronous 
FLUSH # . Asynchronous 
~ A20M# Asynchronous 
Synchronous 


BS16#, BS8# 


KEN # 


RDY # 
BRDY # 
INTR 

NMI 
SRESET 
SMI# 

-STPCLK# 

IGNNE# 
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Synchronous 
Synchronous 
Synchronous 
Asynchronous 
Asynchronous 
Asynchronous 


Asynchronous | 
Asynchronous 


Asynchronous 


i | 3 
intel. 
Table 5-4. Input/Output Pins 


| Active When 
Level . ~ Floated 
DO-D31 HIGH Bus Hold 


DPO-DP3 HIGH _ | Bus Hold 
A4-A31 HIGH Bus, Address Hold 


5.1 Architecture Block Diagram 


Figure 5-1 shows a block diagram of the Intel Over- 
Drive Processor Architecture. There are a few minor 
architectural differences between each of the Over- 
Drive processors. These differences are summa- 
rized below, with respect to Figure 5-1. 


The IntelSX2 OverDrive processor does not contain 
the floating point unit located in the lower left of the 
diagram. 


The IntelDX4 OverDrive processor contains a clock 
tripling circuit, as opposed to the clock doubling cir- 
cuit used in the IntelSX2 and IntelDX2 OverDrive 
processors. This is located in the upper left of the 
diagram. 


The IntelDX4 OverDrive processor contains a 
16 KByte cache, as opposed to the 8 KByte cache 
used in the IntelSX2 and IntelDX2 OverDrive proces- 
sors. This is located in the center of the diagram. 
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Intel OverDrive™ Processor Pipelined 32-Bit Microarchitecture 


64 Bit Interunit Transfer Bus 


32-bit Data Bus 


32-bit Data Bus 


Seg a ealtn 
Unit 


Descriptor 
Registers 


Linear Address (ae 


Pagin 
Borel shitter | Shifter ats 


Register File 


ALU 


aay ame 8k Byte 


Cache 


Saga 
Address 


Translation 
Lookaside 
Buffer 


Limit and 
Attribute 
PLA 


NJ 


Displacement Bus 


32 


Code 
Stream 


32 Byte Code 
Queue 


Floating Control and 
Point Protection Test 
Unit Unit Decode 


Decoded 


: Instruction 
F.P. Register Control Path 


File ROM 


Instruction 2 x 16 Bytes 


Core 
Clock 
: Clock CLK 
doubler 
Bus Interface A2~A31, 
BEO#-BE34 
7 a Address Drivers 
32 
DO-D31 


Data Bus 


Transceivers ADS# W/R# D/C# . 


M/10# PCD,PWT 
RDY# LOCK# PLOCK# 
BOFF# A20M# BREQ 
HOLD HLDA RESET 
Request Sequencer INTR NMI 


FERR# IGNNE# 


Bus Control 


BRDY# BLAST# 


BS16# BS8&# 


KEN# FLUSH# 
AHOLD, EADS# 


PCHK# 
Parity Generation DPO-DP3 
and Control 
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6.0 DIFFERENCES IN 
FUNCTIONALITY BETWEEN THE 
OverDrive™’ PROCESSOR 
FAMILY AND THE Intel486™ SX 
AND Intel486™ DX 7 
PROCESSORS 


The Intel OverDrive processors are an enhanced 
family of Intel486 microprocesscrs. There are, how- 
ever, four functional differences. First, the Intel 
OverDrive processors have an internal clock dou- 
bling (IntelSX2, IntelDX2) or clock tripling (IntelDX4) 
circuit which decreases the time required to execute 
instructions. Second, the Intel OverDrive processor 
family does not support the JTAG boundary scan 
test feature. Third, the Intel OverDrive processors 
have different processor revision identifications than 
the Intel486 SX or Intel486 DX processors. Finally, 
the IntelDX4 OverDrive processor contains a 
16 KByte cache, as opposed to the 8 KByte cache 
on the IntelSX2 and IntelDX2 OverDrive processors. 
These four differences are described in the following 
- gections, according to how they affect the processor 
functionality. 


6.1 Hardware Interface 


The bus of the Intel OverDrive processors has been 
designed to be identical to the Intel486 Microproces- 
sor bus. Although the external clock is internally 
doubled or tripled, and data and instructions are ma- 
nipulated in the processor core at twice or three 
times the external frequency, the external bus is 
functionally identical to that of the Intel486 proces- 
sor. 


The four boundary scan test signals (TCK, Test 
clock; TMS, Test Mode select; TDI, Test Data Input; 
TDO, Test Data Output), defined for some Intel486 
_ processors, are not specified for the Intel486 DX2 
OverDrive processor. 


The UP# (Upgrade Present) signal, which is defined 
as an input for some Intel486 processors, is an out- 
- put signal on the Intel OverDrive processor. The 
UP # pin on the Intel OverDrive processor provides 
a logical low output signal which can be used to en- 
able logic to recognize and configure the system for 
the Intel OverDrive processor. This signal is identical 
to the MP# output defined for the Intel487 SX Math 
CoProcessor. Refer to Section 7 for examples of use 
of the UP# signal. 


The DX register always contains the component 
identifier at the conclusion of RESET. The Intel 
OverDrive processor has a different revision identifi- 
er in the DL register than the Intel486 SX or Intel486 
DX microprocessors (refer to Section 8.1). When the 
OverDrive processor is installed in a system the 
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component identifier is supplied by the OverDrive 
processor, rather than the original processor. The 
stepping identification portion of the component 
identification will change with different revisions of 
the OverDrive processor. The designer should only 


assume that the component identification for the 
OverDrive processor will be 045x for the IntelSX2 


OverDrive processor, 043x for the IntelDX2 Over- 


Drive processor and 148x for the IntelDX4 Over- 
Drive processor, where “x” is the stepping identifier. 


6.2 Testability 


As detailed in Section 6.1, the Intel OverDrive proc- 


~-essor does not support the JTAG boundary scan 
- testability feature. 


6.3 Instruction Set Summary 


The Intel OverDrive processor supports all Intel486 
extensions to the 8086/80186/80286 instruction 
set. In general, instructions will run faster on the Intel 
OverDrive processors than on the Intel486 micro- 
processor. Specifically, an instruction that only uses 
memory from the on-chip cache executes at the full . 
core clock rate while all bus accesses execute at the — 
bus clock rate. To calculate the elapsed time of an 
instruction, the number of clock counts for that in- 
struction must be multiplied by the clock period for 
the system. The instruction set clock count summary 
tables from the Intel486 SX and Intel486 DX Micro- 
processor Data Sheets can be used for the Over- 
Drive processor with the following modifications: 


— Clock counts for a cache hit: This value repre- 
sents the number of internal processor core 
clocks for an instruction that requires no external 
bus accesses or the base core clocks for an in- 
struction requiring external bus accesses. 


— Penalty clock counts for a cache miss: This value 
represents the worst-case approximation of the 
additional number of external clock counts that 
are required for an instruction which must access 
the external bus for data (a cache miss). This 
number must be multiplied by 2 (for the IntelSX2 
and IntelDX2 OverDrive processors) or 3 (for the 
IntelDX4 OverDrive processor) to convert it to an. 
equal number of internal processor core clock 
counts and added to the base core clocks to 
compute the number of core clocks for this in- 
struction. 


The actual number of core clocks for an instruction 
with a cache miss may be less than the base clock 
counts (from the cache hit column) plus the penalty 
clock counts (2 times the cache miss column num- 
ber for the IntelSX2 and IntelDX2, 3 times the cache 
miss column number for the IntelDX4). The clock 
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counts in the cache miss penalty column can be a 
cumulative value of external bus clocks (for data 
reads) and internal clocks for manipulating the data 
which has been loaded from the external bus. The 
number of clocks which are related to external bus 
accesses are correctly represented in terms of inter- 
nal core clocks by multiplying by two. However, the 
clock counts related to internal data manipulation 
should not be multiplied by two. Therefore the total 
number of processor core clock counts for an in- 
struction with a cache miss represents a worst-case 
approximation. 


To calculate the execution time for an OverDrive 
processor instruction, multiply the total processor 
core clock counts by the core clock period. For ex- 
ample, in a 25 MHz system upgraded with a 50 MHz 
IntelDX2 OverDrive processor, the core clock period 
is 20 ns (1/50 MHz). 


Additionally, the assumptions specified below 
should be understood in order to estimate instruc- 
tion execution time. 


A cache miss will force the OverDrive processor to 
run an external bus cycle. The Intel486 microproces- 
sor 32-bit burst bus is defined as r—b—w. 


Where: 


r= The number of bus clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle is a non-burst read. 


b = The number of bus clocks for the second and 
subsequent cycles in a burst read. 


w = The number of bus clocks for a write. 


The fastest bus the OverDrive processor can sup- 
port is 2—1—2 assuming 0 waits states. The clock 
counts in the cache miss penalty column assume a 
2-1-2 bus. For slower busses add r—2 clocks to 
the cache miss penalty for the first dword accessed. 
Other factors also affect instruction clock counts. 


Instruction Clock Count Assumptions 


1. The external bus is available for reads or writes at 
all times. Else add bus clocks to reads until the 
bus is available 


2. Accesses are aligned. Add three core clocks to 
each misaligned access. 
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3. Cache fills complete before subsequent accesses 
to the same line. If a read misses the cache dur- 
ing a cache fill due to a previous read or prefetch, 
the read must wait for the cache fill to complete. If 
a read or write accesses a cache line still being 
filled, it must wait for the fill to complete. 


4. lf an effective address is calculated, the base reg- 
ister is not the destination register of the preced- 
ing instruction. If the base register is the destina- 
tion register of the preceding instruction add 1 to 
the core clock counts shown. Back-to-back PUSH 

‘and POP instructions are not affected by this rule. 


5. An effective address calculation uses one base 
register and does not use an index register. 
However, if the effective address calculation 
uses an index register. 1 core clock may be add- 
ed to the clock shown. | 


6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction 
of a jump. If the destination instruction is not 
completely contained in the first dword read, add 
a maximum of 3b bus clocks. If the destination 
instruction is not completely contained in the first 
16 byte burst, add a maximum of another r+ 3b 
bus clocks. 


7. If no write buffer delay, w bus clocks are added 
_ only in the case in which all write buffers are full. 


8. Displacement and immediate not used together. 
If displacement and immediate used together, 1 
core clock may be added to the core clock count 
shown. 


9. No invalidate cycles. Add a delay of 1 bus clock 
for each invalidate cycle if the invalidate cycle 
contends for the internal cache/external bus 
when the OverDrive processor needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 
13, 21 or 28 bus clocks + 1 possible core clock 
to the instruction depending on whether the Ac- 
cessed and/or Dirty bit in neither, one or both of 
the page entries needs to be set in memory. This 
assumes that neither page entry is in the data 
cache and a page fault does not occur on the 
address translation. 


11. No exceptions are detected during instruction 
execution. Refer to interrupt core Clock Counts 
Table for extra clocks if an interrupt is detected. 


12. Instructions that read multiple consecutive data 
items (i.e., task switch, POPA, etc.) and miss the 
cache are assumed to start the first access ona 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+ 3b) 
bus clocks to the cache miss penalty. 
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7.0 INTEL OverDrive™ PROCESSOR allows the Intel486 processor to directly recognize 
CIRCUIT DESIGN when the Intel OverDrive processor socket is popu- 


lated. When the UP# pin is driven active to the 
Intel486 processor, the Intel486 processor tri-states 


7.1 Upgrade Circuit for Intel486 all of its output pins and enters power-down mode. 
PLOceseor-uaeec Systems with 
UP # 


Figure 7-1 shows the Intel OverDrive processor 
socket circuit for Intel486 processor-based systems 
using UP#. The Upgrade Present input, UP# pin, 


HLDA 
+ _ CTRL RL 


> CLK HLDA | HLDA 


FLUSH# &) 
Intel 
™ 
ie i486 


PROCESSOR Processor 
BOFF# © 


IGNNE# FERR# HOLD 


“IGNNE# FERR# Voc BOFF# —- FLUSH# 


290436-6 


Figure 7-1. Intel OverDriveT™’ Socket Circuit Diagram for Systems 
Based on Intel486™ Processors That Have the UP # Input Pin 
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8.0 BIOS AND SOFTWARE 


The following should be considered when designing 


a system for upgrade with an Intel OverDrive proces- | 


sor. 


8.1 Intel OverDrive™ Processor 
Detection 


The component identifier and the stepping/revision 
identifier for the Intel OverDrive processors is read- 
able in the DH and DL registers, respectively, imme- 
diately after RESET. The value loaded into each reg- 
ister is defined in Table 8-1. The “x” value defines 
the device stepping. | 


Table 8-1. CPU ID Values 


Intel486DX Oxh, 1xh 


As it is difficult to differentiate between Intel486 DX 
processor and some of the Intel OverDrive proces- 
sors in software, it is recommended that the BiOS 
save the contents of the DX register immediately af- 
ter RESET. This will allow the information to be used 
later, if required, to identify an Intel OverDrive proc- 
essor in the system. 


Alternately, for those OverDrive processors support- 
ing it, the CPUID instruction can be used to identify 
the processor. Refer to the Intel486 Microprocessor 
Data Book for additional information on the CPUID 
instruction and its use. 
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NOTE: | 

Initialization routines for IntelSX2 OverDrive proces- 
sor and Intel486 SX processor-based systems 
should check for the presence of a floating point 
unit and set the CRO register accordingly (refer to 
the Intel486 SX Microprocessor Data Book for spe- 
cific details). In addition, the BIOS should check for 
the presence of the 16 KByte cache in the 
IntelDX4 OverDrive processor. 


8.2 Timing Dependent Loops 


The Intel OverDrive processors execute instructions 
at two times (for the IntelSX2 and IntelDX2 Over- 
Drive processors) or three times (for the IntelDX4 
OverDrive processor) the frequency of the input 
clock. Thus, software (or instruction based) timing 
loops will execute faster on the Intel OverDrive proc- 
essor than on the Intel486 DX or Intel486 SX proc- 
essor (at the same input clock frequency). Instruc- 
tions such as NOP, LOOP, and JMP $+2, have 
been used by BIOS to implement timing loops that 
are required, for example, to enforce recovery time 
between consecutive accesses for I/O devices. 
These instruction based timing loop implementa- 
tions may require modification for systems intended 
to be upgradable with the Intel OverDrive proces- 
sors. 


In order to avoid any incompatibilities, it is recom- 
mended that timing requirements be implemented in 
hardware rather than in software. This provides 
transparency and also does not require any change 
in BIOS or I/O device drivers in the future when 
moving to higher processor clock speeds. As an ex- 
ample, a timing routine may be implemented as fol- 
lows: The software performs a dummy |/O instruc- 
tion to an unused I/O port. The hardware for the bus 
controller logic recognizes this I/O instruction and 
delays the termination of the |/O cycle to the proc- 
essor by keeping RDY # or BRDY# deasserted for 
the appropriate amount of time. 
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9.0 ELECTRICAL DATA 


The following sections describe recommended elec: 
trical connections for the Intel OverDrive processor, 
and its electrical specifications. 


9.1 Power and Grounding 


9.1.1 POWER CONNECTIONS 


~ Power and ground connections must be made to all 
external Vcc and GND pins of the Intel OverDrive 
processor. On the circuit board, all Voc pins must be 


connected on a Vcc plane. All Vss pins must be. 


likewise connected on a GND plane. 


9.1.2 POWER DECOUPLING 
RECOMMENDATIONS 


Liberal decoupling capacitance should be placed 
near the Intel OverDrive processor. The _ Intel 
OverDrive processor driving its 32-bit parallel ad- 
dress and data busses at high frequencies can 
cause transient power surges, particularly when driv- 
ing large capacitive loads. | 


a 

intel. 
Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 


Circuit board traces between the Intel OverDrive 
processor and decoupling capacitors as much as 


possible. Capacitors specifically for PGA packages 


are also commercially available. 


9.1.3 OTHER CONNECTION 
RECOMMENDATIONS 


N.C. pins should always remain unconnected. 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Active LOW in- 
puts should be connected to Vcc through a pullup 
resistor. Pullups in the range of 20 KN are recom- 
mended. Active HIGH inputs should be connected to 
GND. | 


9.2 Maximum Ratings 


Table 9-1 lists the absolute maximum ratings for 
each of the OverDrive processors. This table is a 
stress rating only, and functional operation at the 
maximums is not guaranteed. Functional operating 
conditions are given in Section 9.3, D.C. Specifica- 
tions, and Section 9.4, A.C. Specifications. 


Table 9-1. Absolute Maximum Ratings 


Case Temperature under Bias 
Storage Temperature 


Voltage on any Pin 
with Respect to Ground 


Supply Voltage with Respect to Vss 
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IntelSX2T™ 
OverDrive™ 
—65°C to +110°C —65°C to + 110°C —30°C to +110°C 


—65°C to + 150°C 


—0.5V to Voc + 0.5V | —0.5V to Voc + 0.5V | —0.5V to Voc + 0.5V 


—0.5V to +6.5V —0.5Vto+6.5V | —0.5V to +6.5V 


IntelDX2™ 
OverDriveT™ 


IntelDX4™ — 
OverDrive™ 


—65°C to +150°C | —30°Cto +125°C 
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9.3 D.C. Specifications 9.3.1 IntelSX2™ OverDrive™’ PROCESSOR 
D.C. SPECIFICATIONS 

The D.C. specifications for each of the OverDrive 

processors are contained in the tables in Sections | Table 9-2 details the D.C. Specifications of the 

9.3.1, 9.3.2 and 9.3.3. For additional information, re- | _IntelSX2 OverDrive processor. 

fer to the appropriate Intel microprocessor hand- 

book. 


Table 9-2. D.C. Specifications for the IntelSX2T™’ OverDriveT™ Processor | 


Functional operating range: Vcc = 5V + 5%; Tsinx = 0°C to + 85°C 


-someat[——__praner | win typ | mn_| vt [Ton Conaon 
re 
aT a ee ae eT os 
[vos | owoniignvetegs ee [|v ne 
Tuy [input Leckage Curent =i -s | | ws | pa | Wow 


Input Leakage Current 
(all pins except SRESET) 200 pA (Note 4) 
Input Leakage Current for SRESET 300 yA 
Power Supply Current 
CLK = 25 MHz 700 mA 
Input Leakage Current ae we (Note 5) 
a Input Capacitance PGA a nao ee (Note 7) 


NOTES: 


1. This parameter is measured at: Address, Data, BEn: 4.0 mA 
Definition, Control: 5.0 mA 

2. This parameter is measured at: Address, Data, BEn: —1.0 mA 

Definition, Control: —0.9 mA 
This parameter is for inputs without pullups or pulldowns and OV < Vin < Vcc. 
This parameter is for inputs with pulldowns and Viy = 2.4V. 
This parameter is for inputs with pullups and Vy, = 0.45V. 
When the processor is in Stop Grant state, the Iccy of the host processor is less than 2 mA. 
Fo = 1 MHz; Not 100% tested. 


ee Ee 
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9.3.2 IntelDX2T™ OverDrive™ PROCESSOR D.C. SPECIFICATIONS 
Table 9-3 details the D.C. Specitications of the IntelDX2 OverDrive processor. 


Table 9-3. D.C. Specifications for the IntelDX2™ OverDrive™ Processor 


Parameter 


[| Unit | Notes 

ViL Input Low Voltage a rere 
Vin Input High Voltage = ae 
loc Power Supply Current _ -* 

CLK = 33 MHz 

CLK = 25 MHz mA (Note 4) 
lu. Input Leakage Current 
mm Input Leakage Current 
lie Input Leakage Current 

CLK Capacitance a ee eee Fo = 1 MHz(8) 


NOTES: | j 
1. The function operating temperature range is: 

OverDrive processor—25 MHz, Tsink = 0°C to + 95°C 

OverDrive processor—33 MHz, Tsink = 0°C to + 95°C 
2. This parameter is measured at: 

Address, Data, BEn 4.0 mA 

Definition, Control 5.0 mA 
3. This parameter is measured at: 

Address, Data, BEn -—1.0 mA 

Definition, Control —0.9 mA 
4. Typical supply current: 

775 mA @ CLK = 25 MHz 

975 mA @ CLK = 33 MHz 
5. This parameter is for inputs without internal pullups or pulldowns and 0 < Vin < Vcc. 
6. This parameter is for inputs with internal pulldowns and Vi = 2.4V. 
7. This parameter is for inputs with internal pullups and V;, = 0.45V. 
8. Not 100% tested. 


w 
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9.3.3 IntelDX4T™ OverDriveT™’ PROCESSOR D.C. SPECIFICATIONS 
Table 9-4 details the D.C. Specifications of the IntelDX4 OverDrive processor. 


Table 9-4. D.C. Specifications for the IntelDX4™ OverDrive™ Processor 
Functional operating range: Vcc = 5V + 5%, Tsink = 0°C to + 95°C. 


[unit 
"vin | Inputrighvotage——=S=S*~*~“~*é‘“‘dtC OC ce | | 
a 
110 
Power Supply Current in Stop: % 


Output High Voltage 2 es 
loc Stop 
Grant 
20 
Clock State 
0 


mA 
- 
Clock | 
LI Input Leakage Current ye | tts] pA 
fo! 


CLK = 25/75 MHz 
CLK = 33/100 MHz 


Power Supply Current 

CLK = 25/75 MHz 

CLK = 33/100 MHz 
Input Leakage Current 
Input Leakage Current _ | | 400 


NOTES: 
1. This parameter is measured at: 
4.0 mA: Address, Data, BEn 
5.0 mA: Definition, Control 
2. The maximum and typical values shown here are design estimates. Typical supply current: 
Icc = 835 mA @ CLK = 25 MHz 
Iocc = 1085 mA @ CLK = 33 MHz 
3. The loc Stop Grant specification refers to the Icc value once the IntelDX4 OverDrive processor enters the Stop Grant or 
Halt Auto Powerdown State. . 
4. The Icc Stop Clock specification refers to the Ico value once the IntelDX4 OverDrive processor enters the Stop Clock 
State. Vi, and Vi, levels must be Voc and OV, respectively, in order to meet the Icc Stop Clock specification. 
5. This parameter is for inputs without pullups or pulldowns and 0 < Vin < Voc. 
6. This parameter is for inputs with pulldowns and Vi = 2.4V. 
7. This parameter is for inputs with pullups and Vj, = 0.45V. 
8. Not 100% tested. 


(Note 6) 
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9.4 A.C. Specifications 


The A.C. specifications for each of the OverDrive 
processors are contained in the tables in Sections 
- 9.4.1, 9.4.2 and 9.4.3. These specifications consist 
_of output delays, input setup requirements and input 
hold requirements. All A.C. specifications are rela- 
tive to the rising edge of the CLK signal. 


A.C. specification measurements are defined by Fig- 


ures 9-1 through 9-6. All timings are referenced to | 


1.5V, unless otherwise specified. Inputs must be 
driven to the voltage levels indicated by Figure 9-3 
when A.C. specifications are measured. Intel Over- 
Drive processor output delays are specified with 
minimum and maximum limits, measured as shown. 
The minimum Intel OverDrive processor delay times 
- are hold times provided to external circuitry. Intel 
OverDrive processor input setup and hold times are 
specified as minimums, defining the smallest ac- 
ceptable sampling window. Within the sampling win- 
dow, a synchronous signal must be stable for cor- 
rect Intel OverDrive processor operation. 


Table 9-5 defines the A.C. timing specifications for a 
33 MHz system. Table 9.7 defines the A.C. timing 
specifications for a 25 MHz system. Table 9-8 de- 
fines the A.C. timing specifications for a 20 MHz sys- 
tem. Table 9-8 defines the A.C. timing specifications 
for a 16 MHz system. 


Each Intel OverDrive processor meets the A.C. 
specifications for the processor it is upgrading. For 
example, a 100 MHz IntelDX4 OverDrive processor 
meets the system A.C. timing specifications for the 
33 MHz processor it is upgrading. 


Refer to Sections 9.4.1 through 9.4.3 for any timing 
differences from those specified in the following ta- 
bles. 


For additional information, refer to the appropriate 
Intel microprocessor handbook. 
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9.4.1 IntelSX2T™ OverDrive™ PROCESSOR 


A.C. SPECIFICATIONS 


The IntelSX2 OverDrive processor can be placed 
into an existing 16 MHz, 20 MHz or 25 MHz Intel486 
system, doubling the internal processor speed to 
32 MHz, 40 MHz. or 50 MHz, respectively. 


Tables 9-6 through 9-8 contain the A.C. timing speci- | 
fications for the processors in those systems. 


9.4.2 IntelDX2™ OverDriveT™’ PROCESSOR 
A.C. SPECIFICATIONS 


The IntelDX2 OverDrive processor can be placed 
into an existing 16 MHz, 20 MHz, 25 MHz or 33 MHz 
Intel486 system, doubling the internal processor 
speed to 32 MHz, 40 MHz, 50 MHz or 66 MHz, re- 
spectively. 


Tables 9-5 through 9-8 contain the A.C. timing speci- 
fications for the processors in those systems. 


9.4.3 IntelDX4™ OverDriveT™’ PROCESSOR 
A.C. SPECIFICATIONS 


The IntelDX4 OverDrive processor can be placed 
into an existing 16 MHz, 20 MHz, 25 MHz or 33 MHz 
Intel486 system, tripling the internal processor 
speed to 48 MHz, 60 MHz, 75 MHz or 100 MHz, 
respectively. 


Tables 9-5 through 9-8 contain the A.C. timing speci- 
fications for the processors in those systems. 
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Table 9-5. 33 MHz Intel Processor Characteristics(‘) 
Voc = 5V +5%; Tsink = See Note 6; C; = 50 pF unless otherwise specified(3) 


[symbol[ ‘Parameter === Min| Max | Unit] Figure| Notes 
i nce all 
OverDrive processor 

ri ___| GLK Period reofwsfm [st 
ee Cer P fot | 8 |__| Adjacent Clocks | 
OLK High Time caf [as | et fatey 
CC 


fy [ciKFatime ——S~S~S Sd 
Ci a Ce 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK#, SMIACT #, 
FERR #, BREQ, HLDA Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK# Float — 


T 85 _| Wotea) 


RESET, FLUSH#, A20M#, NMI, INTR, 
SMI#, STPCLK#, SRESET, IGNNE# 
Setup Time 


RESET, FLUSH#, A20M#, NMI, INTR, 
SMI #, STPCLK #, SRESET, IGNNE# 
Hold Time 


[gg | D0-084, DPO-8, Ad-AdT Read Setup Time] 6 | [re [Sasa] 
[za [D0-D1, DPO=3, Ad-AGi ReadHoid Time | 3 | | ns [5a5a; 


NOTES: 
1. To be used for 66 MHz IntelDX2 and 100 MHz IntelDX4 OverDrive processors. 
2. Not 100% tested. Guaranteed by design characterization. 
3. All timing specifications assume Ci, = 50 pF. 
4. The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
5. A reset pulse width of 15 CLK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and CLK are stable. 
6. Tsink temperatures are: 
IntelSX2 OverDrive processor: 0°C to + 85°C 
IntelDX2 OverDrive processor: 0°C to + 95°C 
IntelDX4 OverDrive processor: 0°C to + 95°C 


to4 
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3 Table 9-6. 25 MHz Intel Processor Characteristics(1) 
Voc = 5V +5%:; Tsink = See Note 6; C; = 50 pF unless otherwise specified(3) 


Symbol Parameter | Min | Max | Unit | Figure | == Notes 
i Be Kl ad == 
OverDrive Processor 
CLK Period | 40 | 125 | ns | 51 | sd 
CLK Period Stability _ P_fosx | a {_[adacenicccks 
CLK High Time 14]; | ons | at 2V. | 
CLK Low Time 4] ns | ost fatosy 
CLK Fall Time eke 
CLK Rise Time 


A2-A31, PWT, PCD, BEO-3#, M/IO#, fa 
D/C#, W/R#, ADS#, LOCK#, ot 
FERR#, BREQ, HLDA, SMIACT#, 
Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#. ADS#, LOCK# Float 
Delay < 


PCHK # Valid Delay ag (Note 4) 
€ | 55 | (Notes) 


— fou 
ph) 


— | | {© Io Io 
: ‘ 


MN 
m 
> 
O 
7) 
we 
op) 
@ | 
Cc 
TG 
|Z 


ive) 


£ 


~“ 


BLAST #, PLOCK# Valid Delay“ ® Bel 24.4% (Note 4) 
BLAST #, PLOCK# Float Dg lay > dowel ns | 5.6 | (Note 2) 


oO 
ws) 
7 
9 
& 
0]: 
U 
“a 
| do 
aes 
=. 
»| © 
es 


3 


t13 EADS # Hold time™ 


—— | ‘a\ a 
tia _| KEN#, BS16#, BS8# Sétil Hreget_o | [ns | 52] 
tis__| KEN#, BS16#, BS8 @ihold Tima). > ee 
ROY. BROY¢ Settht ZN a a ee eS 
RDY#, BRDY@ Mold Time (a 5 
HOLD, AHQED,BOFF# SeupTime [8 | | ns | 62 | 


HOLD, Ag io1 BOFF#HoldTime | 3 | | ns | 52, 


RESET, FEUSH, A20M#, NMI, SMI#, dd 
STPCLK#, SRESET, INTR, IGNNE# 
Setup Time ; 
toy RESET, FLUSH#, AZOM#, NMI, SMI#, 5.2 
STPCLK#, SRESET, INTR, IGNNE# 
Hold Time 
too DO-D31, DPO-3, A4--A31 Read 5.2, 5.3 
Setup Time . 
to3 DO-D31, DPO-3, A4--A31 Read 3 5.2, 5.3 
. Hold Time | 


NOTES: 
1. To be used for 50 MHz IntelSX2, 50 MHz or 60 MHz IntelDX2 and 75 MHz or 100 MHz IntelDX4 OverDrive processors. 
2. Not 100% tested. Guaranteed by design characterization. 
3. All timing specifications assume C,_ = 50 pF. 
4, The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
5. A reset pulse width of 15 CLK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and CLK are stable. 
6. Tsink temperatures are: 
IntelSX2 OverDrive processor: 0°C to + 85°C 
IntelDX2 OverDrive processor: 0°C to +95°C 
IntelDX4 OverDrive processor: 0°C to +95°C 
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Table 9-7. 20 MHz Intel Processor Characteristics() 
Voc = 5V +5%; Tsinxk = See Note 6; C; = 50 pF unless otherwise specified(3) 


Symbol Parameter 


win [Max | Unit | Figure | Notes 
i 


1X Clock Driven to 

OverDrive Processor 
jt; | ClKPeriog i | 1H | ns | 5 | 
CLK Period Stability | form] A | | Adjacent Clocks | 
CLK High Time 16} [ns | 54 fata 
CLK Low Time | pi6 | | ns | 5.1 [atosv 
[ty | ClkKFallTime = | CT ls | 5.4 | VtoOBV 
jis | CLKRiseTime =| S| | ns 

t 


6 A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK#, | she 
FERR #, BREQ, HLDA, SMIACT # 

Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK# 
Float Delay 


5.6 (Note 2) 


5.4 (Note 4) 


) 5.5 | (Note 4) 


Note 2) 


| 5.6 
| 5.5 | (Note 4) 


55 
3° | 226 © 5.5 
£ ay th” <i 87 Note 2) 
EADS # Setup Tim : iFM 
oe roy 7 F< : — 59 
| 


EN #, BS16#: BS8# Setup Tinie 


SS 


apt 
S 


it = = 
Se 
Hold Fimes 
. A 


SR 


HOLD, AHOKDy (ee ee (oe 
too ;PLUSH#, A20M#, NMI, SMI#, | 12 5.2 | (Note 5) 
STPCLK #, SRESET, INTR, IGNNE # 
Setup Time : 
to RESET, FLUSH #, A20M#, NMI, SMI#, 


STPCLK #, SRESET, INTR, IGNNE# 


3 5.2 (Note 5) 
Hold Time 
too DO-D31, DPO-3, A4—A31 Read 5.2; 5.3 
Setup Time 
to3 DO-D31, DPO-3, A4—A31 Read 3 5.2, 5.3 
Hold Time 


NOTES: 
1. To be used for 50 MHz IntelSX2, 50 MHz or 60 MHz IntelDX2 and 75 MHz or 100 MHz IntelDX4 OverDrive processors. 
2. Not 100% tested. Guaranteed by design characterization. 
3. All timing specifications assume C, = 50 pF. 
4. The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
5. A reset pulse width of 15 CLK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and CLK are stable. 
6. Tsink temperatures are: 
IntelSX2 OverDrive processor: 0°C to + 85°C 
IntelDX2 OverDrive processor: 0°C to +95°C 
IntelDX4 OverDrive processor: 0°C to +95°C 
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Table 9-8. 16 MHz Intel Processor Characteristics(") 
cc = 5V +5%; Tsinx = See Note 6; C; = 50 pF unless otherwise specified 


___ Parameter | Min | Max | Unit | Figure | Notes 
_ ee 
OverDrive Processor 

CLK Period 625/125 | ns | 51 | 
CLK Period Stability | foi% | A | | AdjacentClocks 
CLK High Time )20 | | ns | 51 [atv 


CLK Low Time | 20] [ns | 51 [atoev 
CLK Falll Time ——+ 8 | ne fet 2Vt00.8V_ 

CLK Rise Time | tt AB ne 9.sv to 2v 
A2-A31, PWT, PCD, BEO-3#,M/IO#, | 3 

D/C#, W/R#, ADS#, LOCK#, 
FERR#, BREQ, HLDA, SMIACT# 
Valid Delay 


A2-A31, PWT, PCD, BEO-3#, M/IO#, 
D/C#, W/R#, ADS#, LOCK# 
Float Delay 


PCHK # Valid Delay 
BLAST #, PLOCK# Vailid Delay ‘ 


< 


8 


i<o) 
© 


t 


+ 
ie) 


a7 o-* o> | em oa ~- or ee | ek -- Comal 
—_ _h —_ — —_ ~ fe>) & 160 — —_ 


KEN #, BS16#; BS8# Se 
KEN#, BS16#, BS8# Ho 
RDY #, BRDY# Sou : 
RDY #, BRDY #,Hold ¥ 


HOLD, AHOL, = ale Time 7 ae 5s es eT 
HOLD, ALLO? BOFF # HoldTime | 4 | | ns | 52 | 


EROSH#, A20M#, NMI, SMI#, | 14 (Note 5) 
STPCLK# SRESET, INTR, IGNNE# 
Setup Time 
RESET, FLUSH #, A20M#, NMI, SMI#, (Note 5) 
STPCLK #, SRESET, INTR, IGNNE# 
Hold Time | 
DO-D31, DPO-3, A4-A31 Read 10 5.2, 5.3 
Setup Time | | 
to3 DO-D31, DPO-3, A4-A31 Read | 4 

Hold Time 


NOTES: 
1. To be used for 50 MHz IntelSX2, 50 MHz or 60 MHz IntelDX2 and 75 MHz or 100 MHz IntelDX4 OverDrive processors. 
2. Not 100% tested. Guaranteed by design characterization. 
3. All timing specifications assume C_ = 50 pF. 
4. The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
5. A reset pulse width of 15 CLK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Voc and CLK are stable. 
6. Tsink temperatures are: 
IntelSX2 OverDrive processor: 0°C to + 85°C 
IntelDX2 OverDrive processor: 0°C to + 95°C 
IntelDX4 OverDrive processor: 0°C to + 95°C 
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Figure 9-1. CLK Waveforms 
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Figure 9-2. Input Setup and Hold Timing 
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Figure 9-3. Input Setup and Hold Timing 
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CLK [ 


BRDY#, RDY# [ 


_  DO-D31 
DPO-DP3 


PCHK# [ 


CLK 


A2-A31, PWT, PCD, 
BEO-3#, M/10#, 
D/C#,W/R#, ADS#, 
LOCK#, FERR#, BREQ, 
HLDA 


' DP0-D31, DPO-3, 
(WRITE) 


BLAST#, PLOCK# 


CLK 


A2-A31, PWT, PCD, 
BEO-3#, M/lO#, 


D/C#,W/R#, ADS#, 


LOCK#, FERR#, BREQ, 


‘HLDA 


DO-D31, DPO-3, 
(WRITE) 


BLAST#, PLOCK# 


Wk "SW 
Pa, 
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Figure 9-4. PCHK # Valid Delay Timing 
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Figure 9-5. Output Valid Delay Timing 
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Figure 9-6. Maximum Float Delay Timing 
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10.0 MECHANICAL DATA 10.1 Package Dimensions 

The following sections describe the physical dimen- Figure 10-1 describes the physical dimensions of the 
sions of the OverDrive processor packages and heat PGA packages (168-lead PGA and 169-lead PGA) 
sinks. used with the IntelSX2, IntelDX2 and IntelDX4 Over- 


Drive processors. 


SEATING 


SEATING 
PLANE 


@B (ALL PINS) 


ia 


SWAGGED 
PIN 
DETAIL | 


© 
CRORCROROROROMOKO 
t |©©0©000000 


1. Pin not present on 168 lead PGA package. 
2.29 
1.52 aes SWAGGED 
45° CHAMFER 
(INDEX CORNER) 


290436-17 


Family: Ceramic Pin Grid Array Package 


| Millimeters_— | snches, 

| Min | Max | Notes | Min | Max | Notes | 
| A | 3se{ 457] | ota | oreo | 
| Ar | 064 | 1.14 | souinuip | 0.026 | 0.045 | SOLIDLID_ 
| Ae | 28 | 35 | SOLIDLID | 0.110 | 0.140 | SOLIDLID” 
| Aa {| 1t4 | tao] | 004s | ooss | 
|B | 043 | 081] | oor7 | coz | 
|b | 4407 | agea | 4.795 | 1765 | 
| Dy | 40st | 4077 | | 1.505 | 1605 | 
|e; | 229 | 279] | ooso | orto | 
pou | asa] a0} | too | ots | 
| oN | 168,169 108,169 | | 
a ee 
| issue | ws Revx vises | 


Figure 10-1. OverDrive™ Processor Package Dimensions 
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10.2 Heat Sink Dimensions j 10.2.1 0.25" HEAT SINK 


There are two different heat sinks used on the Intel | Figure 10-2 describes the physical dimensions of the 
OverDrive processors. The IntelSX2 and IntelDX2 0.25” heat sink used with the IntelSX2 and IntelDX2 
OverDrive processors both use the 0.25” heat sink. | OverDrive processors. Table 10-1 lists the physical 
The IntelDX4 OverDrive processor uses the 0.6” dimensions. 


heat sink. Both heat sinks are described in the fol- 
lowing sections. 


0.25" HEAT SINK 


ADHESIVE 


UPGRADE PROCESSOR, PGA PACKAGE 


290436-19 — 


Figure 10-2. Dimensions, IntelSX2™/IntelDX2TM OverDrive™ Processor with 0.25” Heat Sink 


Table 10-1. 0.25” Heat Sink Dimensions - 

Dimension (inches) | Minimum =| Maximum 

A. Heat Sink Width | 4520 «| 1550 
B. PGA Package Width | 4.735 1.765 
C. Heat Sink Edge Gap | ee ee ee 
D. Heat Sink Height | 022 | 260 
E. Adhesive Thickness / 0.008 =f sora 
F, Package Height from Stand-Offs | 0440 ff to 
G. Total Height from Package Stand-Offs to Top of Heat Sink | 0360 =| 0452 | 
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10.2.2 0.6” HEAT SINK Table 10-2. As the table shows, the maximum height 

of the IntelDX4 OverDrive processor from the pin 
Figure 10-3 describes the physical dimensions of the stand-offs to the top of the heat sink, including the 
0.6” heat sink used with the IntelDX4 OverDrive adhesive thickness, is 0.780 inches. A minimum 
processors. The maximum and minimum dimensions clearance of 0.25” should be allowed above the top 
for the PGA package with heat sink are shown in ~— of the heat sink. . 


0.6" HEAT SINK 


ADHESIVE 


UPGRADE PROCESSOR, PGA PACKAGE 


290436-36 


Figure 10-3. Dimensions, IntelDX4T OverDrive™ Processor with 0.6” Heat Sink 


Table 10-2. 0.6” Heat Sink Dimensions 


[__ pimensionnetes) | nim | Maman 
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11.0 THERMAL MANAGEMENT 


The heat generated by the Intel OverDrive proces- 
sor requires that heat dissipation be managed care- 
fully. All OverDrive processors are supplied with a 
heat sink attached with adhesive to the package. 
System designs must, therefore, provide sufficient 
clearance (a minimum of 0.25” above the heat sink) 
for the processor and the attached heat sink. 


Section 10 contains the physical dimensions for 
each of the heat sinks and packages used. 


Table 11-1. Thermal Resistance, IntelSX2™ OverDrive™ Processor with Attached Heat Sink 


— Oyg = 1.5°C/W 


Oya(?C/W) 


intel. 


The standard product markings and logo for the Intel 
OverDrive processor with the attached heat sink will 
be included on a 1in2 plate located on the top, cen- 
ter of the heat sink. . 


The heat sink is omni-directional, allowing air to flow 
from any direction in order to achieve adequate 
cooling. The thermal resistance values for the Over- 
Drive processors with an attached heat sink are 
shown in Table 11-1 through Tabie 11-3. 


Airflow (LFM) 


Table 11-2. Thermal Resistance, IntelDX2™ OverDrive™ Processor with Attached Heat Sink 


6455 = 2.0°C/W 


8yaC/W) 
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APPENDIX A 
DESIGN CONSIDERATIONS 


Intel has designed the family of OverDrive processors so that they can be installed by the end user. PC 
manufacturers can support this by implementing the design considerations listed in Table A-1. 


Table A-1. Design Considerations 


Design Consideration Implementation | 


Visible OverDrive Processor Socket | The Intel OverDrive processor socket should be easily visible when 
the PC’s cover is removed. Label the Intel OverDrive processor 
socket and the location of pin 1 by silk screening this information on 
the PC board. 


Accessible OverDrive Processor Make the Intel OverDrive processor socket easily accessible to the 

Socket end user (i.e., do not place the Intel OverDrive processor socket 
under a disk drive). If a Low Insertion Force (LIF) or screw machine 
socket is used, position the Intel OverDrive processor socket on the 
PC board such that there is ample clearance around the socket. 


Foolproof Chip Orientation Intel packages all Intel OverDrive processors in a 169-pin, PGA 
package. The 169th pin is called the “key” pin and insures that the 
Intel OverDrive processor fits into a 169-pin socket in only the 
correct orientation. Supplying a 169-pin socket as the Intel. 
OverDrive processor socket eliminates the possibility of end users 


or resellers damaging the PC board or Intel OverDrive processor by 
powering up the system with the Intel OverDrive processor 
incorrectly oriented. 


Zero Insertion Force The high pin count of the Intel OverDrive processor makes the 

Upgrade Socket insertion force required for installation in a screw machine PGA 
socket excessive for end users or resellers. Even most Low 
Insertion Force (LIF) sockets often require more than 60 Ibs. of 
insertion force. A Zero Insertion Force (ZIF) socket insures that the 
chip insertion force does not damage the PC board. If the ZIF 
socket has a handle, be sure to allow enough clearance for the 
socket handle. If a LIF or screw machine socket is used, additional 
PC board support is recommended. - 


“Plug and Play” Jumper or switch changes should not be needed to electrically 
configure the system for the Intel OverDrive processor. 

Thorough Documentation Describe the Intel OverDrive processor’s installation procedure in 
the PC’s User’s Manual. 
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APPENDIX B 
ZIF AND LIF SOCKET VENDORS 


The following list provides examples of sockets 
which can be used for Intel486 SX and Intel486 DX 
CPU-based systems. 


NOTE: | . 
This is not a comprehensive list. Intel has not test- 
ed all of the socket vendors’ sockets listed below 
and cannot guarantee that these sockets will meet 
every PC manufacturer’s specific requirements. 


Zero Insertion Force (ZIF) and Low Insertion 
Force (LIF) Socket Vendors: 


1. AMP Inc. . 
219 American Avenue 
Greensboro, NC 27409-1803 
Tel: (800) 522-6752 


2. Augat Inc. 
~ 425 John Dietsch Blvd. 
Attleboro Falls, MA 02763 
Tel: (800) 999-7646 


3. Aries Electronics 
| P.O. Box 130 
Frenchtown, NJ 08825 
Tel: (908) 996-3891 
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. Foxconn International Inc. 


930 West Maude Avenue 
Sunnyvale, CA 94086 
Tel: (800) 727-3699 


. JAE 


142 Technology Drive 
Irvine, CA 92718-2401 
Tel: (714) 753-2600 | 


. Loranger International Corp. 


817 Fourth Avenue 
Warren, PA 16365 


Tel: (814) 723-2250 


. Thomas and Betts 


200 Executive Center Drive 
P.O. Box 24901 

Greenville, SC 29616-2401 
Tel: (803) 676-2900 


. Yamaichi Electronics 


1420 Koll Circle, Suite B 
San Jose, CA 95112 
Tel: (800) 769-0797 
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ADVANCE INFORMATION 


82091AA 
ADVANCED INTEGRATED PERIPHERAL (AIP) 


Single-Chip PC Compatible I/O Solution 

for Notebook and Desktop Platforms: 

— 82078 Floppy Disk Controller Core 

— Two 16550 Compatible UARTs 

— One Multi-Function Parallel Port 

—IDE Interface 

— Integrated Back Power Protection 

— Integrated Game Port Chip Select 

— 5V or 3.3V Supply Operation with 5V 
Tolerant Drive Interface 

— Full Power Management Support 

— Supports Type F DMA Transfers for 
Faster 1/O Performance 

— No Wait-State Host !1/O Interface 

— Programmable Interrupt Interfaces 

— Single Crystal/Oscillator Clock 
(24 MHz) 

— Software Detectable Device ID 

— Comprehensive Powerup 
Configuration 


The 82091AA is 100 Percent 
Compatible with EISA, ISA and AT 


Host Interface Features 

— 8-Bit Zero Wait-State ISA Bus 
Interface 

— DMA with Type F Transfers 

— Five Programmable ISA Interrupt 
Lines 

— Internal Address Decoder 


Parallel Port Features 

— All IEEE Standard 1284 Protocols 
Supported (Compatibility, Nibble, 
Byte, EPP, and ECP) 

— Peak Bi-Directional Transfer Rate of 
2 MB/sec 

— Provides Interface for Low-Cost 
Engineless Laser Printer 

— 16-Byte FIFO for ECP 

— Interface Backpower Protection 


Floppy Disk Controller Features | 

— 100 Percent Software Compatible 
with Industry Standard 82077SL and 
82078 

-— Integrated Analog Data Separator 
250K, 300K, 500K, and 1 MBits/sec 

— Programmable Powerdown 
Command 

— Auto Powerdown and Wakeup 
Modes 

— Integrated Tape Drive Support 

— Perpendicular Recording Support for 
4 MB Drives | 

— Programmable Write Pre- : 
Compensation Delays 

— 256 Track Direct Address, Unilmited 
Track Support 

— 16-Byte FIFO 

— Supports 2 or 4 Drives 


16550 Compatible UART Features 

— Two Independent Serial Ports. 

— Software Compatible with 8250 and 
16450 UARTs | 

— 16-Byte FIFO per Serial Port- 

— Two UART Clock Sources, le 
MIDI Baud Rate 


IDE Interface Features 

— Generates Chip Selects for IDE 
Drives | 

— Integrated Buffer Control Logic 

— Dual IDE Interface Support 


Power Management Features 

— Transparent to Operating Systems 
and Applications Programs 

— Independent Power Control for Each 
Integrated Device 


100-Pin GFP Package 


(See Packaging Spec. 240800) 


The 82091AA Advanced Integrated Peripheral (AIP) is an integrated |/O solution containing a floppy disk 
controller, 2 serial ports, a multi-function parallel port, an IDE interface, and a game port on a single chip. The 
integration of these I/O devices results in a minimization of form factor, cost and power consumption. The 
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 82091AA | intel 


floppy disk controller is the 82078 core. The serial ports are 16550 compatible. The parallel port supports all of 
the IEEE Standard 1284 protocols (ECP, EPP, Byte, Compatibility, and Nibble). The IDE interface supports 
8- or 16-bit programmed I/O and 16-bit DMA. The Host Interface is an 8-bit ISA interface optimized for type 
“F” DMA and no wait-state |/O accesses. improved throughput and performance, the 82091AA contains six _ 
16-byte FIFOs—two for each serial port, one for the parallel port, and one for the floppy disk controller. The 
82091AA also includes power management and 3.3V capability for power sensitive applications such as . 
notebooks. The 82091AA supports both motherboard and add-in card configurations. 


4 " 
Fl Serial Port 
5 _ Host 16550 


26 Interface 


-Serial Port 
16550 


IDE 


intertace Parallel 


Port 


LPTx 


_ Floppy Drives 
: Parallel Port 
Buffer Direction 
or 
Game Port 
Chip Select 


PPDIR or 
GAME 
Chip Select 
Oscillator 
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Figure 1. 82091AA Advanced Integrated Peripheral Block Diagram 
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ADVANCED INTEGRATED PERIPHERAL (AIP) 


CONTENTS a PAGE 
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1.0. OVERVIEW 


The major functions of the 82091AA are shown in riguie a 1. A brief description of each of these functions is 
presented in this section. 


Host Interface 


The 82091AA host interface is an 8-bit direct-drive (24 mA) ISA Bus/X-Bus interface that permits the CPU to 
access its registers through read/write operations in |/O space. These registers may be accessed by pro-: 
grammed 1/O and/or DMA bus cycles. With the exception of the IDE Interface, all functions on the 82091AA 
require only 8-bit data accesses. The 16-bit access required for the IDE Interface is supported through the 
appropriate chip selects and data buffer enables from the 82091AA. | 


Figure 2 shows an example system implementation with the 82091 AA located on an ISA Bus add-in card. This | 
~ add-in card could also be used in a PCl-based system as shown in Figure 3. For motherboard implementa- 
tions, the 82091AA can be located on the X-Bus as shown in Figure 4. 


Host Bus 


ISA. ISA 
Interface | Master/Slave 


Serial Port A <— Parallel Port 
Serial Port B Floppy Disk Drives (up to 4) 


IDE Interface <—_— Game Port Chip Select 
290486-2 


- Figure 4 2. Block Diagram of the 82091AA on the ISA Bus 
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Main Host/PCI Bridge Pcl} iste 
Memory Agent Agent 


PC! Bus 


ISA 
Be ulon Badge Master/Slave 


ISA Bus 


Serlal Port A Parallel Port 


Serial Port B Floppy Disk Drives (up to 4) 


IDE Interface Game Port Chip Select 
290486-3 


Figure 3. Block Diagram of the 82091AA in a PCI System 


Main 
Memory 


X-Bus 


—— pe 


ISA Bus 


Parallel IDE Floppy 
ISA Port Interface Disk 
Master/Slave Drives 
i (up to 4) 


Figure 4. Block Diagram of the 82091AA on the X-Bus 
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82091AA 


Floppy Disk Controller © 


The 82091AA’s enhanced floppy disk controller 
(FDC) incorporates several new features allowing for 
easy implementation in both the portable and desk- 
~ top markets. It provides a low cost, small form factor 
solution targeted for 5.0V and 3.3V platforms. The 
FDC supports up to four drives. 


The 82091 AA’s FDC implements these new features 
while remaining functionally compatible with 82078/ 
82077SL/82077AA/8272A floppy disk controllers. 
Together, with a 24-MHz crystal, a resistor package 
and a device chip select, these clevices allow for the 
most integrated solution available. The integrated 
analog PLL data separator has better performance 
than most board level discrete PLL implementations 
and can be operated at 1 Mbps/500 Kbps/ 
300 Kbps/250 Kbps. A 16-byte FIFO. substantially 
improves system performance and is ideal for multi- 
master systems (e.g., EISA). 


Serial Ports 


The 82091AA contains two independent serial ports 
that provide asynchronous communications that are 
equivalent to two 16550 UARTSs. The serial ports 
have identical circuitry and provide the serial com- 
munication interface to a peripheral device or mo- 
dem via Serial Port A and Serial Port B. Each serial 
port can be configured for one of eight address as- 
signments. The standard PC/AT compatible logical 


address assignments for COM1, COM2, COM, and | 


COM4 are supported. 


The serial ports perform serial-to-parallel conversion 
on data characters received from a peripheral de- 
vice or modem, and parallel-to-serial conversion on 
data characters received from the host. The serial 
ports can operate in either FIFC mode or non-FIFO 
mode. In FIFO mode, a 16-byte transmit FIFO holds 
data from the host to be transmitted on the serial 
link and a 16-byte receive FIFO that buffers data 
from the serial link until read by the host. 
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intel. 
The serial ports contain programmable baud rate | 
generators that divide the internal reference clock 
by divisors of 1 to (216 — 1), and produce a 16x 
clock for driving the transmitter and receiver logic. 
The internal reference clock can be programmed to 
support MIDI. The serial ports have complete mo- 


dem-control capability and a prioritized interrupt sys- 
tem. 


Parallel Port 


The 82091AA provides a multi-function parallel port 
that transfers information between the host and pe- 
ripheral device (e.g., printer). The parallel port inter- 
face contains nine control/status lines and an 8-bit 
data bus. The standard PC/AT compatible logical 
address assignments for LPT1, LPT2, and LPTS are 
supported. The parallel port can be configured for 
one of four modes and supports the following IEEE 
Standard 1284 parallel interface protocol standards: 


Parallel Port Parallel Interface 
Mode Protocol 
ISA-Compatible Mode Compatibility, Nibble 
PS/2-Compatible Mode Byte 
EPP Mode EPP 
ECP Mode ECP 


For ISA-Compatible and PS/2-Compatible modes, 
software controls the handshake signals on the par- 
allel port interface to transfer data between the host 
and peripheral device. Status and Control registers 
permit software to monitor the state of the peripheral 
device and generate handshake sequences. 


The EPP parallel port interface protocol increases 
throughput by specifying an automatic handshake 
sequence. In EPP mode, the 82091AA parallel port 
automatically generates this handshake sequence in 
hardware to transfer data between the host and 
peripheral device. 
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In addition to a hardware handshake on the parallel! 
port interface, the ECP protocol specification also 
defines DMA and FIFO capability. To minimize pro- 
cessor overhead data transfer to/from a peripheral 
device, the 82091AA parallel port, in ECP mode, 
provides a 16-byte FIFO with DMA capability. 


IDE Interface 


The 82091AA supports the IDE (Integrated Drive 
Electronics) interface by providing chip selects and 
lower data byte control. Two chip selects are used to 
access registers on the IDE device. Separate lower 
and upper byte data control signals are provided. 
With these control signals, minimal external logic is 
needed to implement 16-bit IDE 1/O and DMA inter- 
faces. 


Game Port 


The 82091AA provides a game port chip select sig- 
nal for use when the 82091AA is in an add-in card 
application. This function is assigned to I/O address 
location 201h. Note that when the 82091 AA is locat- 
ed on the motherboard, this feature is not available. 


Power Management 


82091AA power management provides a mecha- 
nism for saving power when the device or a portion 
of the device is not being used. By programming the 
appropriate 82091AA registers, software can invoke 
power management to the entire 82091AA or select- 
ed modules within the 82091AA (e.g., floppy disk 
controller, serial port, or parallel port). There are two 
methods for applying power management—direct 
powerdown or auto powerdown. Direct powerdown 
turns off the clock to a particular module immediate- 
ly placing that module into a powerdown state. This 
method removes the clock regardless of the activity 
or status of the module. When auto powerdown is 
invoked, the module enters a powerdown state 
(clock is turned off) after certain conditions are met 
and the module is in an idle state. 
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1.1. 3.3V/5V Operating Modes 


The 82091AA can operate at a power supply of 
3.3V, 5V or a mix of 3.3V and 5V. The mixed power 
supply mode provides 5V interfaces for the floppy 
disk controller and parallel port while all other 
82091AA interfaces and internal logic (including the 
floppy disk controller and parallel port internal cir- 
cuitry) operate at 3.3V. The mixed mode permits 5V 
floppy disk drives and parallel port peripherals to be 
used in a 3.3V system without external buffering. 


NOTE: 
3.3V operation is available only in the 
82091 AA. 


2.0. SIGNAL DESCRIPTION 


This section describes the 82091AA signals. The in- 
terface signals are shown in Figure 5 and described 
in the following tables. Signal descriptions are orga- 
nized by functional group. 


Note that the ‘‘#” symbol at the end of a signal 
name indicates the active, or asserted, state occurs 
when the signal is at a low voltage level. When “#” 
is not present after the signal name, the signal is 
asserted when at the high voltage level. 


The terms assertion and negation are used exten- 
sively. This is done to avoid confusion when working 
with a mixture of “‘active-low” and ‘‘active-high’’ sig- 
nals. The term assert, or assertion, indicates that a 
signal is active, independent of whether that level is 
represented by a high or low voltage. The term ne- 
gate, or negation, indicates that a signal is inactive. 


The following notations are used to describe pin 
types: | 

| Input Pin 

QO Output Pin 

[/O Bi-Directional Pin 


82091AA 


SA[10:0] 
SD[7:0] 
IOCHRDY 
lIORC# 
lOWC# 
- AEN 
NOWS# 
RSTDRV Host 
TC interface 


Serial A 
Interface 


IRQ[7:3]# 


Serial B 


PPDREQ interface 


FDDREQ 
PPDACK#- 
FDDACK# 


X1/O0SC 
X2 


PD[7:0] 
STROBE# 
AUTOFD# 

INIT# 
SELECTIN# 
SELECT 
BUSY 
FAULT# 
PERROR 
ACK# 


Floppy 
Disk 


Parallel Interface 


Interface 


PPDIR/GCS# 


Game Port 
Chip 
Select 


IDE 
interface | 


Figure 5. 82091AA Signals - 


SOUTA 
SINA | 
DTRA# 
RTSA# 
DCDA# 
DSRA# 
CTSA# 
RIA# 


SOUTB 
SINB 

DTRB# 
RTSB# 
DCDB# 
DSRB# 
CTSB# 
RIB# 


RDDATA# 
WRDATA# 
HDSEL 

STEP# 

DIR# 

WE# 

TRKO# 

INDX# - 

WP# 
FDME0#/MEEN# 


FDS0#/MDSO# 


FDME1#/DSEN# 
FDS1#/MDS1# 
DSKCHG 
DRVDEN[1:0] 


IDECS[1:0]# 
DEN# 
HEN# 
1016# 
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2.1 Host Interface Signals 


Signal meats 


ISA SIGNALS 


SA[10:0] SYSTEM ADDRESS BUS: The 82091AA decodes the standard ISA I/O address 
space using SA[9:0]. SA10 is used along with SA[9:0] to decode the extended 
register set of the ECP parallel port. SA[10:0] connects directly to the ISA system 
address bus. 

SD[7:0] I/O | SYSTEM DATA BUS: SD[7:0] is a bi-directional data bus. Data is written to and 


read from the 82091AA on these signal lines. SD[7:0] connect directly to the ISA 
system data bus. . 7 


I/O READ COMMAND STROBE: IORC # is an I/O access read control signal. 
When a valid internal address is decoded by the 82091AA and IORC # is asserted, 
data at the decoded address location is driven onto the SD[7:0] signal lines. 


1/0 WRITE COMMAND STROBE: IOWC # is an I/O access write control signal. 
When a valid internal address is decoded by the 82091AA and IOWC # is asserted, 
data on the SD[7:0] signal lines is written into the decoded address location at the 
rising edge of IOWC#. 


NO WAIT-STATES: End data transfer signal. The 82091AA asserts NOWS# when 
a valid internal address is decoded by the 82091AA and the IORC# or lIOWC# 
signal is asserted. This reduces the total bus cycle time by eliminating the wait- 
states associated with the default 8-bit |/O cycles. NOWS # is not asserted for IDE 
accesses or DMA accesses. This is an open drain output pin. 


I/O CHANNEL READY: The 82091AA uses this signal for parallel port data 
transfers when the parallel port is in EPP mode. In this case, the 82091 AA negates 
IOCHRDY to extend the cycle to allow for completion of transfers to/from the 
peripheral attached to the parallel port. When the parallel port is in EPP mode, the 
82091AA negates IOCHRDY to lengthen the ISA Bus cycle if the parallel port BUSY 
signal is asserted. 


The 82091AA also uses IOCHRDY during hardware configuration time (see Section 
4.0, AIP Configuration). If IOWC #/IORC # is asserted to the 82091AA during 
hardware configuration time, the 82091AA negates IOCHRDY until hardware 
configuration time is completed. This is an open drain output pin. 


ADDRESS ENABLE: AEN is used during DMA cycles to prevent the 82091AA from 
misinterpreting DMA cycles from valid !/O cycles. When negated, AEN indicates 
that the 82091AA may respond to address and I/O commands addressed to the 
82091AA. When asserted, AEN informs the 82091AA that a DMA transfer is _ 
occurring. When AEN is asserted and a xDACK# signal is asserted, the 82091AA 
responds to the cycle as a DMA cycle. 


lORC # 


lIOWC# 


NOWS # 


lIOCHRDY 


AEN 


RESET DRIVE: RSTDRV forces the 82091AA to a known state. All 82091AA 
registers are set to their default state. 

CRYSTAL1/OSCILLATOR: Main clock input signal can be a 24 MHz crystal 
connected across X1 and X2 or a 24 MHz TTL level clock input connected to X71. 


CRYSTAL2: This signal pin is connected to one side of the crystal when a crystal 
oscillator is used to provide the main clock. If an external oscillator/clock is 
connected to X1, this pin is not used and left unconnected. 


RSTDRV 


X1/O0SC 


>< 
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2.1 Host Interface Signals (Continued) 


Signal 


DMA SIGNALS 


FDDREQ | 
FDDACK # ¥ 


PPDREQ 


PPDACK # 


ae 
INTERRUPT SIGNALS 
IRQ3, IRQ4 


IRQ5,IRQ7 | O 


4-14 


Description 


FLOPPY DISK CONTROLLER DMA REQUEST: The 82091AA asserts FDDREQ 
to request service from a DMA controller for the FDC module. This signal is 
enabled/disabled by bit 3 of the Digital Output Register (DOR). When disabled, 
FDDREQ( is tri-stated. 


FLOPPY DISK CONTROLLER DMA ACKNOWLEDGE: The DMA controller 
asserts this signal to acknowledge the FDC DMA request. When asserted, the 
lORC # and IOWC # inputs are enabled during DMA transfers. This signal is 
enabled/disabled by bit 3 of the DOR. 


PARALLEL PORT DMA REQUEST: Parallel port DMA service request to the 


system IDMA controller. This signal is only used when the parallel port is in ECP 
hardware mode and is always negated when the parailel port is not in this mode. In 
ECP hardware mode DMA requests are enabled/disabled by bit 3 of the ECP 
Extended Control Register (ECR). When disabled, PPDREQ is tri-stated. 


PARALLEL PORT DMA ACKNOWLEDGE: The DMA controller asserts this signal 
to acknowledge the parallel port DMA request. When asserted the |ORC# and 
lIOWC # inputs are enabled during DMA transfers. This signal is enabled/disabled 
by bit 3 of the ECR Register. 


TERMINAL COUNT: The system DMA controller asserts TC to indicate it has 
reachec! the last programmed data transfer. TC is pease only when FDDACK # 
or PPDACK # is aseeieus 


INTERRUPT 3 AND 4: IRQ3 and IRQ4 are associated with the serial ports and 
can be programmed (via the AIPCFG2 Register) to be either active high or active 
low. These signals can be configured for a particular serial channel via hardware 
configuration (at powerup) or by software configuration. 

Under Hardware Configuration _ 

IRQ3 is used as a serial port interrupt if the serial port is configured at address 
locations 2F8h—2FFh or 2E8h-2EFh. IRQ4 is used as a serial port interrupt if the 
serial port is configured at address locations 3F8h-3FFh or 3E8h-3EFh. 
Under Software configuration 


IRQ3 and IRQ4 are independently configured (. e., the IRQ does not automatically 
track the communication port address assignment). 


These interrupts are enabled/disabled globally via bit 3 of the serial port Modem 
Control Register (MCR) and for specific conditions via the Interrupt Enable 
Register (IER). |IRQ3 and IRQ4 are tri-stated when not enabled. 


INTERRUPT REQUEST 5: IRQ5 and IRQ7 are associated with the parallel port 
and can be programmed (via AIPCFG2 Register) to be either active high or active 
low. Either IRQ5 or IRQ7 is enabled/disabled via PCFG1 Register to signal a 
parallel port interrupt. The interrupt not selected is disabled and tri-stated. 


During hardware configuration (see Section 4.0, AIP Configuration), IRQ5 is used if 
the parallel port is assigned to 278h-—27Fh and IRQ7 is used if the parallel port 
interrupt is assigned to either 3BCh-3BFh or 378h-37Fh. 
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2.1 Host Interface Signals (Continued) 


INTERRUPT SIGNALS (Continued) 
INTERRUPT REQUEST 6: IRQG6 is associated with the floppy disk controller and can 


be programmed (via the AIPCFG2 Register) to be either active high or active low. In 
non-DMA mode this signal is asserted to signal when a data transfer is ready. IRQ6 is 
also asserted to signal the completion of the execution phase for certain FDC 
commands. This signal is enabled/disabled by the DMAGATE bit in the Digital a 
Register of the FDC. The signal is tri-stated when disabled. 


2.2 Floppy Disk Controller Interface 


Signal 
RDDATA# | | | READ DATA: Serial data from the disk drive. 


WRDATA# WRITE DATA: MFM serial data to the disk drive. Precompensation value is 
selectable through software. 


HDSEL . HEAD SELECT: Selects which side of a disk is to be accessed. When asserted 
(low), side 1 is selected. When negated (high), side 0 is selected. 


STEP# STEP: STEP # supplies step pulses (asserted) to the drive to move the head 
between the tracks during a seek operation. 

DIR # DIRECTION: Controls the direction the head moves when a step signal is present. 
The head moves toward the center when DIR # is asserted and away from the 
center when negated. 

WRITE ENABLE: WE # is a disk drive control signal. When asserted, WE # 
enables the head to write to the disk. 


‘TRKO# | 1 | TRACKO: The disk drive asserts this signal to indicate that the head is on track 0. 
INDX # Fire INDEX: The disk drive asserts this signal to indicate the beginning of the track. 


WRITE PROTECT: The disk drive asserts this signal to indicate that the disk drive 
DSKCHG sot 


is write-protected. | 
DISK CHANGE: The disk drive asserts this signal to indicate that the drive door 
DRIVDENO 
DRIVDEN1 


has been opened. The state of this signal input is available in the Digital Input 
Register (DIR #). 


DRIVE DENSITY: These signals are used by the disk drive to configure the drive 
for the appropriate media density. These signals are controlled by the FDC’s Drive 
Specification Command. 
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2.2 Floppy Disk Controller Interface (Continued) — 


Description 
FDME1 #/ FLOPPY DRIVE MOTOR ENABLE 1, IDLE, OR DRIVE SELECT ENABLE: This 
DSEN #(1) signal pin has two functions(™). FDME1 # is the motor enable for drive 1. FDME1 # 


is directly controlled via the Digital Output Register (DOR) and is a function of the 
mapping based on the BOOTSEL bits in the Tape Drive Register (TDR). 


The Drive Select Enable (DSEN #) function is only used in a four floppy drive . 
system (see Appendix A, FDC Four Drive Support). 


FDS1 #/ FLOPPY DRIVE SELECT1, POWERDOWN, OR MOTOR DRIVE SELECT 1: This 
-MDS1(1) signal pin has two functions(1). FDS1 # is the floppy drive select for drive 1. FDS1# 
is controlled by the select bits in the DOR and is a function of the mapping based 
on the BOOTSEL bits in the TDR. 


The Motor Drive Select 1 (MDS1) function is only used in a four floppy drive system 
(see Appendix A, FDC Four Drive Support). 


FDMEO #/ FLOPPY DRIVE MOTOR ENABLE 0 OR MOTOR ENABLE ENABLE: This signal 
MEEN #(1) pin has two functions(1). FDMEO# is the motor enable for drive 0. FDME0 # is 
directly controlled via the Digital Output Register (DOR) and is a function of the 
mapping based on the BOOTSEL bits in the Tape Drive Register (TDR). 


The Motor Enable Enable (MEEN #) function is only used in a four floppy drive 
system (see Appendix A, FDC Four Drive Support). 


FDSO#/ FLOPPY DRIVE SELECT 0 OR MOTOR DRIVE SELECT 0: This signal pin has two 
MDS0(1) functions(). FDSO# is the floppy drive select for drive 0. This output is controlled 
by the drive select bits in the DOR and is a function of the mapping based on 
BOOTSEL bits in the TDR. 


The Motor Drive Select 0 (MDSO) function is only used in a four floppy drive system 
(see eppere A, FDC Four Drive Support). 


NOTE: 
1. The function selected for these pins is based on the FDDQTY bit in the FCFG1 Register as shown in the following table. 


<j nal Pin 2 Drive System 4 Drive System 
: (FDDQTY = 0) (FDDQTY = 1) 


FDME1 #/DSEN# FDME1 # DSEN # 


FDS1#/MDS1 # FDS1# MDS1 
FDMEO#/MEEN # | FDMEO# — MEEN # 


a When FDDQTY = 1, these signal pins are used to control an external decoder for a four floppy disk 
drive system as described in Appendix A, FDC Four Drive Support. 
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2.3 Serial Port interface 


Serial Port A signal names end in the letter A and Serial Port B signal names end in the letter B. Serial Port A 
and B signals have the same functionality. 


Signal | : does | ee 


CTSA#, CLEAR TO SEND: When asserted, this signal indicates that the modem or data 
CTSB# set is ready to exchange data. The CTS# signal is a modem status input whose 
condition the CPU can determine by reading the CTS bit in Modem Status 
Register (MSR) for the appropriate serial port. The CTS bit is the compliment of 
the CTS# signal. The DCTS bit in the MSR indicates whether the CTS # input 
has changed state since the previous reading of the MSR. CTS# has no effect on 
the transmitter. 


DCDA#, DATA CARRIER DETECT: When asserted, this signal indicates that the data 

DCDB # carrier has been detected by the modem or data set. The DCD# signalisa _ 
modem status whose condition the CPU can determine by reading the DCD bit in 
the MSR for the appropriate serial port. The DCD bit is the compliment of the 
DCD # signal. The DDCD bit in the MSR indicates whether the DCD # input has 
changed state since the previous reading of the MSR. DCD # has no effect on the 
transmitter. 


DATA SET READY: When asserted, this signal indicates that the modem or data 
set is ready to establish the communications link with the serial port module. The 
DSR # signal is a modem status whose condition the CPU can determine by 

reading the DSR bit in the MSR for the appropriate serial channel. The DSR bit is 
the compliment of the DSR# signal. The DSR bit in the MSR indicates whether 

the DSR # input has changed state since the previous reading of the MSR. DSR # 
has no effect on the transmitter. 


DATA TERMINAL READY: DTRA#/DTRB# are outputs during normal system 
operations. When asserted, this signal indicates to the modem or data set that 
the serial port module is ready to establish a communications link. The DTR# | 
signal can be asserted via the Modem Control Register (MCR). A hard reset 
negates this signal. 


Hardware Configuration 


These signals are only inputs during hardware configuration time (RSTDRV 
asserted and for a short time after RSTDRV is negated). (See Section 4.0, AIP 
Configuration.) 


RING INDICATOR: When asserted, this signal indicates that a telephone ringing 
signal has been received by the modem or data set. The RI # signal is a modem 
status input whose condition the CPU can determine by reading the RI bit in the 
MSR for the appropriate serial channel. The RI bit is the compliment of the RI # 
signal. The TERI bit in the MSR indicates whether the RI # input has changed 
from low to high since the previous reading of the MSR. 


DSRA#, 
DSRB # 


RIA#, RIB# 
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2.3 Serial Port Interface (Continued) 


Description 


REQUEST TO SEND: RTSA#/RTSB# are outputs during normal system 
operations. When asserted, this signal informs the modem or data set that the 
serial port module is ready to exchange data. The RTS# signal can be asserted 
via the RTS bit in the Modem Control Register. A hard reset negates this signal. 
Hardware Configuration 

These signals are only inputs during hardware configuration time (RSTDRV 
asserted and for a short time after RSTDRV is negated). (See Section 4.0, AIP 

- Configuration.) 


SINA, SINB SERIAL INPUT: Serial data input from the communications link. (Peripheral 


device, rnodem, or data set.) 


SERIAL OUTPUT: SOUTA/SOUTB are serial data outputs to the communications 
link duririg normal system operations. (Peripheral device, modem, or data set.) The 
SOUT signal is set to a marking state (logic 1) after a hard reset. | 

Test Mode | 

In test mode (selected via the SACFG2 or SBCFG2 Registers), the baudout from 
the baud rate generator is output on SOUTx. 

Hardware Configuration 

These signals are only inputs during hardware configuration time (RSTDRV 
asserted and for a short time after RSTDRV is negated). (See Section 4.0, AIP 
Configuration. ) 


2.4 IDE Interface © 
Description 


16-BIT 1/0: This signal is driven by |/O devices on the ISA Bus to indicate 
support for 16-bit |/O bus cycles. The IDE interface asserts this signal to the 
82091AA to indicate support for 16-bit transfers. For IDE transfers, the 82091AA 
asserts HEN # when |016# is asserted. 


IDE CHIP SELECT: IDECS[1:0] # are outputs during normal ae operation 
and are chip selects for the IDE interface. IDECS[1:0] # select the Command 
Block Registers of the IDE device and are decoded from SA[9:3] and AEN. 

- Hardware Configuration 


These signals are only inputs during hardware configuration time (RSTDRV 
asserted). (See Section 4.0, AIP Configuration.) 


| IDECS[1:0] # I/O 
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IDE Interface (Continued) 


Signal 


DEN # 1/O DATA ENABLE: DEN # is an output during normal system operations and is a data 
enable for an external data buffer for all 82091AA and IDE accesses. The SD[7:0] 
signals can be connected directly to the ISA. In this case, the DEN # signal is not used. 
However, an external buffer can be used to isolate the SD[7:0] signals from the 240 pF 
loading of the ISA Bus. With an external buffer implementation, DEN# controls the 
external buffers for transfers to/from the ISA Bus. 


Hardware Configuration 


This signal is only an input during hardware configuration time (RSTDRV asserted). 
(See Section 4.0, AIP Configuration.) 


HEN# | 1/O IDE UPPER DATA TRANSCEIVER ENABLE: HEN # is an output during normal system 
operations and is a high byte data transceiver enable signal for the IDE hard disk drive 
interface. HEN # is asserted for |/O accesses to the IDE data register when the drive 
asserts 1016 #. 

Hardware Configuration 
This signal is only an input during hardware configuration time (RSTDRV asserted). 


(See Section 4.0, AIP Configuration.) 


2.5 Parallel Port External Buffer Control/Game Port 


Signal 


PPDIR/GCS # PARALLEL PORT DIRECTION (PPDIR) or GAME PORT CHIP SELECT 
(GCS #): This signal is an output during normal operations and provides the 
PPDIR and GCS # functions as follows: 


PPDIR 


This signal pin functions as a parallel port direction control output when the 
82091AA is configured for software motherboard mode (SWMB). For 
configuration details, see Section 4.0, AIP Configuration. If external buffers are 
used on PD[7:0], PPDIR can be used to control the buffer direction. The 
82091AA drives this signal low when PD[7:0] are outputs and the 82091AA 
drives this signal high when PD[7:0] are inputs. Note that if a configuration 
mode other than SWMB is selected, this signal pin is a game port chip select 
and does not track the PD[7:0] signal direction. 


GCS # 


This signal pin functions as a game port chip select output when 82091AA 
configuration is set for Software Add-In (SWAI), Hardware Basic (HWB), or 
Hardware Extended (HWE) modes. When the host accesses I/O address 201h, 
GCS # is asserted. 


Hardware Configuration 


This signal is only an input during hardware configuration time (RSTDRV 
asserted). (See Section 4.0, AIP Configuration.) 
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2.6 Parallel Port Interface 


The 82091AA parallel port is a multi-function inter- | 
face that can be configured for one of four hardware 
modes (see Section 4.0, AIP Configuration). The 
hardware modes are ISA-Compatible, PS/2-Com- 


: | 

intel. 
Table 1 shows a matrix of the 82091AA parallel port 
signal names and corresponding signal names for 
each of the protocols. Sections 2.6.1-2.6.5 provide 
a signal description for the five interface protocols. 


Note that the 82091AA hardware operations are the 
same for Compatibility and Nibble protocols. The 


signals, however, are controlled and used differently 
via software and the peripheral device. 


patible, EPP, and ECP modes. These parallel port 
modes support the compatibility, nibble, byte, EPP 
and ECP parallel interface protocols described in the 
IEEE 1284 standard. The operation and use of the 
interface signal pins are a function of the parallel 
port hardware mode selected and the protocol used. 


: Table 1. Parallel Port Signal Name Cross Reference 


ade Compatibility | Nibble Protocol Byte Protocol | EPP Protocol | ECP Protocol 
Signal eretecol signal Signal Names | Signal Names | Signal Names Signal Names 
Names Names 9 g g g 


STROBE # Strobe # 
BUSY Busy 


Ce a 
Datals1] =| = — - | Datals:1] Datals:1] 


NOTE: 
Not all parallel port signal pins are used for certain parallel port interface protocols. These signals are labeled “—”. 
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2.6.1 COMPATIBILITY PROTOCOL SIGNAL DESCRIPTION 


Except for the data bus, the 82091AA and compatibility protocol signal names are the same. For the data bus, 
the 82091AA signal names PD[7:0] corresponds to the compatibility protocol signal names Data[8:1]. 


82091AA 
Signal 
Name 


STROBE # / 
_ _ 
nti 


Compatibility Protocol Signal Name and Description 


STROBE: The host asserts STROBE # to latch data into the peripheral device’s 
input latch. This signal is controlled via the PCON Register. 


BUSY: BUSY is asserted by the peripheral to indicate that the peripheral device 
is not ready to receive data. The status of this signal line is reported in the PSTAT 
Register. 


ACKNOWLEDGE: The printer asserts this signal to indicate that it has received 
the data and is ready for new data. The status of this signal line is reported in the 
PSTAT Register. 


SELECT: SELECT is asserted by the peripheral device to indicate that the device 
is on line. The status of this signal line is reported in the PSTAT Register. 


PAPER ERROR: The peripheral device asserts PERROR to indicate that it has 
encountered an error in the paper path. The exact meaning varies from peripheral 


SELECT ge 
device to peripheral device. The status of this signal line is reported in the PSTAT 


PERROR 
Register. 


FAULT # FAULT: FAULT # is asserted by the peripheral device to indicate that an error 
has occurred. The status of this signal line is reported in the PSTAT Register. 

INIT # INITIALIZE: The host asserts INIT # to issue a hardware reset to the peripheral 
device. This signal is controlled via the PCON Register. 


AUTOFD# © AUTO FEED: AUTOFD # is asserted by the host to put the peripheral device into 
auto-line feed mode. This means that when software asserts this signal, the 
printer is instructed to advance the paper one line for each carriage return _ 
encountered. This signal is controlled via the PCON Register. 


PD[7:0] F260 DATA: Forward channel data. | 


SELECTIN# SELECT INPUT: SELECTIN # is asserted by the host to select a peripheral 
device. This signal is controlled via the PCON Register. 
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2.6.2 NIBBLE PROTOCOL SIGNAL DESCRIPTION 


The Nibble protocol assigns the following signal operation to the parallel port pins. The name in bold at the 
beginning of the signal description column is the Nibble protocol signal name. The terms assert and negate 
are used in accordance with the 82091AA signal name as described at the beginning of Section 2.0. For 
example, AUTOFD# (HostBusy) asserted refers to AUTOFD# (HostBusy) at a low level. 


82091AA 


Signal 
Name 


STROBE # 
BUSY 


SELECT 


PERROR> 
FAULT # 


INIT # 


| AUTOFD# 


PD[7:0] DATA: This 8-bit output data path to the peripheral Host data is written to the 
peripheral attached to the parallel port interface on these signal lines. _ - 


SELECTIN# SELECT INPUT: This signal is controlled by the PCON Register. 


Type | _Nibble Protocol Signal Name and Description 


STROBE: The host controls this signal via the PCON Register and STROBE # 
should be held negated by the host. 


PRINTER BUSY (PtrBusy): The peripheral drives this signal to transfer data bits 
3 and 7 sequentially. The status of this signal line is reported in the PSTAT 
Register. 


PRINTER CLOCK (PtrClk): The peripheral device asserts ACK # (PtrClk) to 
indicate to the host that data is available. The signal is subsequently asserted to 
qualify clata being sent to the host. The status of this signal line is reported in the 
PSTAT Register. if interrupts are enabled via the PCON Register, the assertion of 
this sigrial causes a host interrupt to be generated. 


XFLAG: The peripheral device drives this signal to transfer data bits 1 and 5 
sequentially. The status of this signal line is reported in the PSTAT Register. 


ACKNOWLEDGE DATA REQUEST (AckDataReq): This signal is initially high. 
The peripheral device drives this signal low to acknowledge HostBusy assertion. 
PERROR is subsequently used to transfer data bits 2 and 6 sequentially. Ne 
Status of this signal line is reported in the PSTAT Register. 


DATA AVAILABLE (DataAvail): The peripheral device asserts FAULT # 
(DataAvail) to indicate data availability. Subsequently used to transfer data bitsO | 
and 4 sequentially. The status of this signal line is reported in the PSTAT 
ees 

INITIALIZE: The host controls this signal via the PCON Register. 


2 we BUSY (HostBusy): The host negates AUTOFD# (HostBusy) in response 
to ACK# being asserted. This signal is subsequently driven low to enable the 
peripheral to transfer data to the host. AUTOFD # is then driven high to 
acknowledge receipt of byte data. This signal is controlled via the PCON 
Register. 


y 
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2.6.3 BYTE MODE SIGNAL DESCRIPTION 


The Byte protocol assigns the following signal operation to the parallel port pins. The name in bold at the 
beginning of the signal description column is the Byte protocol signal name. The terms assert and negate are 
used in accordance with the 82091AA signal name as described at the beginning of Section 2.0. For -Srample: 
STROBE # (HostClk) asserted refers to STROBE # (HostCik) at a low level. 


82091AA : ee 
Signal Name Byte Protocol Signal Name and Description | 
STROBE # HOST CLOCK (HostClik): This signal is strobed low by the host to acknowledge 
receipt of data. Note that the peripheral must not interpret this as a latch strobe 
for forward channel data. 


BUSY a PRINTER BUSY (PtrBusy): The peripheral device asserts BUSY (PtrBusy) to 


provide forward channel peripheral busy status. The status of this signal line is 
ae 


reported in the PSTAT Register. 
PRINTER CLOCK (PtrCik): The peripheral device asserts ACK # (PtrCik) to 
indicate to the host that data is available. The signal is subsequently asserted to 
qualify data being sent to the host. The status of this signal line is reported in the 
PSTAT Register. If interrupts are enabled via the PCON Register, the assertion 

of this signal causes a host interrupt to be generated. 

SELECT =) 

PERROR 
The peripheral device drives this signal low to acknowledge HostBusy assertion. 
The status of this signal line is reported in the PSTAT Register. 

FAULT # DATA AVAILABILITY (DataAvail): The peripheral device asserts FAULT # 
(DataAvail) to indicate data availability. The status of this signal line is reported in 
the PSTAT Register. 

INIT # INITIALIZE: The host controls this signal via the PCON Register and INIT # 
should be held in the negated state. 

AUTOFD# HOST BUSY (HostBusy): The host negates AUTOFD # (HostBusy) in response 
to ACK # being asserted. The signal is subsequently driven low to enable the 
peripheral to transfer data to the host. AUTOFD# is then driven highto _ 
acknowledge receipt of nibble data. This signal is controlled via the PCON 
Register. 
PD[7:0] Oc 4 DATA: This 8-bit data bus is used for bi-directional data transfer. 


XFLAG: SELECT (XFLAG) is asserted by the peripheral device to indicate that 
SELECTIN# | I/O | SELECT INPUT: This signal is controlled by the PCON Register. 


the device is on line. The status of this signal line is reported in the PSTAT 
Register. | 


ACKNOWLEDGE DATA REQUEST (AckDataReq): This signal is initially high. 
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2.6.4 ENHANCED PARALLEL PORT (EPP) PROTOCOL SIGNAL DESCRIPTION 


EPP protocol assigns the following signal operation to the parallel port pins. The name in bold at the beginning 
of the signal description column is the EPP mode signal name. The terms assert and negate are used in 
accordance with the 82091AA signal name as described at the beginning of Section 2.0. For example, BUSY 
(Wait #) asserted refers to BUSY (Wait#) being high. 


82091AA a 
Signal Name yP 


-STROBE# 


EPP Protocol Signal Name and Description 


WRITE (Write #): STROBE # (Write #) indicates an address or data read/write 
operation to the peripheral. The 82091AA drives this signal low for a write and 
high for a read. 


WAIT (Wait #): The peripheral sets BUSY (Wait#) low to indicate that the device 
is not ready. When BUSY signal is low, the 82091AA negates IOCHRDY on the 
ISA Bus to lengthen the I/O cycles. The peripheral device sets BUSY (Wait #) 
high to indicate that transfer of data or address is completed. 


INTERRUPT REQUEST (intr): The peripheral asserts. ACK # (Intr) to generate 
an interrupt the host. When this signal is low and interrupts are enabled via bit 4 
of the PCON Register, the 82091AA generates an interrupt request (via either 
IRQ5 or IRQ7) to the host. 


SELECT: SELECT is asserted by the peripheral device to indicate that the 
device is on line. The status of this signal line is reported in the PSTAT Register. 


PAPER ERROR: The peripheral device asserts PERROR to indicate that it has 
encountered an error in the paper path. The exact meaning varies from 
peripheral device to peripheral device. The status of this signal line is reported in 
the PSTAT Register. 


FAULT: FAULT # is asserted by the peripheral device to indicate that an error 
has occurred. The status of this signal line is reported in the PSTAT Register. 


SELECT 


PERROR | 


FAULT # 
‘INIT# 


| JNUTORD 2 


ell 


| SELECTIN# 


2.6.5 EXTENDED CAPABILITIES PORT (ECP) PROTOCOL SIGNAL DESCRIPTION 


INITIALIZE: The host asserts INIT # to issue a hardware reset to the ee 
device. This signal is controlled via the PCON Register. 


DATA STROBE (DStrb #): The 82091AA asserts AUTOFD# (DStrb#) to 
indicate that valid data is present on PD[7:0] and is used by the peripheral to. 
latch data during write cycles. For reads, the 82091AA reads in data from 
PD[7:0] when this signal is asserted. 


DATA: This 8-bit bi-directional bus provides addresses or data during the write 
cycles and supplies addresses or data to the 82091AA during the read cycles. 


ADDRESS STROBE (AStrb #): The 82091AA asserts SELECTIN # (AStrb#) to 
indicate that a valid address is present on PD[7:0] and is used by the peripheral 
to latch addresses during write cycles. For reads, the 82091AA reads in an 

address from PD[7:0] when this signal is asserted. 


ECP protocol assigns the following signal operation to the parallel port pins. The name in boid at the beginning 
of the signal description column is the ECP protocol signal name. The terms assert and negate are used in 
accordance with the 82091AA signal name as described at the beginning of Section 2.0. For example, 
STROBE # (HostCik) asserted refers to STROBE # (HostClk) being low. | 
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82091AA 
Signal Type ECP Protocol Signal Name and Description 


Name 


HOST CLOCK (HostClk): In the forward direction, the 82091 AA asserts 
STROBE # (HostClik) to instruct the peripheral to latch the data on PD[7:0]. 


STROBE # 
During write operations, the peripheral should latch data on the rising edge of 
STROBE # (HostCik). STROBE # (HostCik) handshakes with BUSY (PeriphAck) 
during write operations and is negated after the 82091AA detects BUSY 
(PeriphAck) asserted. STROBE # (HostClk) is not asserted by the 82091 AA again 
until BUSY (PeriphAck) is detected negated. For read operations (reverse 
direction), STROBE # (HostCik) is not used. 

BUSY 7 PERIPHERAL ACKNOWLEDGE (PeriphAck): The peripheral device asserts this 


signal during a host write operation to acknowledge receipt of data. The 


peripheral device then negates the signal after STROBE # is detected high to 
SELECT 


terminate the transfer. For host write operations (forward direction), this signal 
handshakes with STROBE # (HostCik). During a host read operation (reverse 
PERROR v 
FAULT # . 


- | 
an address/RLE or data. The 82091AA asserts this signal to identify an address/ 


AUTOFD # 
. RLE transfer and negates it to identify a data transfer. 


PD[7:0] DATA: PD[7:0] is a bi-directional data bus that transfers data, addresses, or RLE 
data. 

SELECTIN # ECP MODE (ECPmode): The host (via the PCON Register) negates this signal 
during ECP mode operation. 


direction), BUSY (PeriphAck) is normally low and is driven high by the peripheral 
to identify Run Length Encoded (RLE) data. : 


PERIPHERAL CLOCK (PeriphClik): During a peripheral to host transfer (reverse 
direction), ACK # (PeriphClk) is asserted by the peripheral to indicate data is valid 
on the data bus and then negated after AUTOFD # is detected high. This signal 


handshakes with AUTOFD # to transfer data. | 


XFLAG (Xflag): This signal is asserted by the peripheral to indicate that it is on- 
line. The status of this signal line is reported in the PSTAT Register. 


ACKNOWLEDGE REVERSE (AckReverse # ): PERROR (AckReverse #) is 
driven low by the peripheral to acknowledge a reverse transfer request by the 

host. This signal handshakes with INIT# (ReverseRequest#). The status of this 
signal line is reported in the PSTAT Register. 


PERIPHERAL REQUEST (PeriphRequest+# ): The peripheral asserts FAULT # 
(PeriphRequest#) to request a reverse transfer. The status of this signal line is 
reported in the PSTAT Register. 


REVERSE REQUEST (ReverseRequest#): The host controls this signal via the 
PCON Register to indicate the transfer direction. The host asserts this signal to 

request a reverse transfer direction and negates the signal for a forward transfer 
direction. 


HOST ACKNOWLEDGE (HostAck): The 82091AA asserts AUTOFD # 
(HostAck) to request data from the peripheral (reverse direction). This signal 

handshakes with ACK # (PeriphClk). AUTOFD # (HostAck) is negated when the 
peripheral indicates valid state of the data bus (i.e., ACK # is detected asserted). 


In the forward direction, AUTOFD# (HostAck) indicates whether PD[7:0] contain 
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2.7 Hard Reset Signal Conditions 


Table 1 shows the state of all 82091AA output and bi-directional signals during hard reset (RSTDRV asserted). 
The strapping options described in Section 4.0, AIP Configuration are sampled when.the 82091AA is hard 
reset. | 


Table 2. Output and I/O Signal States During a Hard Reset 
Signal Name 


| State — Signal Name | State | Signal Name | State | 
<a oe 
‘ ATSIABI¢ 
SDI7:0] 


IDECS[1,0] # 
INDX# i a od 
ew | sect 
swiasl | 
T—] [sourias 


0 
O 


STROBE # 


TRKO# 


a 
_DTRIA,B] # 
FAULT # et 
= 


Ww 
High) 


NOTES: 


1. During and immediately after a hard reset, this signal is an input for hardware configuration. After the hardware configura- 
tion time, these signals go to the state specified in the table. 

2. If |ORC# or IOWC# is asserted, IOCHRDY will be asserted by the IOCHRDY. 

3. Dashes represent input signals. 
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2.8 Power And Ground 


to Vcc. 


is applied to Vcc. 


NOTE: 
1. 3.3V operation is available only in the 82091AA. 


3.0 1/0 ADDRESS ASSIGNMENTS 


The 82091AA assigns CPU I/O address locations to 
its game port chip select, IDE interface, serial ports, 
parallel port, floppy disk controller, and the 82091AA 
configuration registers as indicated in Table 3. Ex- 
cept for the game port chip select (address 201h), 
address assignments are configurable. For example, 
the serial port can be assigned to one of eight ad- 
dress blocks. The parallel port can be assigned to 
one of three address blocks, and the IDE interface 
and floppy disk controller can be assigned to one of 
two address blocks. These address _ assign- 
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vss} GROUND: The ground reference for the 82091AA. 


POWER: The 5V/3.3V(1) modes are selected via strapping options at power-up (see 
Section 4.2, hardware Configuration). When strapping options (Vs__) are set to 5V, the 
Vcc pins must be connected to 5V. When strapping options are set to 3.3V, the Voc 

pins must be connected to 3.3V. 


POWER: The 5V/3.3V(1) power supply for the 82091AA. In 5V or 3.3V power supply 
modes (non-mixed mode), the voltage apeney to Vccr is the same voltage as applied 


For mixed mode operations, 5V is applied to Vocr. This voltage provides 5V reference 
for the parallel port and floppy disk controller interfaces. Note that in mixed mode, 3.3V 


82091AA 


ments are made during 82091AA configuration (ei- 
ther hardware configuration at powerup or a hard 
reset, or software configuration by programming the 
82091AA configuration registers). In addition, the 
82091AA configuration registers can be located at 
one of two address blocks during hardware configu- 
ration. 


All of the 82091AA address locations are located in 
the host I/O address space. The address block as- 
signments are shown in Table 3. The first hex ad- 
dress in the Address Block column represents the 
base address for that particular block. 
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Table 3. AIP Address Assignments 


Assignment 
Block (ISA Bus) 
IDE Interface—Secondary Address Block 
| IDE Interface—Primary Address Block 
Game Port Chip Select 
| 220-227h Serial Port | 
Serial Port 
Serial Port 
82091 AA Configuration Registers—Primary Address Block (022-023h on X-Bus) 
Parallel Port 
Serial Port 
Serial Port 
| 338-33Fh Serial Port ; | 
ar | Floppy Disk Controller—Secondary Address Block (376h and 377h are Shared with 
the IDE Drive Interface Secondary Address) 
Parallel Port 
82091AA Configuration Registers—Secondary Address Block (024-025h on X-Bus) 
Parallel Port (All Mopes Except EPP) 
Serial Port 
: Floppy Disk Controller—Primary Address (3F6h and 3F7h are Shared with the IDE 
| Drive Interface Primary Address) 
Serial Port | 
Parallel Port (ECP Mode Peripheral Interface Protocol) 
Parallel Port (ECP Mode Peripheral Interface Protocol) 
Parallel Port (ECP Mode Peripheral Interface Protocol) 


NOTES: 

1. The 82091AA does not contain IDE: registers. However, the 82091AA provides the address block assignments for access- 
ing the IDE registers that are located in the IDE device. 

2. The standard PC/AT* compatible logical |/O address assignments are supported. For example, COM1 (8F8-3FFh) and 
COM2 (2F8-2FFh) are part of the serial port assignments and LPT1 (8BC-3BFh), LPT2 (378-—37Fh), and LPT3 
(278-27Fh) are part of the parallel port assignments. 


*Other brands and names are the property of their respective owners. 
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4.0 AIP CONFIGURATION 


82091AA configuration consists of setting up overall 
device operations along with certain functions 
pertaining to the individual 82091AA modules 
(parallel port, serial ports, floppy disk controller, and 
IDE interface). Overall device operations include 
selecting the clock frequency, power supply voltage, 
and address assignment for the configuration 
registers. Overall device operations also enable/ 
disable access to the configuration registers and 
provide interrupt signal level control. For the 
individual modules, 82091AA configuration includes 
module address assignment, interrupt control, 
module enable/disable, powerdown control, test 
mode control, module reset, and certain functions 
specific to each module. The remainder of the 
functions unique to each module are handled via the 
individual module registers. 


Two methods are provided for configuring the 
82091 AA—hardware configuration via strapping op- 
tions at powerup (or whenever RSTDRV is asserted) 
and software configuration by programming the con- 
figuration registers. (For information on hardware 
configuration, see Section 4.2, Hardware Configura- 
tion. For information on software configuration, see 
Section 4.1, Configuration Registers.) 


NOTE: 

1. There are four hardware configuration 
modes—SWMB (Software Motherboard), 
SWAI (Software Add-In), HWB (Hardware 
Basic), and HWE (Hardware Extended). 
Some of these modes can be used without 
the need for programming the 82091AA 
configuration registers. Other modes use 
both hardware configuration strapping op- 
tions and programming the configuration 
registers to set up the 82091AA. 

2. The 82091AA’s operating power supply 
voltage level, 82091AA clock frequency, 
and address assignment for the 82091AA 
configuration registers can only be config- 
ured by hardware configuration. 


4.1 Configuration Registers 
82091AA Configuration Space contains 13 configu- 


ration registers. Four of the registers (Product and 
Revision Identification Registers and the 82091AA 
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Configuration 1 and 2 Registers) provide control and 
status information for the entire chip. In addition, two 
registers each for the floppy disk controller, parallel 
port, serial port A, and serial port B and one register 
for the IDE interface provide certain module status 
and control information. The 82091AA configuration 
registers are indirectly addressed by first writing to 
the 82091AA Configuration Index Register as de- 
scribed in Section 4.1.1. Thus, the 13 configuration 
registers occupy two address locations in the host’s 
|/O address space—one for indirectly selecting the 
specific configuration register and the other for 
transfering register data. All 82091AA configuration 
registers are 8-bits wide and are accessed as byte 
quantities. 


Some of the 82091AA Configuration registers de- 
scribed in this section contain reserved bits. These 
bits are labeled ‘‘R’’. Software must deal correctly 
with fields that are reserved. On reads, software 
must uSe appropriate masks to extract the defined 
bits and not rely on reserved bits being any particu- 
lar value. On writes, software must ensure that the 
values of reserved bit positions are preserved. That 
is, the value of reserved bit positions must first be 
read, merged with the new values for other bit posi- 
tions, and then written back. 


In addition to reserved bits within a register, the 
82091AA configuration space contains address lo- 
cations that are labeled ‘‘Reserved”’ (Table 5). While 
the 82091AA responds to accesses to these I/O ad- 
dresses by completing the host cycle, writing to a 
reserved I/O address can result in unintended de- 
vice operations. Values read from a reserved |/O 
address should not be used to permit future expan- 
sion and upgrades. 


During a hard reset (RSTDRV asserted), the 
82091AA sets its configuration registers to pre-de- 
termined default states. The default values are indi- 
cated in the individual register descriptions. The fol- 
lowing nomenciature is used for register access at- 
tributes: 


RO Read Only. If a register is read only, writes 
have no effect. 


R/W Read/Write. A register with this attribute can 
be read and written. Note that individual bits in 
some read/write registers may be read only. 
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4.1.1 CFGINDX, CFGTRGT—CONFIGURATION INDEX REGISTER AND TARGET PORT 


1/O Address: Hardware Configurable (see Table 4) 
Default Value: OOh : 

Attribute: Read/Write 

Size: 8 bits 


CFGINDX and CFGTRGT are used to access 82091AA configuration space where all of the 82091 AA configu- 
ration registers are located. CFGINDX and CFGTRGT are located in the host I/O address space and the 
address locations are hardware configurable as shown in Table 4. CFGINDX is an 8-bit register that contains 
the address index of the 82091AA configuration register to be accessed. CFGTRGT is a port for reading data 
from or writing data to the configuration register whose index address matches the address stored in the 
_ CFGINDX Register. Thus, to access a configuration register, CFGINDX must first be programmed with the 
index address. A software example is provided in this section demonstrating how to access the configuration 
registers. 7 : . 


Address Selection 


Table 4. Configuration Register Access Addresses 


X-Bus Implementation 
| Target | index | Target 


Table 5 summarizes the 82091AA configuration space. Following the table, is a detailed description of each . 
register. The register descriptions are arranged in the order that they appear in Table 5. 


Description 


el 82091AA Configuration Register Address Index: Bits[7:0] correspond to SD[7:0]. 
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Software Configuration 


Access Addresses for the two Software Configuration Modes: 
index Target 


For SWMB Mode Primary Address: 22h 23h 
For SWMB Mode Secondary Address: 24h 25h 
For SWAI, HWE, and HWB Modes Primary Address: 26Eh 26Fh 


For SWAI, HWE, and HWB Modes Secondary Address: 398h 399h 


The following pseudo code sequence could be used to access the configuration registers under SWMB 
primary address: 


Configuration register write: OUT 22h, ConfigRegAddr 
OUT 23h, ConfigRegData 


Configuration register read: OUT 22h, ConfigRegAddr 
IN 23h 


Table 5. AIP Configuration Registers 


Abbreviation Register Name 


[a | Productideniicaion ——~SCSCS~*~rtCia 
Toh | APRev | Revisiontdenticaton ——~—SC~dtCiR 
8 a ee 
Tin | Force [Foc Power Menagemrentand Stas | RW 
ee 


82091AA 
Configuration 
Address Index 


7 
R/W 

ura 

PCFG2 Parallel Port Power Management and Status 

SACFG1 Serial Port A Configuration 

i i R/W 

R/W 

] — R/W 


a 
Caan | S801 Serial Pot Configuration SSC 
Tah | sacra | Seria Port 8 Power Managemertand Sats | RW 
a 
ec 


NOTE: 
_Writing to a reserved I/O address should not be attempted and can result in unintended device operations. 


R/W 
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4.1.2 AIPID—AIP IDENTIFICATION REGISTER 


Index Address: 00h 


Default Value: AOh 
Attribute: Read Only 


Size: 8 bits 


Description 


AIP IDENTIFICATION (AIPID): A value of AOh is assigned to the 82091AA. This 8-bit register 
combined with the 82091 AA Revision Identification Register uniquely identifies the device. 


4.1.3 AIPREV—AIP REVISION IDENTIFICATION 


Index Address: Oth 


Default Value: 00h 
Attribute: Read Only 
Size: 8 bits 


This register contains two fields that identify the revision of the 82091AA device. The revision number will be 
incremented for every stepping, even if change is invisible to software. 


0 Bit 


43 


7 
Step 
eee 
ees Revision Identification Number (RO) 
290486-6 


Figure 6. AIP Revision Identification Register 


| . Description 
STEP NUMBER: Contains the hexadecimal representation of the device stepping. 


DASH NUMBER: Contairis the hexadecimal representation of the dash number of the device 
stepping. 
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4.1.4 AIPCFG1—AIP CONFIGURATION 1 REGISTER 


Index Address: 02h 


Default Value: Depends upon hardware strap 
Attribute: Read/Write 
Size: 8 bits 


The AIPCFG1 Register enables/disables master clock circuitry for power management, enables/disables 
access to the configuration registers, and selects the 82091AA configuration mode. This register provides 
status for certain hardware configuration selections—the 82091AA clock frequency, power supply voltage, and 
address assignment for the configuration registers (address locations of the INDEX and TARGET Registers). 


7 6 5 4 3 2 +1 +O Bit 
po x] xx [x] a] a] 0] oetaun 


Clock Off (R/W) 
1=AIP Powered Off 
a 0=AIP Powered On 


Reserved 


Reserved 


Configuration Address Select (RO) 
1=Secondary Address (24h/25h for X-Bus and 
398h/399h for ISA Bus) 
0=Primary Address (22h/23h for X-Bus and 
26Eh/26Fh for ISA Bus) 
Configuration Mode Select (R/W) 
00=Software Motherboard 
01=Software Add-in 
10=Extended Hardware 
11=Basic Hardware 
Supply Voltage (RO) 
1=3.3 Volts * 
0=5.0 Volts | 
Not Used 


Always Write 0 
290486-7 


NOTES: 
*3.3V operation is available only in the 82091AA. 
X= Value is determined by hardware strapping options as described in Section 4.2, Hardware Configuration. 


Figure 7. AIP Configuration 1 Register 


ADVANCE INFORMATION 4-33 


82091AA — | intel ‘ 


| } Bit | Description 
NOT USED: Always write to 0. 


VOLTAGE SELECT (VSEL): This bit indicates whether 3.3V or 5V has been selected for the 
operating power supply voltage during hardware configuration. A 1 indicates that 3.3V is selected 
and a 0 indicates that 5V is selected. This bit is read only and writes have no effect. 

NOTE: 


3.3V operation is available only in the 82091AA. 


CONFIGURATION MONDE SELECT (CFGMOD): These bits indicate the configuration mode for the 
82091AA. After a hard reset, these bits reflect the mode selected by hardware configuration. If 
configuration register access is not locked out during hardware configuration, software can change 
the configuration mode by writing to this field. For configuration mode details, (see Section 4.2, 
Hardware Configuration). 


- Bits[5:4] Configuration Mode 
00 Software Motherboard (SWMB) 


01 Software Add-in (SWAI) 
10 Extended Hardware (HWE) 
11 Basic Hardware (HWB) 


Hardware configuration selects between primary addresses (22h/23h and 26Eh/26Fh) and 
secondary addresses (24h/25h and 398h/399h) for accessing the 82091AA configuration registers. 
When CFGADS = 0, the primary addresses are selected and when CFGADS = 1, the secondary 
addresses are selected. 


2 RESERVED 


| 1 | RESERVED 


CLOCK OFF (CLKOFF): The CLKOFF bit is used to implement clock circuitry power management. 


3 CONFIGURATION ADDRESS SELECT (CFGADS): This read only bit indicates the address 
assignment for the 82091AA configuration registers as selected by hardware configuration. 


When CLKOFF = 0, the main clock circuitry is powered on. When CLKOFF = 1, the main clock 
circuitry is powered off. This capability is independent of the 82091 AA’s powerdown state. Note that 
auto powerdown mode and powerdown have no effect over the power state of the clock Circuitry. 


4.1.5 AIPCFG2—AIP CONFIGURATION 2 REGISTER 


Index Address: 03h 


Default Value: = 0000 ORRR 
Attribute: Read/Write 


Size: | 8 bits | 


This register selects the active signal level for IRQ[7:3]. The interrupt signals can be individually programmed 
for either active high or active low drive characteristics. The active high mode is ISA (non-share) compatible 
and has tri-state drive characteristic. The active low mode is EISA (sharable) compatible and has an open 
collector drive characteristic. 
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Bit 


Default. 


Reserved 


IRQ3 Mode Select (R/W) 
1=Active Low (Open Collector Drive) 
0=Active High (Tri-state Drive) 


IRQ4 Mode Select (R/W) 
1=Active Low (Open Collector Drive) 
0=Active High (Tri-state Drive) 


IRQ5 Mode Select (R/W) 
1=Active Low (Open Collector Drive) 
0=Active High (Tri-state Drive) 


IRQ6 Mode Select (R/W) 
1=Active Low (Open Collector Drive) 
0=Active High (Tri-state Drive) 


IRQ7 Mode Select (R/W) 


1=Active Low (Open Collector Drive) 


0=Active High (Tri-state Drive) : 
290486-8 


Figure 8. AIP Configuration 2 Register 


Description 


IRQ7 MODE SELECT (IRQ7MOD): When IRQ7MOD = 0, IRQ7 is an active high tri-state drive signal. 
When IRQ7MOD = 1, IRQ7 is an active low open collector drive signal. 


IRQ6 MODE SELECT (IRQ6MOD): When IRQ6MOD = 0, IRQ6 is an active high tri-state drive signal. 
When IRQ6MOD = 1, IRQ6 is an active low open collector drive signal. 


IRQ5 MODE SELECT (IRQ5MOD): When IRQ5MOD = 0, IRQ5 is an active high tri-state drive signal. 
When IRQ5MOD = 1, IRQ85 is an active low open collector drive signal. . 


IRQ4 MODE SELECT (IRQ4MOD): When IRQ4MOD = 0, IRQ4 is an active high tri-state drive signal. 


When IRQ4MOD = 1, IRQ4 is an active low open collector drive signal. 


IRQ3 MODE SELECT (IRQ3MOD): When IRQ3MOD = 0, IRQ3 is an active high tri-state drive signal. 
When IRQ3MOD = 1, IRQ3 is an active low open collector drive signal. 


2:0 | RESERVED 
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(4.1.6 FCFG1—FDC CONFIGURATION REGISTER 


Index Address: 10h 


Default Value: ORRR RRO1 
Attribute: Read/Write 
Size: 8 bits 


_ This register selects between a 2 and 4 floppy drive system, selects primary/secondary ISA address range for 
the FDC, and enables/disables the FDC. All bits in this register are read/write. 


FDC Enable (R/W) 
1=Enable 
0=Disable 
FDC Address Select (R/W) 
1=Secondary FDC Address (370-377) 
- 0=Primary FDC Address (3F0-3F7) 


Reserved 


Floppy Disk Drive Quantity (R/W) 
1=Four Floppy Disk Drives (with External Decoder) 
0=Two Floppy Disk Drives (without External Decoder) 290486-9 


NOTES: 

*Default shown is for SWMB, SWAI, and HWB hardware configuration modes. For HWE, the default is determined by 
hardware strapping options as described in Section 4.2, Hardware Configuration. 
**Default shown is for SWMB and SWAI configuration modes. For HWB and HWE configuration modes, the default is 
determined by hardware strapping options as described in Section 4.2, Hardware Configuration. 


Figure 9. FDC Configuration Register 


Description 


FLOPPY DISK DRIVE QUANTITY (FDDQTY): This bit selects between two and four floppy disk 
drive capability. When FDDQTY = 0, the 82091AA can control two floppy disk drives directly without 
an external decoder. When FDDQTY = 1, the 82091 AA can control four floppy disk drives with an 
external decoder. When FDDQTY = 1, the PDEN feature in the powerdown command is disabled. 
For further details, see Appendix A, FDC Four Drive Support. This bit can be configured by hardware 
extended configuration (-HIWE) at powerup. For all other hardware configuration modes (SWMB, 
SWAI, and HWB), the floppy disk drive yan is not configurable by hardware strapping options 
and defaults to 2 drives. 


RESERVED 


FLOPPY DISK CONTROLLER ADDRESS SELECT (FADS): When FADS = 0, the primary FDC 
address (3F0-3F7) is selected. When FADS = 1, the secondary FDC address (370-377) is 
selected. For SWMB and SWAI configuration modes, the default is 0 (primary address). For HWB 
and HWE hardware configuration modes, the default is determined by signal pin strapping options. 


FLOPPY DISK CONTROLLER ENABLE (FEN): This bit enables/disables the FDC. When FEN = 1, 
the FDC is enabled. When FEN =0, the FDC module is disabled. For SWMB and SWAI configuration 
modes, the default is 1 (enabled). For HWB and HWE hardware configuration modes, the default is 
determined by signal pin strapping options. Note that, when the FDC is disabled, IRQ6 and FDDREQ 
are tri-stated. | 
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4.1.7 FCFG2—FDC POWER MANAGEMENT AND STATUS REGISTER 


Index Address: 1th 

Default Value: RRRR 0000 
Attribute: Read/Write 
Size: 8 bits 


This register enables/disables FDC auto powerdown and can place the FDC into direct powerdown. The 
register also provides FDC idle status and FDC reset control. 


Defauit 


FDC Direct Powerdown Control (R/W) 
1=Powerdown 
=Not in Direct Powerdown 


FDC Idle Status (RO) 


1=Idle 
0=Active 
FDC Reset (R/W) 
1=Reset FDC Module 
0=No FDC Module Reset 
FDC Auto Powerdown Enabie (R/W) 
1=Enable 
0=Disable 


Reserved . 290486-10 


Figure 10. FDC Power Management and Status Register 


-4 | RESERVED 


FLOPPY DISK AUTO POWERDOWN ENABLE (FAPDN): This bit is used to enable/disable auto 
powerdown for the FDC. When FAPDN = 1, the FDC will enter auto powerdown when the required 
conditions are met. When FAPDN=0, FDC auto powerdown is disabled. 


FLOPPY DISK CONTROLLER RESET (FRESET): FRESET is a reset for the FDC. When 

FRESET = 1, the FDC is reset (i.e., all programming and current state information is lost). 

FRESET = 1 has the same affect on the FDC as a hard reset (asserting the RSTDRV signal). When 
resetting the FDC via this configuration bit, the software must toggle this bit and ensure the reset 
active time (FRESET = 1) of 1.13 zs minimum is met. 


FLOPPY DISK CONTROLLER IDLE STATUS (FIDLE): When the FDC is in the idle state, this bit is 


set to 1 by the 82091AA hardware. In the idle state the FDC’s Main Status Register (MSR) = 80h, 
IRQ6 = inactive, and the head unload timer has expired. When the FDC exits its idle state, this bit is 
set to 0. This bit is read only. 


FLOPPY DISK CONTROLLER POWERDOWN (FDPDN): When FDPODN is set to 1, the FDC is _ 
placed in direct powerdown. Once in powerdown the following procedure should be used to ne 
the FDC out of powerdown: 
e Write this bit low 
e Apply a hardware reset (via bit 2 of this register) or a software reset (via either bit 2 of the FDC’s 
DOR or bit 7 of the FDC’s DSR). 
NOTE: 
A hard reset via the RSTDRV pin also removes the FDC powerdown. 
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4.1.8 PCFG1—PARALLEL PORT CONFIGURATION REGISTER 


Index Address: 20h 


Default Value: -OOOR 0000 
Attribute: Read/Write 
Size: 8 bits 


The PCFG1 Register enables/disables the parallel port, selects the parallel port address, and selects the 
parallel port interrupt. This register also selects the hardware operation mode for the parallel port. 


Default 


PP Enable (R/W) 
1=Enable 
0=Disable 


PP Address Select (R/W) 
00=378h-37Fh 
01=278h-27Fh 
10=3BCh-3BEh 
11=Reserved 


PP interrupt Select (R/W) 


1=IRQ7 
0=IRQ5. 


Reserved 


PP Hardware Mode Select (R/W) 
See Text Description 


PP FIFO Threshold Select (R/W) 
1=1 (forward), 15 (reverse) 


0=8 (forward and reverse) 
: ? 290486-11 


NOTES: . 
| *Default shown is for SWMB and SWA! configuration modes. For HWB and HWE modes, the default is determined by 

hardware configuration options as described in Section 4.2, Hardware Configuration. 

**Default shown is for SWMB, SWAI, and HWB configuration modes. For HWE mode, the default is determined by 

hardware configuration options as described in Section 4.2, Hardware Configuration. 


Figure 11. Parallel Port Configuration Register 
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3. Parallel port interface signals controlled by the PCON Register (SELECTIN #, INIT#, AUTOFD#, 


PARALLEL PORT FIFO THRESHOLD SELECT (PTHRSEL): This bit controls the FIFO threshold 
| and STROBE #) should be negated before entering EPP mode. 


NOTE: 


In the reverse direction, a threshold of 15/8 means that a request (DMA or Interrupt is 
enabled) is generated when 15/8 bytes are in the FIFO. In the forward direction, a threshold 
of 1/8 means that a request is generated when 1/8 byte locations are available. 


PARALLEL PORT HARDWARE MODE SELECT (PPHMOD): This field selects the parallel port 
hardware mode. The ISA-Compatible mode is for compatibility and nibble mode peripheral interface 
protocols. The PS/2-Compatible mode is for the byte mode peripheral interface protocol. The EPP 
and ECP modes are for the EPP and ECP mode peripheral interface protocols, respectively. This 
field can be configured by strapping options at powerup for hardware extended configuration (HWE) 
mode only. For all other hardware configuration modes (SWMB, SWAI, and HWB), the default is 00 
(ISA-Compatible). 


PE 


Bits [6:5] Read Write 
00 ISA-Compatible ISA-Compatible(™) 
01 PS/2-Compatible | PS/2-Compatible() 
10 EPP EPp(1, 3) 
11 ECP(2) Reserved; do not write(2) 
NOTES: 


1. ISA-Compatible, PS/2-Compatible, and EPP modes are selected via this field or hardware 
configuration. In addition, ISA-Compatible and PS/2-Compatible modes can be selected via the 
ECP Extended Control Register (ECR). When the ECR is programmed for one of these two 
modes (ECR[7:5] = 000, 001), this field is updated to match the selected mode. 

2. ECP Mode can not be entered by programming this field. ECP Mode can only be selected through 
the ECR. When the ECR is programmed for ECP mode, the 82091AA sets this field to 11. 


and only affects parallel port operations when the parallel port is in ECP mode or ISA-Compatible 

FIFO mode. When PTHRSEL = 1, the FIFO threshhold is 1 in the forward direction and 15 in the 

reverse direction. When PTHRSEL=0, the FIFO threshold is 8 in both directions. This bit can only 

be programmed when the parallel port is in ISA-Compatible or PS/2-Compatible mode. These 

modes can be selected via bits[6:5] of this register or the ECP Extended Control Register (ECR). 
PARALLEL PORT IRQ SELECT (PIRQSEL): When PIRQSEL = 1, IRQ7 is selected as the parallel 
port interrupt. When PIRQSEL =0, IRQ5 is selected as the parallel port interrupt. This field can be 
configured by strapping options at powerup for HWB and HWE modes only. For ail other hardware 
configuration modes (SWMB and SWAI), the default is 0 (IRQ5). 
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| Description - | 
PARALLEL PORT ADDRESS SELECT (PADS): This field selects the address for the parallel port 


‘as follows: 


Bits[2:1] Address Parallel Port Hardware Mode 
~00 378-37F All 


01 278-~—27F All 
10 3BC-3BE All except EPP 
11. Reserved None, do not write 


This field can be configured by strapping options at powerup for HWB and HWE modes only. For all 
other hardware configuration modes (SWMB and SWA)), the default is 00 (8378h-37Fh). Note that 
the SWMB and SWAI default settings for PIRQSEL (bit 3) and PADS (bits[2,1]) do not match a 
standard PC/AT* combination for address assignment and interrupt setting. However, for SWMB 
and SWAI, the parallel port defaults to a disabled condition and this register must be programmed to - 
enable the parallel port (i.e., bit 0 set to 1). At this time, the selections for interrupt and address 
assignments should be made. 


PARALLEL PORT ENABLE (PEN): When PEN = 0, the parallel port is disabled. When PEN = 1, the 
parallel port is enabled. This bit can be configured by hardware strapping options at powerup for 
-HWB and HWE modes only. For all other hardware configuration modes (SWMB and SWAI), the 
default is 0 (disabled). Note that when the parallel port is disabled, IRQ(7,5] and PPDREQ are tri- 
stated. 


4.1.9 PCFG2—PARALLEL PORT POWER MANAGEMENT AND STATUS REGISTER 


Index Address: 21h 


Default Value: RROR 0000 
Attribute: Read/Write 
Size: 8 bits 


This register enables/disables parallel port auto powerdown and can place the parallel port into a powerdown 
mode directly. The register also provides parallel port idle status, resets the parallel port, and reports FIFO 
underrun or overrun errors. 


*Other brands and names are the property of their respective owners. 
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PP Port Direct Powerdown (R/W) 


1=Enabled 
0=Disabled 


PP Idle Status (RO) 
1=Idle 
0=Active 


PP Reset (R/W) 
1=Active 
0=Inactive 
PP Auto Powerdown Enable (R/W) 
1=Enable 
0=Disable 
Reserved 
PP FIFO Error Status (RO) 
1=Underrun or Overrun 
O=No Underrun or Overrun 


Reserved 


290486-12 


Figure 12. Parallel Port Power Management and Status Register 


es Description 
RESERVED 


PARALLEL PORT FIFO ERROR STATUS (PFERR): When PFERR = 1, a FIFO underrun or overrun 
condition has occurred. This bit is read only. Setting PRESET to 1 clears this bit to 0. 


fess 


PARALLEL PORT AUTO POWERDOWN ENABLE (PAPDN): When PAPDN = 1, the parallel port 
can enter auto powerdown if the required auto powerdown conditions are met. When PAPDN = 0, 
auto powerdown is disabled. 


PARALLEL PORT RESET (PRESET): When PRESET is set to 1, the parallel port is reset (i.e., all 
programming and current state information is lost). This is the same state the module would be in 


after a hard reset (RSTDRV asserted) to the 82091AA. When resetting the parallel port via this 
configuration bit, the software must toggle this bit and ensure the reset active time (PRESET = 1) of 
1.13 ws minimum is met. 


PARALLEL PORT IDLE STATUS (PIDLE): This bit reflects the idle state of the parallel port. When 

the parallel port is in an idle state (i.e., when the same conditions are met that apply to entering auto 
powerdown) the 82091AA sets this bit to 1. The parallel port idle state is defined as the FIFO empty 
and no activity on the parallel port interface. This bit is read only. 


PARALLEL PORT DIRECT POWERDOWN (PDPDN): When PDPDN is set to 1, the parallel port 
enters direct powerdown. When PDPDN is set to 0, the parallel port is not in direct powerdown. Note 
that a parallel port module reset (PRESET bit in this register) also brings the parallel port out of the 
direct powerdown state. 
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4.1.10 SACFG1—SERIAL PORT A CONFIGURATION REGISTER 


Index Address: 30h 


Default Value: ORRO 0000 
Attribute: Read/Write 
Size: 8 bits 


The SACFG1 register enables/disables Serial Port A, selects the Serial Port A address range, and selects 
between IRQ3 and IRQ4 as the Serial Port A interrupt. This register also selects the appropriate clock frequen- 
cy for use with MIDI. 


NOTES: 

1. Through programming of this register and the SBCFG1 Register, the 82091AA permits serial ports A 
and B to be configured for the same interrupt assignment. However, software must take care in 
responding to interrupts correctly. 

2. It is possible to enable and assign both serial ports to the same address through software. In this 
configuration, the 82091AA disables serial port B, but does not set serial port B into it’s powerdown 
condition. Although this is a safe configuration for the 82091AA, it is not power conservative and is 
not recommended. 


Default 


Serial Port A Enable (R/W) 
1=Enable 
0=Disable 


Serial Port A Address Select (R/W) 
(ISA Address Range) 

000=3F8-3FFh 

001=2F8-2FFh 

010=220-227h 

011=228-22Fh 

100=238-23Fh 

101=2E8-2EFh 

110=338-33Fh 

111=3E8-3EFh 


Serial Port A IRQ Select (R/W) 
1=IRQ4 
=IRQ3 


Reserved 


MIDI Clock Enable for Serial Port A (R/W) 
1=2 MHz Clock for Serial Port A (used for generating MIDI baud rate) 
0=1.8462 MHz Clock for Serial Port A 


- 290486-13 


NOTE: 
*Default shown is for SWMB and SWAI hardware configuration modes. For HWB and HWE modes, the default is deter- 
mined by hardware strapping options as described in Section 4.2, Hardware Configuration. 


Figure 13. Serial Port A Configuration Register 
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MIDI CLOCK FOR SERIAL PORT A ENABLE (SAMIDI): When SAMIDI = 1, the clock into Serial 
Port A is changed from 1.8462 MHz—2 MHz. The 2 MHz clock is needed to generate the MIDI baud 
rate. When SAMIDI=0, the clock frequency is 1.8462 MHz. 


res RESERVED SOSC“~S~S~S~S 


SERIAL PORT A IRQ SELECT (SAIRQSEL): When SAIRQSEL=0, IRQ3 is selected for the Serial 
Port A interrupt. When SAIRQSEL = 1, IRQ4 is selected for the Serial Port A interrupt. This bit can be 
configured by strapping options at powerup for HWB and HWE modes only. For SWMB and SWAI 
hardware configuration modes, the default is 0 (IRQ3). Note that, while the default address and IRQ 
assignments for SWMB and SWAI modes are the same for both serial ports, the serial ports are 
disabled and programming of this register is required for operation. 


SERIAL PORT A ADDRESS SELECT (SAADS): This field selects the ISA address range for Serial 
Port A as follows: 


Bits[3:1] ISA Address Range 


000 3F8-—3FFh 

001 2F8-2FFh 

010 220-227h 

0114 228-22Fh 

100 238-23Fh 

101 2E8-2EFh 

110 338-33Fh 

111 SE8-3EFh 
This field can be configured by strapping options at powerup for HWB and HWE modes only. For 
SWMB and SWAI hardware configuration modes, the default is O00 (3F8-3FFh). Note that, while 
the default address and IRQ assignments for SWMB and SWAI modes are the same for both serial 
ports, the serial ports are disabled and programming of this register is required for operation. 


SERIAL PORT A ENABLE (SAEN): When SAEN = 1, Serial Port A is enabled. When SAEN = 0, 
Serial Port A is disabled. This bit can be configured by strapping options at powerup for HWB and 
HWE modes only. For SWMB and SWAI hardware configuration modes, the default is 0 (disabled). 


4.1.11 SACFG2—SERIAL PORT A POWER MANAGEMENT AND STATUS REGISTER 


Index Address: 31h 


Default Value: RRRO OOUO 
Attribute: Read/Write 
Size: 8 bits 


This register enables/disables the Serial Port A module auto powerdown and can place the module into a 
direct powerdown mode. The register also provides Serial Port A idle status, resets the Serial Port A module, 
and places Serial Port A into test mode. 
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Default 


Serial Port A Direct Powerdown (R/W) 
1=Enable 
_0=Disable 
Serial Port A Idle Status (RO) 
1=Idle 
0=Active 
Serial Port A Reset (R/W) 


1=Reset Active 
0=Reset Inactive 
Serial Port A Auto Powerdown Enable (R/W) 
1=Enable 
0=Disable 
— Serial Port A Test Mode (R/W) 
1=Enable 
0=Disable 


Reserved 
290486-14 


NOTE: 
U = Undefined 


Figure 14. Serial Port A Power Management and Status Register 


Description 


RESERVED 


SERIAL PORT A TEST MODE (SATEST): The serial port test mode provides user access to the 
output of the baud out generator. When SATEST = 1 (and the DLAB bit is 1 in the LCR), the Serial 
Port A test mode is enabled and the baud rate clock is output on the SOUTA pin (Figure 15). When 
SATEST =0, the Serial Port A test mode is disabled. 
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BAUDOUT 
(+1) 


BAUDOUT 
(+2) 


BAUDOUT 
(+3) 


BAUDOUT 
(+N, N>3) 


SERIAL PORT A AUTO POWERDOWN ENABLE (SAAPDN): This bit enables/disables auto 
powerdown. When SAAPDN = 1, Serial Port A can enter auto powerdown if the required conditions 

are met. The required conditions are that the transmit and receive FIFOs are empty and the timeout 
counter has expired. When SAAPDN = 0, auto powerdown is disabled. 


SERIAL PORT A RESET (SARESET): When SARESET = 1, the Serial Port A module is reset (i.e. all 
programming and current state information is lost). This is the same state the module would bein 
after a hard reset (RSTDRV asserted). When resetting the serial port via this configuration bit, the 
software must toggle this bit and ensure the reset active time (SARESET = 1) of 1.13 ws minimum is 
met. 


SERIAL PORT A IDLE STATUS (SAIDLE): When Serial Port A is in an idle state the 82091AA sets 
this bit to 1. Serial Port A is in the idle state when the transmit and receive FIFOs are empty and the 
timeout counter has expired. Note that these are the same conditions that apply to entering auto 
powerdown. When serial port A is not in an idle state, the 82091AA sets this bit to 0. Direct 
powerdown does not affect this bit and in auto powerdown SAIDLE is only set to a 1 if the receive 
and transmit FIFOs are empty. This bit is read only. 


During a hard reset (RSTDRV asserted), The 82091AA sets SAIDLE to 0. However, because the 
serial port is typically initialized by software before the idle conditions are met, the default state is 
shown as undefined. 


SERIAL PORT A DIRECT POWERDOWN (SADPDN): When SADPDN = 1, Serial Port A is placed in 
direct powerdown mode. Setting this bit to 0 brings Serial Port A out of direct powerdown mode. 
Setting bit 2 (SARESET) of this register to 1 will also bring Serial Port A out of the direct powerdown 
mode. 


NOTE: 


Direct powerdown resets the receiver and transmitter portions of the serial port including the 
receive and transmit FIFOs. To ensure that the resetting of the FIFOs does not cause data 
loss, the SAIDLE bit should be 1 before placing the serial port into direct powerdown. 
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4.1.12 SBCFG1—SERIAL PORT B CONFIGURATION REGISTER 


Index Address: 40h 


Default Value: — ORRO 0000 
Attribute: Read/Write 


Size: 8 bits 


The SBCFG1 register enables/disables Serial Port B, selects the Serial Port B address range, and selects 
between IRQ3 and IRQ4 as the Serial Port B interrupt. This register also selects the appropriate clock frequen- 
cy for use with MIDI. : 


NOTES: 

1. Through programming of this register and the SBCFG1 Register, the 82091AA permits serial ports A | 
and B to be configured for the same interrupt assignment. However, software must take care in 
responding to interrupts correctly. 

2. It is possible to enable and assign both serial ports to the same address through software. In this 
configuration, the 82091AA disables serial port B, but does not set serial port B into it’s powerdown 
condition. Although this is a safe conmnanrauon for the 82091AA, it is not power conservative and is 
not recommended. 


Default 


Serial Port B Enable (R/W) 
1=Enable 
0=Disable 


Serial Port B Address Select (R/W) 
(ISA Address Range) 

000=3F8-3FFh 

001=2F8-2FFh 

010=220-227h 

011=228-22Fh 

100=238-23Fh 

101=2E8-2EFh 

110=338-33Fh 

111=3E8-3EFh 


Serial Port B IRQ Select (R/W) 
1=IRQ4 — 
0=IRQ3 


- Reserved 


MIDI Clock Enable for Serial Port B (R/W) 
1=2 MHz Clock for Serial Port B (used for generating MIDI baud rate) 
0=1.8462 MHz Clock for Serial Port B 


290486-16 


NOTE: . 
*Default shown is for SWMB and SWAI hardware configuration modes. For HWB and HWE modes, the default is 
determined by hardware strapping options as described in Section 4.2, Hardware Configuration. 


Figure 16. Serial Port B Configuration Register 
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MIDI CLOCK FOR SERIAL PORT B ENABLE (SBMIDI): When SBMIDI= 1, the clock into Serial 
Port B is changed from 1.8462 MHz to 2 MHz. The 2 MHz clock is needed to generate the MIDI baud 
rate. When SBMIDI=0, the clock frequency is 1.8462 MHz. The default value is 0. 


SERIAL PORT B IRQ SELECT (SBIRQSEL): When SBIRQSEL = 0, IRQ3 is selected for the Serial 
Port B interrupt. When SBIRQSEL = 1, IRQ4 is selected for the Serial Port B interrupt. The default 
value is 0. This bit can be configured by strapping options at powerup for HWB and HWE modes 
only. For SWMB and SWAI configuration modes, the default is 0 (IRQ3). Note that, while the default 
address and IRQ assignments for SWMB and SWAI modes are the same for both serial ports, the 
serial ports are disabled and programming of this register is required for operation. 


SERIAL PORT B ADDRESS SELECT (SBADS): This field selects the ISA address range for Serial 
Port B as follows: 


Bits[3:1] ISA Address Range 


000 3F8-3FFh 

001 2F8-2FFh 

010 220-227h 

011 228-22Fh 

100 238-23Fh 

101 2E8-2EFh 

110 338-33Fh 

111 3E8-3EFh 
This field can be configured by strapping options at powerup for HWB and HWE modes only. For 
SWMB and SWAI configuration modes, the default is O00 (3F8-3FFh). Note that, while the default 
address and IRQ assignments for SWMB and SWAI modes are the same for both serial ports, the 
serial ports are disabled and programming of this register is required for operation. 


SERIAL PORT B ENABLE (SBEN): When SBEN = 1, Serial Port B is enabled. When SAEN = 0, 
Serial Port B is disabled. This bit can be configured by strapping options at powerup for HWB and 
HWE modes only. For SWMB and SWAI configuration modes, the default is 0 (disabled). 
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4.1.13 SBCFG2—SERIAL PORT B POWER MANAGEMENT AND STATUS REGISTER 


Index Address: 41h 


Default Value: RRRO O0OUO 
Attribute: Read/Write 


Size: 8 bits 
This register enables/disables the Serial Port B module auto powerdown and can place the module into a 


powerdown mode directly. The register also provides Serial Port B idle status, resets the Serial Port B module, 
and enables/disables Serial Port B test mode. 


7 +2,5 4 38 2 1 O Bit 


Default 


Serial Port B Direct Powerdown (R/W) 
1=Enable 
0=Disable 
Serial Port B Idle Status (RO) . 
1=Idle 
0=Active 
Serial Port B Reset (R/W) 
1=Reset Active 
0=Reset Inactive 
Serial Port B Auto Powerdown Enable (R/W) 
1=Enable 
Q=Disable — 

— Serial Port B Test Mode (R/W) 

1=Enable | | 

0=Disable 
Reserved 
290486-17 


NOTE: 
U = Undefined 


Figure 17. Serial Port B Power Management and Status Register 
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RESERVED 


SERIAL PORT B TEST MODE (SBTEST): The serial port test mode provides user access to the 
output of the baud out generator. When SBTEST = 1 (and the DLAB bit is 1 in the LCR), the Serial 
Port B test mode is enabled and the baud rate clock is output on the SOUTB pin (Figure 15). When 
SBTEST =0, the Serial Port B test mode is disabled. 


SERIAL PORT B AUTO POWERDOWN ENABLE (SBAPDN): This bit enables/disables auto 

powerdown. When SBAPDN = 1, Serial Port B can enter auto powerdown if the required conditions 
are met. The required conditions are that the transmit and receive FIFOs are empty and the timeout 
counter has expired. When SBAPDN = 0, auto powerdown is disabled. 


SERIAL PORT B RESET (SBRESET): When SBRESET = 1, Serial Port B is reset (i.e., all 
programming and current state information is lost). This is the same state the module would be in 
after a hard reset (RSTDRV asserted). When resetting the serial port via this configuration bit, the 
software must toggle this bit and ensure the reset active time (SBRESET = 1) of 1.13 ws minimum is 
met. 


SERIAL PORT B IDLE STATUS (SBIDLE): When Serial Port B is in an idle state the 82091AA sets 
this bit to 1. Serial Port B is in the idle state when the transmit and receive FIFOs are empty and the 
timeout counter has expired. Note that these are the same conditions that apply to entering auto 
powerdown. When serial port B is not in an idle state, the 82091AA sets this bit to 0. Direct 
powerdown does not affect this bit and in auto powerdown, this bit is only set to a 1 if the receive 
and transmit FIFOs are empty. This bit is read only. 


During a hard reset (RSTDRV asserted), the 82091AA sets this bit to 0. However, because the serial 


port is typically initialized by software before the idle conditions are met, the defaullt state is shown 
as undefined. 


SERIAL PORT B DIRECT POWERDOWN (SBDPDN): When SBDPDN = 1, Serial Port B is placed in 
powerdown mode. Setting this bit to 0 brings the module out of direct powerdown mode. Setting bit 2 
(SBRESET) of this register to 1 will also bring Serial Port B out of the direct powerdown mode. 


NOTE: 


Direct powerdown resets the receiver and transmitter portions of the serial port including the 
receive and transmit FIFOs. To ensure that the resetting of the FIFOs does not cause data 
loss, the SBIDLE bit should be 1 before placing the serial port into direct powerdown. 
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4.1.14 IDECFG—IDE CONFIGURATION REGISTER 


Index Address: 50h 


Default Value: RRRR ROO1 
Attribute: Read/Write 
Size: : 8 bits 


The IDECFG Register sets up the 82091AA IDE interface. This register enables the IDE interface and selects 
the address for accessing the IDE:. ) 


Default 


IDE Interface Enable (R/W) 
1=Enable 
0=Disable 


IDE Address Select (R/W) 


1=Secondary IDE Address (170-177, 376, 377) 
0=Primary IDE Address (1F0-1F7, 3F6, 3F7) 


IDE Dual Interface Select (R/W) 
1=Primary and Secondary Addresses Selected 
0=Dual Interface Disabied 


Reserved 
290486-18 


NOTES: . 
* Default shown is for SWMB and SWAI configuration modes. For HWB and HWE hardware configuration modes, the 
default is determined by hardware strapping options as described in Section 4.2, Hardware Configuration. 
** Not hardware configurable. 


Figure 18. IDE Configuration Register 


Description 
RESERVED 


IDE DUAL SELECT (IDUAL): When IDUAL =0, the IDE address selection is determined by the 
IADS bit. When IDUAL = 4, both the primary and secondary IDE addresses are selected ae the 
setting of the IADS bit does not affect IDE address selection. 


IDE ADDRESS SELECT (IADS): When IADS=0, the primary IDE address is selected (1FOh-—1F7h, 
3F6h, 3F7h ). When IADS = 1, the secondary IDE address is selected (1FOh-1F7h, 376h, 377h). For 
all hardware configuration modes (SWMB, SWAI, HWB, and HWE), the default is determined by 
signal pin strapping options. 


IDE INTERFACE ENABLE (IEN): When IEN=0, the IDE. interface is disabled (i.e., the IDE chip 
selects (IDECS[1:0]), DEN#, and HEN# are negated (remain inactive) for accesses to the IDE 
primary and secondary addresses). When IEN = 1, the IDE interface is enabled. For all hardware 
configuration modes (SWMB, SWAI, HWB, and HWE), the default is determined by signal ii 
strapping options. 
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4.2 Hardware Configuration 


Hardware configuration provides a mechanism for 
configuring certain 82091AA operations at powerup. 
Four hardware configuration modes provide different 
levels of configuration depending on the type of ap- 
plication and the degree of hardware/software con- 
figuration desired. The hardware configuration 
modes are: 


e Software Motherboard (SWMB) 
e Software Add-In (SWAI) 

e Hardware Extended (HWE) 

e Hardware Basic (HWB) 


These modes support a variety of system implemen- 
tations. For example, with Hardware Basic (HWB) 
and Hardware Extended (HWE) modes, an exten- 
sive set of 82091AA configuration options are avail- 
able for setting up the 82091AA at powerup. This 
permits the 82091AA to be used in systems without 
82091AA software drivers. For many of these sys- 
tems, access to the 82091AA configuration registers 
may not be necessary. As such, access to these 
registers can be disabled via hardware configura- 
tion. This option could be used to prevent software 
from inadvertently re-configuring the 82091AA. _ 


NOTE: 

If the 82091AA is configured in HWB or 
HWE configuration mode at powerup, and 
reconfiguration with software is desired, the 
82091AA configuration mode must first be 
changed to SWAI configuration mode by 
writing the AIPCFG1 register. The 82091AA 
can then remain in SWAI configuration mode 
to accomodate software programmable con- 
figuration changes as desired. 


Software Motherboard (SWMB) and Software Adda- 
In (SWAI) modes provide a minimum hardware con- 
figuration in systems where software/firmware driv- 
ers are used for configuration. Because access to 
the 82091AA configuration registers after powerup/ 
hardware configuration is needed, the SWMB and 
SWAI modes do not provide disabling access to 
these registers (i.e., the strapping of the HEN# sig- 
nal has no effect). 


The desired hardware configuration mode and op- 


tions within the mode are selected by strapping cer- 
tain 82091AA signal pins at powerup. These signal 
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pins are sampled when the 82091AA receives a 
hard reset (via RSTDRV). This section describes 
how to select the configuration mode and options 
within the mode. The section also provides example 
hardware connection diagrams for the different 
modes. | 


4.2.1 SELECTING THE HARDWARE 
CONFIGURATION MODE 


During powerup or a hard reset, four signal pins 
(DEN#, PPDIR/GCS#, DTRA, and HEN#) select 
the hardware configuration mode, I/O address as- 
signment for the 82091AA configuration registers, 
and whether software access to these configuration 
registers is permitted. The following mnemonics and 
signal pins are assigned for these functions: 


CFGMOD[1,0] Hardware Configuration Mode. 
The 82091AA samples _ the 
CFGMODO (DEN #) and CFGMOD1 
(PPDIR/GCS #) signal pins to select 
one of the four hardware configura- 
tion modes as shown in Table 6. 


82091AA Configuration Register 
Address Assignment. The 
82091AA samples the DTRA# sig- 
nal (CFGADS function) to determine 
the address assignment of the 
82091AA configuration registers as 
shown in Table 6. CFGADS works in 
conjunction with CFGDIS. Note that 
the 82091AA configuration register 
address assignment for Hardware 
Basic mode is not selectable. 


82091AA Configuration Register 
Disable. The 82091AA samples 
CFGDIS (HEN# signal) to enable/ 
disable access to the 82091AA con- 
figuration registers as shown in Ta- 
ble 6. Note that CFGDIS only affects 
the HWE and HWB modes. 


CFGADS 


CFGDIS 


NOTE: 

For Extended Hardware Configuration, the 
time immediately following the RSTDRV 
pulse is required to complete the configura- 
tion time. If IORC#/IOWC# are asserted 
during this time, IOCHRDY will be negated 
(wait-states inserted) until the 82091AA con- 
figuration time expires. 
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CFGDIS 
(HEN #) 


CFGMODO 
(DEN #) 


CFGMOD1 
(PPDIR) 


| © 


—~>~> il a 1/OhroO;ro;=s li —-s |o 


4.2.2 SELECTING HARDWARE: 
CONFIGURATION MODE OPTIONS 


Within each hardware configuration mode, a number 
of options are available. For the HWB and HWE 
hardware configuration modes, the user can enable/ 


[oe 


- intel. 


Table 6. AllP Configuration Mode Register Address Assignment 


Configuration 
Register ISA 
Address 
(INDEX/TARGET) 


Access Disabled — 


disable the floppy disk controller and the IDE inter- 
face via the IDE chip select pins (see Table 7). If 
enabled, these signal pins also select the address 
assignment. For SWMB and SWAI configuration 


' modes, these signal pins have no effect. 


Table 7. FDC and IDE Enable/Disable 


DDCFGO 
(IDECSO#) 


‘| DDCFG1 
(IDECS1#) 
a oe 


The 82091AA provides additional hardware configu- 
ration options through the SOUTA, SOUTB, RTSA#, 
RTSB#, DTRA#, and DTRB# signal pins as shown 
in Table 8. In the case of the Hardware Extended 
Mode, the 82091AA samples the signal pins at two 
different times (once for HWEa options and again for 
HWEb options). The timing for signal sampling is dis- 
cussed in Section 4.2.3, Hardware Configuration 
Timing Relationships. The options provide configura- 
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tion of the serial ports, floppy disk controller, parallel 
port, IDE interface, 82091AA operating power supply 
voltage, 82091AA clock frequency, and address as- 
signment for the 82091AA configuration registers. 
Table 8 provides a matrix of the options available for 
each hardware configuration mode. The configura- 
tion options are selected as shown in Table 8 
through Table 14. 
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Note that for the SWAI and SWMB modes, the se- 
lection of the operating frequency (CLKSEL), power 
supply voltage level (VSEL), and 82091AA configu- 
ration register address assignment (CFGADS) are 
the only hardware configuration options (Table 8). In 
these modes, software/firmware provides the re- 
mainder of the 82091AA configuration by program- 
ming the 82091AA configuration registers (see Sec- 
tion 4.1, Configuration Registers). For the SWAI 


82091AA 


and SWMB modes, the 82091AA modules are 
placed in the following states after powerup or a 
hard reset: | 


¢ Serial ports disabled 

e Parallel port disabled 

¢ FDC enabled for two drives (primary address) 
e IDE enabled (primary address) 


Table &. Hardware Configuration Mode Option Matrix 


Basic Hardware 
Configuration 


Signal 
Name 


f 


NOTES: 


Extended Hardware 
Configuration 


[awe | swat SWMB 
sours | secre: | pemooo | sporcn | — «iS 
Carses [spores | Fooary | srorcs | — 


Software 
MotherBoard 
Configuration 


Software Add-In 
Configuration 


1. HWEa and HWEb reference the switching banks shown in Figure 22. 

2. The following mnemonics are used in the table: SPCFGx=serial port configuration, PPCFGx= parallel port configuration, 
CLKSEL= clock select, PPMODx= parallel port hardware mode, FDDQTY = floppy disk drive quantity, VSEL= power sup- 
ply voltage select, CFGADS = 82091AA configuration register address assignment select. 


3. Always tie this signal low with a 10K resistor. 
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| Table 9. Serial Port Address and Interrupt Assignments 


a 
| en 

Assignment | Assignment | Assignment | Assignment 

| 0 | Disable |  — | Disable | — | 
[1 | pisatie | — (| aFe-arrh | nae 
[0 | Disebie | — | afe-orrn | inca 
P+ | Disable | — | 960-a6rh | iROs 
[0 | aFe-arrh | iRae | Disable | — 
[1 [aee-sern [mas | Disabie | 
[0 | aFe-arrn | ina | 2re-arrn | RCO 
[0 | areerrn | imas | Disab | 
[0 | disable | — | 2e0-26rn | ince | 
[1 | areoren | imas | aeecern | IRQs 
| 0 | aee-2eFn | IRQs | Disable | 
P+ | ee-2ern | ios | aFe-arrh | iRae 
Po | ee-2ern | ios’) | are-zrrn | imaat) 
[1 | aee-2ern [tras | aee-aern | ios 

NOTE: a 


1. In this configuration, the two serial ports share the same interrupt line. Responding correctly to interrupts generated in this 
configuration is the exclusive resporisibility of software. 


 SerialPortA 


salIualolojalalolo{[a|/=|lojo}j=|alolo 


Table 10. Parallel Port Address and Interrupt Assignments 


Assignment Assignment 
a 
| e7e-a7Fh | RO 


PPCFG1 (DTRB#) PPCFGO (DTRA#) 
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Table 11. Parallel Port Hardware Mode Select 


EAs 
(RTSA#) (SOUTB) 

Po | 0 | SAConpatbie 
To | Ps r2-Compatie 
ee 


NOTES: 

1. PPMODx hardware configuration is effective in HWE 
mode only. 

2. ECP mode is not selectable via hardware configuration. 

3. For EPP mode, address assignment must be either 278h 
or 378h. 


Table 12. AIP Clock Select 


CLKSEL(SOUTA) | si 
ol a 


NOTE: 
Always tie this low. 


Table 13. AIP Power Supply Voltage 


VSEL Power Supply Voltage 


(DTRB#) 
5.0V Operation 


3.3V Operation 


NOTES: 

1. VSEL hardware configuration is not available in HWB 
mode only. 

2. To operate the 82091AA and all of the interfaces at 5V 
or 3.3V, both Vcc and Vocr are connected to 5V or 
3.3V power supplies, respectively. However, in the 
mixed mode, hardware configuration (Vse,) is set to 
3.3V, Vcc is connected to 3.3V, and Vccr connected to 
5V. 

3. 3.3V operation is available only in the 82091AA. 
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Table 14. Floppy Drive Quantity Select 


FDDQTY Number of Supported 
(RTSB #) Floppy Drives 


NOTES: 

1. FDDQTY hardware configuration is effective in HWE 
mode only. 

2. Four floppy drive support requires external logic to de- 
code. 


4.2.3 HARDWARE CONFIGURATION TIMING 
RELATIONSHIPS 


The 82091AA samples all of the hardware configu- 
ration signals on the high-to-low transition of 
RSTDRV. For the HWB, SWMB, and SWAI modes, 
the 82091AA completes hardware configuration on 
this sampling (Figure 19). For HWE mode, the 
82091AA samples some of the signals twice (Figure 
20). The first sampling occurs on the high-to-low 
transition of RSTDRV. As Figure 22 shows (see Sec- 
tion 4.2.5, Extended Hardware Configuration Mode), 
the HC367 tri-states its outputs when RSTDRV is 
negated. This permits the strapping options from the 
HWEb block to be sampled. A short time after 
RSTDRV is negated (the time is specified in Section 
11.0, Electrical Characteristics), the 82091AA sam- 
ples the SOUTA, RTSA#, DTRA#, SOUTB, 
RTSB#, and DTRB# signals. 
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All Hardware. 
Configuration 
Signals Sampled 


RSTDRV 


Hardware - . , , : 
Configuration Driven by pullup/pulldown X AIP Operational 


Signals ~- 


290486-19 


Figure 19. HWB, SWMB, and SWAI Hardware Configuration Mode Timing 


HWEa, CFGMODT[1,0], 
VSEL, CFGDIS, CLKSEL, HWEb, 
DDCFG[1,0],. CFGADS, SPCFG[3:0], 
PPMODE[1,0], FDDQTM PPCFG[1,0] 


RSTDRV 


Hardware —— = = ee — — 
Configuration Driven by riven by 
“Signals __Configuration Buffer pullup/pulldown | AIP Operational 


Figure 20. HWE Hardware Configuration Mode Timing 


290486~—20 
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4.2.4 HARDWARE BASIC CONFIGURATION 


The Hardware Basic configuration mode permits the 
user to assign addresses to the serial ports and par- 
allel ports. This is achieved by sampling several of 
the serial port connections at the end of a hardware 
reset. The PPDIR/GCS# signal defaults to game 
port chip select output (GCS #). The 82091AA pow- 
er supply voltage is not selectable in this mode and 


Hard Wire to 
Vcc/Vss is Optional 


>o 


82091AA 


is fixed at 5V. The parallel port mode is set to ISA- 
Compatible. In addition, the FDC floppy drive sup- 
port is set at two floppy drives. If configuration regis- 
ter access is enabled, the access address is fixed at 
398h/399h. To reconfigure the 82091AA using soft- 
ware, the 82091AA configuration mode must be 
changed to SWAI mode (refer to AIPCFG1 register). 
Figure 21 shows the implementation of a basic hard- 
ware configuration. 


Hard Wire to 
Vcc/Vss is Optional 


VV 


10 K 


Serial Port A 
Connector 


Serial Port B 
Connector 


IDE 
Signals 


290486--21 


Figure 21. Hardware Basic Configuration 
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4.2.5 HARDWARE EXTENDED 
CONFIGURATION MODE 


The Hardware Extended configuration mode pro- 
vides all of the features of the Hardware Basic con- 
figuration mode. Additional features in Hardware Ex- 
tended configuration permit the user to select quan- 
tity of floppy drives can be selected for either 2 or 4 
floppy drive support. The 82091AA operating volt- 
age is selectable between 3.3V* and 5V. In addition, 
the parallel port can be configured to operate in ISA- 
Compatible, PS/2-Compatible, or EPP modes. Hard- 
ware extended configuration provides these addi- 
tional hardware configuration options by sampling 


intel. 


When RSTDRV is asserted, the HC367 drives the 
values on SOUTA, RTSA#, DTRA#, SOUTB, 
RTSB#, and DTRB# (Figure 22). When RTSDRV is 
negated, the HC367 is disabled and these serial port 
signals are driven by HWEb pullup/down resistors. 
The PPDIR/GCS# signal defaults to a game port 
chip select (GCS#). To reconfigure the 82091AA 
using software, the 82091AA configuration mode 
must be changed to SWAI mode (refer to AIPCFG1 
register). 


NOTE: | 
*3.3V operation is only available in the 
82091AA. 


the pins on the serial ports at two different times. 


a — — 


Zs — — 
Harci Wire to Ae—p—o— So \ 
7 


Vec/Vss is Optional i me By [Ese aed Eo 


7 _— 
Mos ae 
| =e Hard Wire to 
=o Vec/Vss is Optional 
~ 1 
10 K 


: 


A 
eg re ee eer \eei Port A 
DTRA# aan : Connector 
kame 7 
es Ves (PE ee jcc 
DTRB# aaa eee ee ae d Connector 
rte 0, RRs (es se | aie (me (ORs 
ey aie es i) | = 
cS LRA 


Hard Wire to 
Vec/Vss is Optional 


~~ 
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Figure 22. Hardware Extended Configuration 
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4.2.6 SOFTWARE ADD-IN CONFIGURATION 


The Software Add-in configuration mode permits the 
user to assign the address for the 82091AA configu- 
ration registers, and select the power supply voltage 
for the 82091AA. The 82091AA configuration 


Hard Wire to 
Vec/Vss is Optional 


V/ 


82091AA 


registers are accessible. The registers are located in 
the ISA Bus I/O address space and can be selected 
to be at either 398h/399h or 26Eh/26Fh. The 
PPDIR/GCS# signal defaults to a game port chip 
select (GCS #). 


Serial Port A 
Connector 


IDECS1# 
DEN# 
HEN# 


10 K 


Hard Wire to 


Vec/Vss is Optional 


>o 


Connector 


IDE 
Signals 


290486-23 


Figure 23. Software Add-In Configuration 
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4.2.7 SOFTWARE MOTHERBOARD are accessible via the X-Bus |/O address space and 
CONFIGURATION can be selected to be at either 22h/23h or 24h/25h. 
: ; In addition, the user selects the power supply volt- 
The Software. Motherboard configuration mode per- age for the 82091AA. The PPDIR/GCS# signal de- 


mits the 82091 AA to be located on the motherboard. faults to a Parallel Port Direction Control Output 
In this mode, the 82091AA configuration registers (PPDIR). 


Per 


Hard Wire to 38) 


Vec/Vss is Optional 


10K 
SOUTA 


> 
RTSA# Tr oo Port A 
come 
DTRA# > 


SOUTB 
RTSB# 
DTRB# Serial Port B | 


| . 3 Connector 
RSTDRV EL eot IDE 
IDECS1# 
Signal 


DEN# 


> 
PPDIR/ HEN# 


GCS# 


10K 


1 ) Hard Wire to 


Vecc/Vss is Optional 
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Figure 24. Software Motherboard Hardware Configuration 
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5.0 HOST INTERFACE 


The 82091AA host interface is an 8-bit direct-drive | 
(24 mA) ISA Bus/X-Bus interface that permits the . 


CPU to access its registers through read/write oper- 
ations in I/O space. These registers may be ac- 
cessed by programmed I/O and/or DMA bus cycles. 
With the exception of the IDE Interface, all functions 
on the 82091AA require only 8-bit data accesses. 
The 16-bit access required for the IDE Interface is 
supported through the appropriate chip selects and 
data buffer enables from the 82091AA. The 
82091AA does not participate in 16-bit IDE DMA 
transfers. 


Although the 82091AA has an ISA/X-Bus host inter- 
face, there are a few features that differentiate it 
from conventional ISA/X-Bus peripherals. These 
features are as follows: 


e Internal, Configurable Chip Select Decode 
Logic. SA[9:0] allow full decoding of the ISA 1/O 
address space such that the functional modules 
contained in the 82091AA can be relocated to 
the desired !/O address. This feature can be 
used to resolve potential system configuration 
conflicts. 


e IOCHRDY for ISA Cycle Extension. During cer- 
tain I/O cycles to the parallel port controller in the 
82091AA, it is necessary to extend the current 


ISA Bus 


82091AA 


bus cycle to match the access time of the device 
connected to the Parallel Port. The 
IOCHRDY signal is used by the 82091AA to ex- 
tend ISA Bus cycles, as needed, according to the 
ISA protocol. IOCHRDY overrides all other 
strobes that attempt to shorten the bus cycle. 


e NOWS# for 3 BCLK I/O Cycles. All pro- 


grammed |/O accesses to 82091AA registers 
can be completed in a total of 3 BCLK cycles. 
This is possible because the 82091AA register 
access times have been minimized to allow data 
transfers to occur with shortened read/write con- 
trol strobes. As a result, the 82091AA is well suit- 
ed for use in embedded control designs that use 
an asynchronous microprocessor interface with- 
out any particular reference to ISA cycle timings. 


e DMA Transfers: The 82091AA supports DMA 
compatible, type A, type B and type F DMA cy- 
cles. Some newer system DMA controllers are 
capable of generating fast DMA cycles (type F) 
on all DMA channels. If such a controller is used 
in conjunction with the 82091AA, it will. be possi- 
ble to accomplish a DMA transfer in 2 BCLKs. 


The 82091AA ISA data lines (SD[7:0]) can be con- 
nected directly to the ISA Bus. If external buffers are 
used to isolate the SD[7:0] signals from the 240 pF 
loading of the ISA Bus, the DEN# signal can be 
used to control the external buffers as shown in Fig- 
ure 25. 


Optional 


Buffer 
290486-25 


Figure 25. ISA Interface (with Optional Data Buffer) 
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6.0 PARALLEL PORT 


The 82091AA parallel port can be configured for 
four parallel port modes. These parallel port modes 
and the associated parallel interface protocols are: 


| Parallel Port Mode Parallel Interface Protocol 


ISA-Compatible Mode Compatibility, Nibble 
PS/2-Compatible Mode Byte 
EPP Mode EPP 
ECP | ECP 


ISA-Compatible, PS/2-Compatible, and EPP modes 
are selected through 82091AA configuration (see 
Section 4.0, AIP Configuration). ECP is selected by 
programming the ECP Extended Control Register 
(ECR). oo 


In ISA-Compatible mode, the parallel port exactly 
emulates a standard ISA-style parallel port. The par- 
allel port data bus (PD[7:0]) is uni-directional. The 
compatibility protocol transfers data to the peripher- 
al device via PD[7:0] (forward direction). Note that 
the Nibble protocol permits data transfers from the 
peripheral device (reverse direction) by using four 
peripheral status signal lines to transfer 4 bits of 
data at a time. 


PS/2-Compatible mode differs from ISA-Compatible 
mode by providing bi-directional transfers on 
PD[7:0]. A bit is added to the PCON Register to al- 
low software control of the data transfer direction. 


For both the ISA-Compatible ancl PS/2-Compatible 
modes, the actual data transfer over the parallel port 
interface is accomplished by software handshake 
(i.e., automatic hardware handshake is not used). 
Software controls data transfer by monitoring hand- 
shake signal status from the peripheral device via 
the PSTAT Register and controlling handshake sig- 
nals to the peripheral device via the PCON Register. 


EPP mode provides bi-directional transfers on 
PD[7:0]. The 82091AA automatically generates the 
address and data strobes in harclware. 


ECP is a high performance peripheral interface 
mode. This mode uses an asynchronous automatic 
handshake to transfer data over the parallel port in- 
_ terface. In addition, the parallel port contains a FIFO 
for transferring data in ECP mode. The ECP register 
set contains an Extended Control Register (ECR) 
that provides a wide range of functions including the 
ability to operate the parallel port in either ECP, ISA- 
Compatible, or PS/2-Compatible modes. 
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In general, this document describes parallel 
port operations and functions in terms of 
how the 82091AA parallel port hardware op- 
erates. Detailed descriptions of the parallel 
interface protocols are beyond the scope of 
this document. Readers should refer to the 
proposed IEEE Standard 1284 for detailed 
descriptions of the Compatibility, Nibble, 
Byte, EPP, and ECP protocols. | 


Special circuitry on the 82091 prevents it from being 
powered up or being damaged while a parallel port 


_ peripheral is powered on and the 82091 is powered 
off. 


6.1 Parallel Port Registers 


This section is organized into three sub-sections— 
ISA-Compatible and PS/2-Compatible Modes, EPP 
Mode, and ECP Mode. Since the register sets are 
similar for ISA-Compatible and PS/2-Compatible 
modes (differing by a direction control bit in the 
PCON Register) the register set descriptions are 
combined. The EPP mode and ECP mode register 
sets are described separately. Each register set de- 
scription contains the I/O address assignment and a 
complete description of the registers and register 
bits. Note that the PSTAT and PCON Registers are 
common to all modes and for completeness are re- 
peated in each sub-section. Any difference in bit op- 
erations for a particular mode is noted in that partic- 
ular register description. 


The registers provide parallel port control/status in- 
formation and data paths for transferring data be- 
tween the parallel port interface and the 8-bit host 
interface. All registers are accessed as byte quanti- 
ties. The base address is determined by hardware 
configuration at powerup (or a hard reset) or via soft- 
ware configuration by programming the 82091AA 
configuration registers as described in Section 4.0, 
AIP Configuration. The parallel port can be disabled 
or configured for a base address of 378h (all 
modes), 278h (all modes), or 3BCh (all modes ex- 
cept EPP and ECP). This provides the system de- 
signer with the option of using additional parallel 
ports on add-in cards that have fixed address de- 
coding. 
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Some of the parallel port registers described in this 
section contain reserved bits. These bits are labeled 
‘“R”’. Software must deal correctly with fields that are 
reserved. On reads, software must use appropriate 
masks to extract the defined bits and not rely on 
reserved bits being any particular value. On writes, 
software must ensure that the values of reserved bit 
positions are preserved. That is, the value of re- 
served bit positions must first be read, merged with 
the new values for other bit positions, and then writ- 
ten back. 


During a hard reset (RSTDRV asserted), the 
82091AA registers are set to pre-determined de- 
fault states. The default values are indicated in the 
individual register descriptions. 


82091AA 


The following nomenclature is used for register ac- 
cess attributes: 


RO Read Only. Note that for registers with read 
only attributes, writes to the I/O address have 
no affect on parallel port operations. 


R/W Read/Write. A register with this attribute can 
be read and written. Note that individual bits in 
some read/write registers may be read only. 


6.1.1 ISA-COMPATIBLE AND PS/2- 
COMPATIBLE MODES 


This section contains the registers used in ISA-Com- 
patible and PS/2-Compatible modes. The I/O ad- 
dress assignment for this register set is shown in 
Table 15 and the register descriptions are presented 
in the order that they appear in the table. 


Table 15. Parallel Port Register (ISA-Compatible and PS/2-Compatible) 


Parallel Port Register | 
Address Access 
(AEN = 0) Base + 


NOTE: 
Parallel port base addresses are 278h, 378h and 3BCh. 
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6.1.1.1 PDATA—Parallel Port Data Register (ISA-Compatible and PS/2-Compatible Modes) 


|/O Address: Base +00h 
Default Value: 00h 
Attribute: Read/Write 
Size: — 8 bits 


ISA-Compatible Mode 


The PDATA Register is a uni-directional data port that transfers 8-bit data from the host to the peripheral 
device (forward transfer). A write to this register drives the written data onto PD[7:0]. Reads of this register 
should not be performed in ISA-Compatible mode. For a host read of this address location, the 82091AA 
- completes the handshake on the ISA Bus and the value is the last value stored in the PDATA Register. 


PS/2-Compatible Mode 


The PDATA Register is a bi-directional data port that transfers 8-bit data between the peripheral device and 
host. The direction of transfer is determined by the DIR# bit in the PCON Register. If DIR#=0 (forward 
direction), and the host writes to this register, the data is stored in the PDATA Register and driven onto 
PD[7:0]. lf DIR# =1 (reverse direction), a host read of this register returns the data on PD[7:0]. Note that read 
data is not stored in the PDATA Register. . 


Description 


PARALLEL PORT DATA: Bits[7:0] correspond to parallel port data lines PD[7:0] and ISA Bus data 
lines SD[7:0]. 


6.1.1.2 PSTAT—Status Register (ISA-Compatible and PS/2-Compatible Modes) 


\/O Address: Base +0Oi1h 
Default:Value: XXXX X1RR 
Attribute: Read Only 
Size: 8 bits 


The PSTAT Register provides the status of certain parallel port signals and whether a CPU interrupt has been 
generated by the parallel port. This register indicates the current state of the BUSY, ACK#, PERROR, 
SELECT, and FAULT # signals. 
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‘Reserved | | 
Not Used (ISA-Compatible Mode) (RO) 
Always 1 4 
IRQ Signal Status (PS/2-Compatible) (RO) 
1=Asserted 
0=Negated 
Fault# Signal Status (RO) 


1=Negated 
0=Asserted 


SELECT Signal Status (RO) 
1=Asserted . 
0=Negated 


PERROR Signal Status (RO) 
1=Asserted | | 
0=Negated 


ACK# Signal Status (RO) 
1=Negated 
=Asserted 


BUSY Signal Status (RO) 
0=Asserted 
1=Negated 
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NOTE: 
X = Default value is determined by signal state at reset. 


Figure 26. Status Register (ISA-Compatible and PS/2-Compatible Modes) 
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. Description or 

BUSY STATUS (BUSYS): This bit indicates the state of the parallel port interface BUSY signal. 


When BUSY is asserted, BUSYS = 0. When BUSY is negated, BUSYS = 1.This bit is an inverted 
version of the parallel port BUSY signal. 


ACK# STATUS (ACKS): This bit indicates the state of the parallel port interface ACK # signal. This 
bit indicates when the peripheral has received a data byte and is ready for another. When ACK # is 

_ asserted, ACKS = 0. When ACK # is negated, ACKS = 1. Note that if interrupts are enabled (via bit 4 
of the PCON Register), the assertion of the ACK # signal generates an interrupt to the CPU. 


PERROR STATUS (PERRS): This bit indicates the state of the parallel port interface PERROR 
signal. This bit indicates when an error has occurred in the peripheral paper path (e.g., out of paper). 
When PERROR is asserted, PERRS= 1, When PERROR is negated, PERRS=0. 


SELECT STATUS (SELS): This bit indicates the state of the parallel port interface SELECT signal. 
When the SELECT signal is asserted, SELS= 1, When the SELECT signal is negated, SELS=0. 


FAULT # STATUS (FAULTS): This bit indicates the state of the parallel port interface FAULT # 
signal being driven by the peripheral device. When the FAULT # signal is asserted, FAULTS=0. 
When the FAULT # signal is negated, FAULTS = 1. 


PARALLEL PORT INTERRUPT STATUS (PIRQ): This bit indicates a CPU interrupt by the parallel 
port. PIRQ indicates that the printer has accepted the previous character and is ready for another. 
In ISA-Compatible mode, interrupt status is not reported in this register and this bit is always 1. 


In PS/2-Compatibile mocle, if interrupts are enabled via the PCON Register and the ACK # signal is 
asserted (low-to-high transition), PIRQ is set to a 0 (and an IRQ generated to the CPU). The 
82091AA sets PIRQ to 1 when this register is read or by a hard reset. If interrupts are disabled via 
the PCON Register, this bit is never set to 0. 


1:0 | RESERVED 
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6.1.1.3 PCON—Control Register (ISA-Compatible And PS/2-Compatible Mode) 


1/O Address: Base + 02h 
Default Value: RROO 0000 

Attribute: Read/Write 

Size: 8 bits 


The PCON Register controls certain parallel port interface signals and enables/disables parallel port inter- 
rupts. This register permits software to control the STROBE #, AUTOFD#, INIT#, and SELECTIN# signals. 
For PS/2-Compatible mode, this register also controls the direction of transfer on PD[7:0]. 


7 6 5 4 3 2 1 0~.-—sC*=BBit 
Ce Tele Te [Te [J 


STROBE# Signal (R/W) 
1=Assert 
0=Negate 
AUTOFD# Signal (R/W) 
1=Assert 
0=Negate 


INIT# Signal (R/W) 


1=Negate 
0=Assert 
SELECTIN# Signal (R/W) 
1=Assert 
0=Negate 
PP Interrupt Enable (ACK#) (R/W) 
1=Enable 
0=Disable 


Reserved (ISA-Compatible Mode) (R/W) 


Direction (PS/2-Compatible Mode) (R/W) 
1=Reverse 
0=Forward 


Reserved 
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Figure 27. Control Register (ISA-Compatible and PS/2-Compatible Modes) 
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Description 
7:6 RESERVED 


RESERVED (ISA-COMPATIBLE MODE): Not used and undefined when read. Writes have no affect 
on parallel port operations. 


DIRECTION (DIR #) (PS/:2-COMPATIBLE MODE): This bit is used to control the direction of data 
transfer on the parallel port data bus (PDI? 0]). When DIR # =O, PD[7:0] are outputs. When 
DIR # =1, PD[7:0] are inputs. 


ACK# INTERRUPT ENABLE (ACKINTEN): ACKINTEN enables CPU interrupts (via either IRQ5 or 
IRQ7) to be generated when the ACK # signal on the parallel port interface is asserted. When 
ACKINTEN = 1, a CPU interrupt is generated when ACK # is asserted. When ACKINTEN =0, the 
ACK # interrupt is disabled. 


SELECTIN # CONTROL (SELINC): This bit controls the SELECTIN# signal. SELINC is set to 1 to 
select the printer. When SELINC = 1, the SELECTIN # signal is asserted, Wnen SELINC=0, the 
SELECTIN # signal is negated. 


INIT# CONTROL (INITC): This bit controls the INIT # signal. When INITC = 1, the INIT # signal is 
negated. When INITC=0, the INIT # signal is asserted. 


AUTOFD # CONTROL (AUTOFDC): This bit controls the AUTOFD# signal. AUTOFDC is set to 1 to 
instruct the printer to advance the paper one line each time a carriage return is received. When 
AUTOFDC = 1, the AUTOFD# signal is asserted. Wnen AUTOFDC=0, the AUTOFD # signal is 
negated. 


STROBE # CONTROL (STROBEC): This bit controls the STROBE # signal. The STROBE # signal 
is set active to instruct the printer to accept the character being presented on the data lines. When 
STROBEC = 1, the STROBE # signal is asserted. When STROBEC=0, the STROBE # signal is 
negated. 
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6.1.2 EPP MODE 


This section contains the registers used in EPP mode. The I/O address assigment for this register set is shown 
in Table 16 and the register descriptions are presented in the order that they appear in the table. 


Table 16. Parallel Port Registers (EPP Mode) 


Register Name 


th stat Sts Register Sid 


Parallel Port Register 
Address Access 
(AEN = 0) Base + 


Abbreviation 


Ph ADDSTR Address Strobe Register 


3h 
4h-7h DATASTR Data Strobe Registers 
NOTE: - 


Parallel port base addresses are 278h (LPT2) and 378h (LPT1). Base address 3BCh is not available in EPP or ECP modes. 


6.1.2.1 PDATA—Parallel Port Data Register (EPP Mode) 


1/O Address: Base +00h 
Default Value: OOh 
Attribute: Read/Write 
Size: 8 bits 


The PDATA Register is a bi-directional data port that transfers 8-bit data between the peripheral device and 
host. The direction of transfer is determined by the DIR# bit in the PCON Register. lf DIR#=0 (forward 
direction) and the host writes to this register, the data is stored in the PDATA Register and driven onto 
PD[7:0]. lf DIR# =1 (reverse direction), a host read of this register returns the data on PD[7:0]. However, read 
data is not stored in the PDATA Register. 


7:0 | PARALLEL PORT DATA: Bits[7:0] correspond to parallel port data lines PD[7:0] and ISA Bus data 
lines. 
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6.1.2.2 PSTAT—Status Register (EPP Mode) 


1/O Address: Base +0ith 


Default Value: . XXXX X1RR - 
Attribute: Read Only 
Size: 8 bits 


The PSTAT Register provides the status of certain parallel port signals. It also indicates whether a CPU 
interrupt has been generated by the parallel port. This register indicates the current state of the BUSY, ACK#, 
PERROR, SELECT, and FAULT # signals. 


IRQ Signal Status (RO) 
Not Used in EPP Mode | 
Always 1 


Fault# Signal Status (RO) 


1=Negated 
0=Asserted 


SELECT Signal Status (RO) 


1=Asserted 
| 0=Negated 
PERROR Signal Status (RO) 
1=Asserted 
0=Negated 
ACK# Signal Status (RO) 
1=Negated 
0=Asserted 
BUSY Signal Status (RO) 
0=Asserted 


1=Negated 
290486-28 


NOTE: 
X= Default value is determined by signal state at reset. 


Figure 28. Status Register (EPP Mode) 
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BUSY STATUS (BUSYS): This bit indicates the state of the parallel port interface BUSY signal. 
When BUSY is asserted, BUSYS = 0. When BUSY is negated, BUSYS = 1. This bit is an inverted 
version of the parallel port BUSY signal. | 


ACK # STATUS (ACKS): This bit indicates the state of the parallel port interface ACK # signal. This 
bit indicates when the peripheral has received a data byte and is ready for another. When ACK # is 

asserted, ACKS = 0. When ACK # is negated, ACKS = 1. Note that if interrupts are enabled (via bit 4 
of the PCON Register), the assertion of the ACK # signal generates an interrupt to the CPU. 


PERROR STATUS (PERRS): This bit indicates the state of the parallel port interface PERROR 
signal. This bit indicates when an error has occurred in the peripheral paper path (e.g., out of paper). 
When PERROR is asserted, PERRS= 1. When PERROR is negated, PERRS=0. 


SELECT STATUS (SELS): This bit indicates the state of the parallel port interface SELECT signal. 
When the SELECT signal is asserted, SELS = 1. When the SELECT signal is negated, SELS = 0. 


FAULT # STATUS (FAULTS): This bit indicates the state of the parallel port interface FAULT # 
signal being driven by the peripheral device. When the FAULT # signal is asserted, FAULTS =0. 


7 

3 
When the FAULT # signal is negated, FAULTS= 1. 

2 PARALLEL PORT INTERRUPT (PIRQ): In EPP mode interrupt status is not reported in this register 
and this bit is always 1. | 


1:0 | RESERVED 
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6.1.2.3 PCON—Control Register (EPP Mode) 


1/O Address: Base + 02h 


Default Value: RROO 0000 
Attribute: Read/Write 
‘Size: 8 bits 


The PCON Register controls certain parallel port interface signals, enables/disables parallel port interrupts, 
and selects the direction of data transfer on PD[7:0]. This register permits software to control the INIT# 
signal. Note that in the EPP parallel interface protocol, the STROBE#, AUTOFD#, and SELECTIN# signals 
are automatically generated by the parallel port and are not controlled by software. 


0 «Bit 


Default 


STROBE# Signal (R/W) 
Write to 0 


AUTOFD# Signal (R/W) 
Write to 0 


INIT# Signal (R/W) 


1=Negate 
0=Assert 


SELECTIN# Signal (R/W) 
Write to 0 


ACK# Interrupt Enable (R/W) 
1=Enable 
0=Disable 


—— Direction (R/W) 
1=Reverse 
0=Forward 


Reserved 
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‘Figure 29. Control Register (EPP Mode) 
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DIRECTION (DIR #): This bit is used to control the direction of data transfer on the parallel port data 
bus (PD[7:0]). When DIR # =0 (forward direction), PD[7:0] are outputs. When DIR # = 1 (reverse 
direction), PD[7:0] are inputs. 


ACK# INTERRUPT ENABLE (ACKINTEN): ACKINTEN enables CPU interrupts (via IRQS or IRQ7) 


to be generated when the ACK # signal on the parallel port interface is asserted. When 
ACKINTEN = 1, a CPU interrupt is generated when ACK # is asserted. When ACKINTEN =0, the 
ACK # interrupt is disabled. | 


3 SELECTIN# CONTROL (SELINC): Write to 0 when programming this register. This bit must be 0 for 
the parallel port handshake to operate properly. , 
INIT # CONTROL (INITC): This bit controls the INIT # signal. When INITC = 1, the INIT # signal is 


negated. When INITC =0, the INIT # signal is asserted. 


STROBE # CONTROL (STROBEC): Write to 0 when programming this register. This bit must be 0 
for the parallel port handshake to operate properly. 


6.1.2.4 ADDSTR—EPP Auto Address Strobe Register (EPP Mode) 


1/O Address: Base +03h 


Default Value: OOh 
Attribute: Read/Write | 4 
Size: 8 bits 


The ADDSTR Register provides a peripheral address to the peripheral (via PD[7:0]) during a host address 
write operation and to the host (via PD[7:0]) during a host address read operation. An automatic address 
strobe is generated on the parallel port interface when data is read from or written to this register. 


EPP ADDRESS: Bits[7:0] correspond to SD[7:0] and PD[7:0]. 
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6.1.2.5 DATASTR—Auto Data Strobe Register (EPP Mode) 


|/O Address: Base + 04h, 05h, O6h, 07h 
Default Value: = OOh 

Attribute: Read/Write 

Size: 8 bits 


The DATASTR Register provides data from the host to the peripheral device (via PD[7:0]) during host write 
_ operations and from the peripheral device to the host (via PD[7:0]) during a host read operation. An automatic 

data strobe is generated on the parallel port interface when data is read from or written to this register. To 
maintain compatibility with Intel’s 82360SL |/O device that has a 32-bit Host Bus interface, four consecutive 
byte address locations are provided for transferring data. 


| } Bit | Description | | 
| EPP DATA: Bits[7:0] correspond to SD[7:0] and PDI7:0]. 


6.1.3 ECP MODE 


This section contains the registers used in ECP mode. The I/O address assignment for this register set is 
shown in Table 17 and the register descriptions are presented in the order that they appear in the table. The 
Extended Control Register (ECR) permits various modes of operation. Note that ECR[7:5] =000 selects ISA- 
Compatible mode and ECR[7:5] =001 selects PS/2-Compatibile mode. These modes are discussed in Sec- 
tion 6.1.1, ISA-Compatible and PS/2 Compatible modes. The other modes selected by ECR[7:5] are dis- 


cussed in this section. 


Table 17. Parallel Port Registers (ECP Mode) 


Parallel Port 


Register Address ; 
Access (AEN = 0) Abbreviation Register Name Read/Write 
Base + ECR(7:5] 
Attribute 


TstausResister Sid 

TcontoiRegiter Sit 

standard Paral Por Gata FIFO | 010 

aw 
a a ee 

NOTES: 


1. Parallel port base addresses are 278h, 378h, and 3BCh. ; 
' 2. A register is accessible when the ECR[7:5] field contains the value specified in the ECR[7:5] column. The register is not 
accessible if the ECR[7:5] field does not match the value specified in this column. The term “All” means that the register 
is accessible in all modes selected by ECR[7:5]. 


ECPAF IFO 


SDFIFO 
ECPDFIFO 


ECPCFGA 
ECPCFGB 
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6.1.3.1 ECPAFIFO—ECP Address/RLE FIFO Register (ECP Mode) 


|/O Address: Base +00h 

Default Value: UUUU UUUU (Undefined) 
Attribute: Read/Write 

Size: 8 bits 


The ECPAFIFO Register provides a channel address or a Run Length Count (RLE) to the peripheral, depend- 
ing on the state of bit 7. This |/O address location is only used in ECP mode (ECR bits[7:5] =011). In this 
mode, bytes written to this register are placed in the parallel port FIFO and transmitted over PD[7:0] using 
ECP protocol. 


16 Byte FIFO 


Default 


ECP Address/RLE Value (R/W) 

Bit 7=1: Bits[6:0] Represents a Channel 
Address to the Peripheral 

Bit 7=0: Bits[6:0] Represents a Run Length 
Count to the Peripheral 


290486-30 


NOTE: 
U = Undefined 


Figure 30. ECP Address/RLE FIFO Register (ECP Mode) 


pity eseription 


ECP ADDRESS/RLE VALUE: Bits[7:0] correspond to parallel port data lines PD[7:0] and ISA Bus 
data lines SD[7:0]. The peripheral device should interpret bits[6:0] as a channel address when 
bit 7= 1 and as arun length count when bit 7=0. Note that this interpretation is performed by the 
peripheral device and the value of bit 7 has no affect on 82091AA operations. Note that the 
82091AA asserts AUTOFD # to indicate that the information on PD[7:0] represents an ECP 
address/RLE count. The 82091AA negates AUTOFD # (drives high) when PD[7:0] is transferring 
data. 
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6.1.3.2 PSTAT—Status Register (ECP Mode) 


|/O Address: = Base + Oth 
Default Value: © XXXX X1RR 
Attribute: Read Only 
Size: 8 bits 


The PSTAT Register provides the status of certain parallel port signals and whether a CPU interrupt has been 
generated by the parallel port. This register indicates the current state of the BUSY, ACK#, PERROR, 
SELECT, and FAULT # signals. | | 


Default 


- IRQ Signal Status (RO) | 
Not Used on ECP Mode 
Always 1 


Fault# Signal Status (RO) 
1=Negated 
0=Asserted 
SELECT Signal Status (RO) 
1=Asserted 
0=Negated 
PERROR Signal Status (RO) 
1=Asserted 
0=Negated 
— ACK# Signal Status (RO) 
1=Negated 
0=Asserted 
BUSY Signal Status (RO 


(d=Asserted 
1=Negated 
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NOTE: . 
X= Default value is determined by the state of the corresponding signal pin at reset. 


Figure 31. Status Register (ECP Mode) 
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BUSY STATUS (BUSYS): This bit indicates the state of the parallel port interface BUSY signal. 
When BUSY is asserted, BUSYS = 0. When BUSY is negated, BUSYS = 1. This is an inverted 
version of the parallel port BUSY signal. Refer to Section 6.2.3 ECP Mode for more detail. 


ACK# STATUS (ACKS): This bit indicates the state of the parallel port interface ACK # signal. This 
bit indicates when the peripheral has received a data byte and is ready for another. When ACK # is 

asserted, ACKS = 0. When ACK # is negated, ACKS = 1. Note that if interrupts are enabled (via bit 4 
of the PCON Register), the assertion of the ACK# signal generates an interrupt to the CPU. Refer to 
Section 6.2.3 ECP Mode for more detail. 


PERROR STATUS (PERRS): This bit indicates the state of the parallel port interface PERROR 
signal. This bit indicates when an error has occurred in the peripheral paper path (e.g., out of paper). 
When PERROR is asserted, PERRS = 1, When PERROR is negated, PERRS = 0. 


SELECT STATUS (SELS): This bit is used in all parallel port modes and indicates the state of the 
parallel port interface SELECT signal. When the SELECT signal is asserted, SELS = 1. When me 
SELECT signal is negated, SELS=0. | 


FAULT # STATUS (FAULTS): This bit is used in all parallel port modes and indicates the state of 
the parallel port interface FAULT # signal being driven by the peripheral device. When the FAULT # 
signal is asserted, FAULTS =0. When the FAULT # signal is negated, FAULTS= 1. 


PARALLEL PORT INTERRUPT (PIRQ): In ECP mode, interrupt status is not reported in this register 


and this bit is always 1. 


RESERVED 
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6.1.3.3 PCON—Control Register (ECP Mode) 


|/O Address: Base + O2h 
Default Value: “~RROO 0000 
Attribute: Read/Write 
Size: 8 bits 


The PCON Register controls certain parallel port interface signals, enables/disables parallel port interrupts, 
and selects the direction of data transfer on PD[7:0]. Note that the function of some bits depends on the 
_ programming of the ECR. 


7 6 5 4&4 3 2 14 0 Bit 
eenooooor™ 


STROBE# Signal (R/W) 


AUTOFD# Signal (R/W) 


INIT# Signal (R/W) 
1=Negate 
0=Assert 
SELECTIN# Signal (R/W) 
1=Assert 
0=Negate 
— ACK# Interrupt Enable (R/W) 
1=Enable 
0=Disable 


ISA-Compatible Mode (ECR[7:5]=000, 010) (RO) 
Not Used (PD[7:0] are outputs) 
Pss/2-Compatible and ECP Modes (ECR[7:5] = 001, 011) (R/W) 


1=Reverse Direction (PD[7:0] are inputs) 
0=Forward Direction (PD[7:0] are outputs) 


Reserved 
290486-32 


Figure 32. Control Register (ECP Mode) 
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a 
RESERVED 


DIRECTION (DIR #): This bit is used to control the direction of data transfer on the parallel nod data 
bus (PD[7:0]). When DIR # = 0 (forward direction), PD[7:0] are outputs. When DIR # = 1 ven 
direction), PD[7:0] are inputs. 


INTERRUPT ENABLE (ACK #) (IRQEN): IRQEN enables interrupts to the CPU to be generated 
when the ACK # signal on the parallel port interface is asserted and is used in all parallel port 
interface modes. When IRQEN= 1, a CPU interrupt is generated when ACK # is asserted. When 
IRQEN =0, parallel port interrupts are disabled. ) 


SELECTIN# CONTROL (SELINC): This bit controls the SELECTIN# signal. SELINC is set to 1 to 
select the printer. When SELINC = 1, the SELECTIN # signal is asserted, When SELINC= 0, the 
SELECTIN # signal is negated. 


INIT # CONTROL (INITC): This bit controls the INIT # signal When INITC = 1, the INIT # signal is 
negated. When INITC =0, the INIT # signal is asserted. 


AUTOFD # CONTROL (AUTOFDC): In ECP mode or ISA-Compatible FIFO mode (ECRI7: = 
010), this bit has no effect. Refer to Section 6.2.3 ECP Mode for more details. 


STROBE # CONTROL (STROBEC): In ECP mode or ISA-Compatible FIFO mode (ECR[7:5] =011, 
010), this bit has no effect. Refer to Section 6.2.3 ECP Mode for more details. 


=011, 
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6.1.3.4 SDFIFO—Standard Parallel Port Data FIFO 


|/O Address: Base + 400h and (ECR[7:5] =010) 
Default Value: UUUU UUUU (undefined) | 
Attribute: Read/Write 

Size: 8 bits 


SDFIFO is used to transfer data from the host to the peripheral when the ECR Register is set for ISA-Compati- 
ble FIFO mode (bits[7:5] = 010). Data bytes written or DMAed from the system to this FIFO are transmitted by 
a hardware handshake to the peripheral using the standard ISA-Compatible protocol. Note that bit 5 in the 
PCON Register must be set to 0 for a forward transfer direction. 


16 Byte FIFO 
Byte 15 


Bit 
Default 


ECP Standard Parallel Port FIFO Mode Data (R/W) 
(ECR[7:5]=010) 
290486-33 


NOTE: 
U = Undefined 


Figure 33. ECP iSA-Compatible Data FIFO 


} Bit | | Description 
| ECP STANDARD PARALLEL PORT DATA: Bits|7:0] correspond to SD[7:0] and PDI7:0]. 
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6.1.3.5 DFIFO—Data FIFO (ECP Mode) 


|/O Address: Base +400h and (ECR[7:5] = 011) 
Default Value: UUUU UUUU (undefined) 
Attribute: Read/Write 

Size: 8 bits 


This I/O address location transfers data between the host and peripheral device when the parallel port is in 
ECP mode (ECR Bits[7:5] =011). Transfers use the parallel port FIFO. Data is transferred on PD[7:0] via 
hardware handshakes on the parallel port interface using ECP parallel port interface handshake protocol. 


16 Byte FIFO 


VA \ 
7 0, Bit 


ECP Mode Data (R/W) 
(ECR[7:5]=011) 
290486-34 


NOTE: 
U = Undefined 


Figure 34. ECP Data FIFO (ECP Mode) 
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ECP MODE DATA: Data bytes written or DMAed from the system to this FIFO in the forward 
direction (PCON bit 5= 0) are transmitted to the peripheral by an ECP mode protocol hardware 
handshake. In the reverse direction (PCON bit 5 = 1) data bytes from the peripheral are transferred 
to the FIFO using the ECP mode protocol hardware handshake. Reads or DMAs from the FIFO 
return bytes of ECP data to the system. Bits[7:0] correspond to SD[7:0] and PD[7:0]. 
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6.1.3.6 TFIFO—ECP Test FIFO Ftegister (ECP Mode) 


1/O Address: Base + 400 arid (ECR[7:5] = 110) 
Default Value: UUUU UUUU (undefined) 
Attribute: Read/Write 

Size: — 8 bits 


The TFIFO Register provides a test mechanism for the ECP mode FIFO. Test mode is enabled via the ECR 
Register. In test mode (ECR[7:5] =: 110), data can be read, written or DMAed to/from the FIFO by accessing 
this register I/O address. 


Data bytes may be read, written or DMAed to or from the system to this FIFO in any direction. The parallel port 
interface signals are not affected by TFIFO accesses and TFIFO data is not transmitted to PD[7:0]. The test 
FIFO does not stall when overwritten or underrun. Data is simply re-written or over-run. The full and the empty 
bits in the ECR always keep track of the correct FIFO state. 


The test FIFO transfers data at the maximum ISA rate so that software can generate performance metrics. 
The FIFO write threshold can be determined by starting with a full TFIFO and emptying it a byte at a time until 
a service interrupt is set to 1 in the ECR. The FIFO read threshold can be determined by setting the direction 
bit in the PCON Register to 1, and filling the FIFO a byte at a time until the service interrupt is set to 1 in the 
ECR. | 


16 Byte FIFO 


Bit 
Default 


ECP Test Mode Data (R/W) 


(ECR[7:5]=110) 
290486-35 


NOTE: 
U = Undefined 


Figure 35. ECP Test FIFO Register (ECP Mode) 


} Bit eo Description 
ECP TEST FIFO Data: Bits[7:0] correspond to SD[7:0]. 
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6.1.3.7 ECPCFGA—ECP Configuration A Register (ECP Mode) 


1/O Address: Base +400h and (ECR[7:5] = 111) 
Default Value: 1001 RRRR 
Attribute: Read/Write 


Size: 8 bits 


The ECPCFGA Register provides information about the ECP mode implementation. Access to this register is 
enabled by programming the ECR Register (ECR[7:5] = 111). 


7 4 3 0 =C+Bit 


Reserved 
Implementation Identification (RO) 
(8-bit Implementation) 
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Figure 36. ECP Configuration A Register (ECP Mode) 


a 


7:4 | IMPLEMENTATION IDENTIFICATION (IMPID): This field is hardwired to 1001 to indicate an 8-bit 
implementation (bit 4) and an ISA-style interrupt (bit 7). This field is read only and writes have no 
affect. 


RESERVED 
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6.1.3.8 ECPCFGB—ECP Configuration B Register (ECP Mode) 


1/O Address: Base + 401th and (ECR[7:5] = 111) 
Default Value: 00h 

Attribute: Read/Write 

Size: 8 bits 


The ECPCFGB Register is part of the ECP specification and is implemented in the 82091AA as a scratchpad 
register. Software can use the fields in this register to maintain system information. Programming these bits 
does not affect parallel port operations. Access to the ECPCFGB. newer is enabled by programming the 
ECR Register (ECR[7:5] = 11 Le 


Default 


Reserved 


— IRQ Signal Status (R/W) 
See Bit Description 


- Reserved 
290486-37 


Figure 37. ECP Extended Control Register (ECP Mode) 


Description 
RESERVED: This bit always reads back as 0. 


INTRVALUE (INTRYV): This bit returns the value on the ISA IRQ line (IRQ5/IRQ7) to determine 
possible conflicts. The value of either IRQ5 or IRQ7 is read back based on the parallel port interrupt 
selection in the 82091AA configuration registers. IRQ5/IRQ7 are tri-stated in ECP configuration 
mode (ECR[7:5] = 111] to allow the state of the selected parallel port interrupt line to be read back. 
Note that the ACKINTEN bit in the PCON register must be written to 0 before the interrupt status can 
be read on this bit. 


RESERVED: These bits always read back as 0. 
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6.1.3.9 ECR ECP—Extended Control Register (ECP Mode) 


1/O Address: Base +402h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


This register selects the ECP mode, enables service and error interrupts and provides interrupt status. The 
ECR also enables/disables DMA operations and provides FIFO empty and FIFO full status. The FIFO empty 
and FIFO full status bits are also used to report FIFO overrun and underrun conditions. 


Default 


FIFO Empty Status (RO) 
1=Empty 
O=Not Empty 

FIFO Full Status (RO) | 


1=Full 
0=Not Full 


Service Interrupt Enable/Status (R/W) 
See Table 


DMA Enable (R/W) 


1=Enable 
0=Disable 


ERROR Interrupt Enable/Status (R/W 
1=ERROR interrupts Disabled 
0=ERROR Interrupts Enabled 


ECP Modes (R/W) 
000=ISA-Compatible 
001=PS/2-Compatible 
010=ISA-Compatible FIFO 
011=ECP 

100=Reserved 
101=Reserved 

110=Test 


111=Configuration 
290486-38 


Figure 38. ECP Extended Control Register (ECP Mode) 
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Description : 
ECP MODE SELECT: This field selects one of the following ECP Modes: 


Mode Operation | 

000 ISA-Compatible Mode. In this mode the parallel port operates in ISA-Compatible mode. 
The FIFO is reset and common collector drivers are used on the control lines (STROBE #, 
AUTOFD#, INIT # and SELECTIN#). Setting the direction bit to 1 in the PCON Register 
does not affect the parallel port interface. For register descriptions in this mode, See 
Section 6.1.1, ISA-Compatible and PS/2-Compatible Modes. 


001 PS/2-Compatibie Mode. In this mode the parallel port operates in PS/2-Compatible 
mode. The FIFO is reset and common collector drivers are used on the control lines 
(STROBE #, AUTOFD#, INIT # and SELECTIN #). Unlike mode 000 above, setting the 
direction bit to 1 in the PCON Register tri-states the data lines and reading the data 
register returns the value on the PD[7:0]. For register descriptions in this mode, see 
Section 6.1.1, ISA-Compatible and PS/2-Compatible Modes. 


010 ISA-Compatible FIFO Mode. This mode is the same as mode 000 above, except that 
data is written or DMAed to the FIFO. FIFO data is automatically transmitted using the ISA- 
style protocol. For this mode, the direction control bit in the PCON register must be 0. 


011 ECP Mode. In the forward direction, bytes written to the ECP DFIFO location and bytes 
written to the ECP AFIFO location are placed in the ECP FIFO and transmitted 
automatically to the peripheral using ECP protocol. In reverse direction bytes are 

- transferred from PD[7:0] to the ECP FIFO. 


- 100 Reserved 
101 Reserved 


110 Test Mode. In this mode, the FIFO may be written and read, but the data will not be 
transmitted on PD[7:0]. 7 


111 Configuration Mode. In this mode, the ECP Configuration A and B Registers are 
accessible. 


ECP Mode Switching Guidelines 


Software will execute P1234 negotiations and all operation prior to a data transfer phase under 
programmed |/O (using mode 000 or 001). Hardware provides an automatic control line handshake, 
moving data between the FIFO and the ECP port only in the data transfer phase (using modes 011 
or 010). | 


Setting the mode to 011 or 010 causes the hardware to initiate the data transfer. 


If the parallel port is in mode 000 or 001, the port can be switched to any other mode. If the parailel . 
port is not in mode 000 or 001, the port can only be switched into mode 000 or 001. The direction 
and the FIFO threshold can only be changed in modes 000 or 001. Note that the FIFO, FIFO Error, 
and TC conditions are also reset when the mode is switched to 000 or 001. 


Once in an extended forward mode, the software should wait for the FIFO to be empty before 
switching back to mode 000 or 001. In this case, all control signals are negated before the mode 
switch. In an ECP reverse mode the software waits for ail the data to be read from the FIFO before 
changing to mode 000 or 001. 
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ERROR INTERRUPT DISABLE (ERRINTREN): This bit enables error interrupts to the host. In ECP 
Mode, When ERRINTREN = 1, interrupts are disabled. When ERRINTREN = 0, interrupts are 
enabled. When enabled and a high-to-low transition occurs on the FAULT # signal (FAULT # 
asserted), an interrupt is generated to the host. Note that if this bit is written from a 1 to a 0 while 
FAULT # is asserted, an interrupt is generated to the host. 


DMA ENABLE (DMAEN): This bit enables/disables DMA. ‘When DMAEN = 1, DMA is enabled and 

the host uses PPDREQ, PPDACK, and TC to transfer data. When DMAEN = 0, DMA is disabled and 
the PPDREQ output is tri-stated. In this case, programmed I/O is used to transfer data between the 
host and the 82091AA FIFO. The Service Interrupt (bit 2) needs to be set to 0 to allow generation of 
a TC interrupt. This bit must be written to 0 to reset the TC interrupt. 


SERVICE INTERRUPT (SERVICEINTR): This bit enables FIFO and TC service interrupts. When the 
CPU writes SERVICEINTR = 1, FIFO request interrupts, FIFO error interrupts, and TC interrupts are 
disabled. Setting this bit to a 0 enables interrupts for one of the four cases listed below. When 
enabled (set to 0) and one of the four conditions below occurs, the 82091AA sets SERVICEINTR to 
a 1 and generates an interrupt to the host. 


1. During DMA operations (DMAEN = 1), when terminal count is reached (TC asserted). To clear the 
TC interrupt, switch to |SA-Compatible or PS/2-Compatible mode (write ECR[7:5] to 000, 001) or 
set DMAEN to 0. 


2. In the forward direction and DMAEN =0, when there is a threshold number of bytes in the FIFO to 
be written. 

3. In the reverse direction and DMAEN = 0, when there is a threshold number of bytes in the FIFO to 
be read. 

4. In either DMA or programmed I/O mode when there is a FIFO overrun or underrun. 

Reading the SERVICEINTR bit indicates the presence of an active interrupt when this bit has been 


written to a O prior to reading it back. To disable interrupts, the SERVICEINTR bit must be explicitly 
written to a 1. 


NOTE: 


The ACK # and FAULT # interrupts can be generated independent of the value of the 
SERVICEINTR bit. ACK # and FAULT # interrupts are enabled via the ACKINTREN bit in the PCON 
Register and the ERRINTREN bit in the ECR Registers, respectively. The parallel port IRQ output 
(IRQ5/IRQ7) is enabled when ACKINTREN = 1 in the PCON Register or when ECR[7:5] = 010, 011, 
or 110. Otherwise, the IRQ output is tri-stated. 


FIFO FULL STATUS (FIFOFS): This bit indicates when the FIFO is full. When FIFOFS = 1 (and 
FIFOES = 0), the FIFO is full and cannot accept another byte of data. When FIFOFS = 0, at least one 
byte location is free in the FIFO. This bit is read only and writes have no affect. When a FIFO overrun 
or underrun occurs, the 82091AA sets both FIFOES and FIFOFS to 1. To clear the FIFO error 
condition interrupt, swiitch the parallel port mode from ECP (011) to either ISA-Compatible or PS/2- 
Compatible modes (000 or 001). 


FIFO EMPTY STATUS (FIFOES): This bit indicates when the FIFO is empty. When FIFOES = 1 (and 
FIFOFS = 0), the FIFO is empty. When FIFOES = 0, the FIFO contains at least one byte. This bit is 
read only and writes have no affect. When a FIFO overrun or underrun occurs, the 82091AA sets 
both FIFOES and FIFOFS to 1. To clear the FIFO error condition interrupt, swiitch the parallel port 
mode from ECP (011) to either ISA-Compatible or PS/2-Compatible modes (000 or 001). 
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6.2 Parallel Port Operations © | 


The parallel port can be placed in ISA-Compatible, 
PS/2-Compatible, or EPP mode by hardware config- 
uration or by writing to the 82091AA’s configuration 
registers (PCFG1 Register). If access to the configu- 
ration registers is not disabled by hardware configu- 
ration, a hardware configured parallel port mode can 
be changed by programming the PCFG1 Register. . 


_ ECP mode is entered by programming the ECP Ex- 
tended Control Register (ECR). Writing to this regis- 
ter changes any previously selected parallel port 
mode (via hardware configuration or writing the 
PCFG1 Register) to one of the ECP ECR Register 
modes selected via ECR[7:5]. Note that ECP mode 
cannot be entered by hardware configuration or pro- 
gramming the 82091AA configuration registers. 


_ 
intel. 
6.2.1 ISA-COMPATIBLE AND PS/2- 
COMPATIBLE MODES 


_ The ISA-Compatible mode is used for standard ISA- 


type parallel port interfaces. Figure 39 shows the 
parallel port interface for ISA-Compatible mode. 
STROBE#, AUTOFD#, INIT#, and SELECTIN# 
are controlled by software via the PCON Register 
and the status of SELECT, PERROR, FAULT, 
ACK#, and BUSY are reported in the PSTAT Regis- 
ter. PD[7:0] are outputs only. Note that for a reverse 
data transfer using the Nibble protocol, the peripher- 
al device supplies data, 4 bits at a time, using the 
BUSY, SELECT, PERROR, and FAULT # signals. 


STROBE# 


AUTOFD# 


Parallel 


Port 
FAULT# 


SELECTIN# 


AIP SELECT 


Paraliel 


PERROR Port 


Connector 


BUSY 
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Figure 39. ISA-Compatible Mode 
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The following general protocol is used when com- 


municating with a printer or other parallel port de- 
vice. 


Software selects the peripheral device by asserting 
the SELECTIN# signal. The peripheral device, in 
turn, acknowledges that it is selected by asserting 
the SELECT signal. The INIT # signal is used to ini- 
tialize the peripheral device. If an error is encoun- 
tered during initialization or normal operations, the 
peripheral device asserts FAULT#. If a printer (or 
plotter) encounters problems in the paper path, the 
device asserts PERROR. Other peripheral devices 
may not use the PERROR signal. 


During normal operation, the peripheral device as- 
serts BUSY when it is not ready to receive data. 
When it has finished processing the previous data, 
the peripheral device asserts ACK# and negates 
BUSY. If interrupts are enabled, a low-to-high tran- 
sition on ACK# when the signal is negated gener- 
ates an interrupt. If interrupts are not enabled, soft- 
ware must poll the PSTAT Register to determine 
when ACK # is pulsed. 


82091AA 


The parallel port driver software sends data to the 
peripheral device by writing to the PDATA Register 
and asserting the STROBE# signal after an appro- 
priate data stabilization interval. After a sufficient 
setup time has elapsed, software then negates 
STROBE #. Valid data is read by the peripheral de- 
vice. 


In the PS/2-Compatible mode, data can be written 
to or read from the parallel port. Figure 40 shows the 
parallel port interface for PS/2-Compatible mode. 
The Byte protocol signal names are shown in paren- 
thesis. Before reading or writing the PDATA Regis- 
ter, the direction control bit in the PCON Register 
must be set to the proper transfer direction on 
PD[7:0]. During a write to the PDATA Register (with 
DIR # =0), data is latched by the PDATA Register 
and driven onto PD[7:0]. During a parallel port read 
of the PDATA Register (with DIR # = 1), the data on 
PD[7:0] is driven onto SD[7:0]. The data is not 
latched by the PDATA Register during the read cy- 
cle. 


STROBE# (HostCik) 
AUTOFD# (HostBusy) 
| INIT# (Init#) 
SELECTIN# (Selectin#) 


AIP 
Parallel 


Port _FAULT# (DataAvall#) 


SELECT (Xflag 
PERROR (AckDataReq) Port 


Parallel 


Connector 


BUSY (PtrBusy) 
_ PDI7:0} | 
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Figure 40. PS/2-Compatible Mode 
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6.2.2 EPP MODE 


The 82091AA is EPP 1.7 compliant. This means EPP 
cycles will begin with WAIT (Busy) inactive, however, 
WAIT will still prolong the cycle when active. Figure 
41 shows the parallel port interface for EPP mode. 
The EPP parallel port interface protocol signal 
names are shown in parenthesis. In EPP mode, the 
initialization, printer selection, and error signals 
(PERROR and FAULT#) work the same way as in 
the ISA-Compatible mode. However, in EPP mode, 
SELECTIN# and AUTOFD# are automatically gen- 
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erated and become Address Strobe (AStrb#) and 

Data Strobe (DStrb#), respectively, enabling direct 
access to parallel port devices. STROBE (Write #) is 
used to indicate a read or write cycle. Note that 
BUSY (Wait) is an active low signal in EPP mode 
rather than an active high signal as in ISA-Compati- 
ble mode. In addition, BUSY, in combination with 
IOCHRDY on the ISA Bus extends clock cycles to 
enable the completion of a read or write without ad- 


_ ditional wait states. EPP write and read cycles are 


shown in Figure 42 and Figure 43. 


STROBE# (Write#) 


AUTOFD# (DStrb#) 
SELECTIN# (AStrb#) 


AIP _ SELECT (Xflag) 


Parallel 


Parallel PERROR (AckDataReq) Port 
eon FAULT# (DataAvail#) onnecter 
| BUSY (Wait) | 
PD[7:0] 


Figure 41. EPP Mode 
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S$D[7:0| — Valid Data 


lOWC# 


i 


. STROBE# 
(write#) 


SELECTIN#/AUTOFDi# - 
(AStrb#/DStrb#) 


PD[7:0] 


| ValidData | \ 


BUSY 
(wait) 


lIOCHRDY 


290486-42 


Figure 42. EPP Mode Write Cycle 
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IORC# ) y 


STROBE# 
(write#) 


SELECTIN#/AUTOFD# 
(AStrb#/DStrb#) 


PD[7:0] ) a "Valid Data 


v 


BUSY 
(wait) =) 


lIOCHRDY 
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Figure 43. EPP Mode Read Cycle 
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6.2.3 ECP MODE 


Figure 44 shows the parallel port interface for ECP 
mode with the ECP protocol signal:names in paren- 
thesis. The ECP modes are selected by program- 
ming the Extended Control Register (ECR bits[7:5]). 
Two of the modes (Test and Configuration) provide 
information about the 82091AA’s parallel port and 
are not used for interfacing with a peripheral device. 
Four peripheral interface modes are selectable via 
the ECR—ISA-Compatible mode, ISA-Compatible 
FIFO. mode, PS/2-Compatible mode, and ECP 
mode. 


intel. 


ISA-Compatible and PS/2-Compatible Modes 
(ECR[7:5] = 000,001) 


The ISA-Compatible and PS/2-Compatible mode se- 


lections in the ECR are equivalent to selecting these 


modes via hardware configuration or programming 
the PCFG1 Register. For these modes the operation 
is the same as described in Section 6.2.1, ISA-Com- 
patible and PS/2-Compatible Modes. | 


STROBE# (HostClk) 


AUTOFD# (HostAck) 
INIT# (ReverseRequest#) 
SELECTIN# 


SELECT (Xflag) 


Parallel 


PERROR (AckReverse#) _ Port 
FAULT# (PeriphRequest#) Connector 


ACK# (PeriphClk#) 
BUSY (PeriphAck) 


(optional) 
PPDIR 
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Figure 44. ECP Mode 
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ISA-Compatible FIFO Mode (ECR[7:5] = 010) Second, the data is transferred to the peripheral us- 
ing an automatic hardware handshake. This hand- 
The |SA-Compatible FIFO mode uses the same sig- shake emulates the standard ISA-Compatible style 


naling protocol on the parallel port interface as the software generated handshake (Figure 45). For ISA- 
ISA-Compatible mode. However, there are two ma- Compatible FIFO mode, the 82091AA does not 
jor operational differences. First, data is written to a monitor the ACK # signal. Service interrupts are en- 
16-byte FIFO (via the SDFIFO address location). abled and reported via the ECR. The generation of 
The FIFO empty and FIFO full bits in the ECR pro- service interrupts is based on the state of the FIFO 
vide FIFO status. In addition, DMA can be used to and not individual transfers (using ACK #) as is the 
transfer data to the FIFO by enabling this feature in case in standard ISA-Compatible mode. | 

the ECR. 


STROBE# 


BUSY / \ 


Figure 45. ISA-Compatible Timing 
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ECP Mode (ECRI7:5] = 011) 


When ECR[7:5] =011, the parallel port operates in 


ECP mode. In ECP mode, both data and commands 


(addresses and RLE) are transferred using the paral- 
lel port 16-byte FIFO. This information can be either 
written to or read from the FIFO using DMA or non- 
DMA ISA Bus transfers. The parallel port interface 
transfers use an automatic handshake generated by 
the 82091AA. The host controls the transfer direc- 
tion by programming the DIR # bit in the PCON Reg- 
ister. 


When the host is writing to the peripheral device 
(forward direction), STROBE#, and BUSY provide 
the automatic handshake for transfer on the parallel 
port interface (Figure 46). The peripheral device ne- 
gates BUSY when it is ready to receive data or com- 
mands. AUTOFD ¥# indicates whether PD[7:0] con- 
tain data (AUTOFD# is high) or a command (AU- 
TOFD# is low). For commands (address or RLE), 
the host writes to the ECPAFIFO Register I/O ad- 
dress and for data, the host writes to the DFIFO 
Register |/O address. The addresses and data are 
placed in the same 16-byte FIFO. When the FIFO is 
full and cannot accept more data/addresses, the 


- FIFO Full status bit is set in the ECR. 


PD[7:0], 
AUTOFD# 


STROBE# 


BUSY 
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- Data/addresses written to the FIFO are transferred 


to the peripheral device via PD[7:0]. To begin a 
transfer on the peripheral interface, the 82091AA 
checks BUSY to make sure the peripheral is in the 
ready state. If BUSY is negated, the 82091AA drives 
PD[7:0] and AUTOFD#, and asserts STROBE # to 
indicate that the data/command is on PD[7:0]. The 
peripheral device asserts BUSY to indicate that it is 
receiving the data/command. BUSY asserted caus- 
es the 82091AA to negate STROBE #. 


When the host is reading from the peripheral device 
(reverse direction), AUTOFD# and ACK# provide 
the automatic handshake for transfer on the parallel 
port interface (Figure 47). Data/commands from the 
peripheral device are placed in the parallel port FIFO 
using this handshake. In this case, BUSY indicates 
whether PD[7:0] contain data ele" is high) or a 
command (BUSY is low). 


The peripheral device asserts ACK # to indicate that 
a data/command is on PD[7:0]. The 82091AA ne- 
gates AUTOFD# when it is ready for a peripheral | 
transfer and asserts AUTOFD # to indicate that it is 
receiving the data/command. AUTOFD#. asserted 
causes the peripheral device to negate ACK#. The 
peripheral transfers are to the parallel port 16-byte 
FIFO. 
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Figure 46. ECP Mode Handshake (Forward Direction) 


“pusy XX 
BUSY 


ACK# \ / 
AUTOFD# / \ 
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Figure 47. ECP Mode Handshake (Reverse Direction) 
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Test Mode :(ECR[7:5]=110) and Configuration 
Mode (ECR7:5] = 111) 


The test mode can be. used to check the FIFO read 
and write interrupt thresholds as described in Sec- 
tion 6.1.3.7, TFIFO—ECP Test FIFO Register. Note 
that for the 82091AA parallel port, the read and write 
FIFO interrupt thresholds are the same. The FIFO 
threshold is set by programming the PCFG1 Regis- 
ter in the 82091AA configuration space. The config- 
uration mode is used to access the ECPCFGA and 
ECPCFGB Registers. This mode must first be set 
before the ECPCFGA and ECPCFGB Registers can 
be accessed. 


6.2.3.1 FIFO Operations 


The parallel port FIFO is used for ECP transfers 
(ECR[7:5] =011), ISA-Compatible FIFO transfers 
(ECR[7:5] =010), and Test mode (ECR[7:5] = 110). 
Either DMA or programmed |/O can be used for 
transfers between the host and the parallel port. 


The FIFO threshold value is selected via the 
82091AA configuration registers (PCFG1 Register). 
The threshold is set to either 1 (forward)/15 (re- 
verse) or 8 in both directions. A threshold setting of 
1 (forward)/15 (reverse) results in longer periods of 
time between service request, but requires faster 
servicing of both read and write requests. A thresh- 
old setting of 8 results in more service requests, but 
tolerates slower servicing of the requests. 


In modes 010 and 011, an internal temporary hold- 
ing register is used in conjunction with the 16-byte 
FIFO to provide 17 bytes of storage for both forward 
and reverse transfers. Thus, in the forward direction 
if the peripheral asserts the BUSY signal during the 
filling of the FIFO, the host needs to write 17 bytes 
before the FIFO full flag in the ECR is-set to 1. In 
Test mode (110) only the 16-byte FIFO is used and 
the temporary holding register is not used. 


The FIFO is reset by a hard reset (RSTDRV assert- 
ed) or whenever the parallel port is placed in ISA- 
Compatible or PS/2-Compatible modes. Note that 
the FIFO threshold can only be changed when the 
parallel port is in [SA-Compatible or PS/2-Compati- 
ble mode. 


6.2.3.2 DMA Transfers 


The 82091AA contains parallel port DMA request 
(PPDREQ) and acknowledge (PPDACK #) signals to 
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communicate with a standard PC DMA controller. 
Before initiating a DMA transfer the direction bit in 
the PCON Register must be set to the proper direc- 
tion. To initiate DMA transfers, software sets the 
DMAEN bit to 1 and the SERVICEINTR bit to 0 in the 
ECR. The PPDREQ and PPDACK # signals will then 
be used to fill (forward direction) or empty (reverse 
direction) the FIFO. When the DMA controller reach- 
es terminal count and asserts the TC signal, an inter- 
rupt is generated and the SERVICEINTR bit is set to 
1. To reset the TC interrupt, software can either 
switch the mode to 000 or 001 or write the DMAEN 
bit to 0. 


In DMA mode, if 32 consecutive reads or writes are 
performed to the FIFO and PPDREQ is still asserted, 
the 82091AA negates PPDREQ for the length of the 
last PPDACK #/command pulse to force an arbitra- 
tion cycle on the ISA Bus. 


6.2.3.3 Reset FIFO and DMA Terminal Count 
Interrupt 


The following operations are used to reset the paral- 
lel port FIFO and TC interrupt 


Function Reset Operations 

FIFO -Changing to modes 000 or 001 
-Hard reset | 

FIFO Error § -Changing to modes 000 or 001 
-Hard reset 


TC Interrupt -Changing to modes 000 or 001 
-Setting DMAEN to 0 in ECR 


-Hard reset 


6.2.3.4 Programmed I/O Transfers — 


Programmed I/O (non-DMA) can also be used for 
transfers between the host and the parallel port 
FIFO. Software can determine the read/write FIFO 
thresholds and the FIFO depth by accessing the 
FIFO in Test mode. To use programmed I/O trans- 
fers software sets the direction bit in the PCON Reg- 
ister to the desired direction and sets the DMAEN bit 
to 0 and the SERVICEINTR bit to 0 in the ECR. The 
parallel port requests programmed I/O transfers 
from the host by asserting IRQ5/IRQ7. 


In the reverse direction an interrupt occurs when 
SERVICEINTR=0 either 8 or 15 bytes (depending 
on threshoid setting) are in the FIFO. IRQ5/IRQ7 
can be used in an interrupt-driven system. The host 
must respond to the interrupt request by reading 
data from the FIFO. 
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In the forward direction an interrupt occurs when 


SERVICEINTR=0 and there are either 8 or 1 byte 
locations available in the FIFO (depending on 
threshold setting). IRQ5/IRQ7 can be used in an in- 
terrupt-driven system. The host must respond to the 
interrupt request by writing data to the FIFO. 


6.2.3.5 Data Compression 


The 82091AA supports Run Length Encoded (RLE) 
decompression in hardware and can transfer com- 
pressed data to the peripheral. To transfer com- 
pressed data to the peripheral (forward direction), 
the compression count is written to the ECPAFIFO 
location and the data is written to the ECPDFIFO 
location. The most significant bit (bit 7) in the byte 
written to the ECPAFIFO Register informs the pe- 
ripheral whether the value is a channe! address (bit 
7=1) or a run length count (bit 7=0). The RLE 
count in the ECPAFIFO (bits[6:0]) informs the pe- 
_ ripheral of how many times the data in the 

ECPDFIFO is to be repeated. An RLE count of 0 
indicates that only one byte of the data is present 
and a count of 127 indicates to the peripheral that 
the next byte should be expanded 128 times. An 
RLE count of 1 should be avoided as it will cause 
unnecessary expansions. Note that the 82091AA 
asserts AUTOFD ¥ to indicate that PD[7:0] contains 
address/RLE instead of data. 


In the reverse direction, the peripheral negates the 
BUSY signal to indicate that PD[7:0] contains ad- 
dress/RLE. During an address/RLE cycle, the 
82091AA checks bit 7 to see if the next byte re- 
ceived needs to be decompressed. If bit 7 is 0, the 


Table 18. Parallel Port Summary 
Parallel Port 


Control Signals IRQ Enable DMA Enabie 
Controlled By PCON 


[Output | Open Drain | ACKINTEN | 
on 
[Output | Push Pull____| Always Enabled 


1. The selected IRQ pin (IRQ5/IRQ7) is enabled if ACKINT 


PD[7:0] 


Parallel Port 
Mode 


000 
001 
Epp NYA 
010 


ECR[7:5] 


Direction 


_ | ECP Test 110 
ECP Configuration 111 
NOTES: 


tri-stated. — 


a 

intel. 
82091AA decompresses (replicates) the next data 
received by the RLE count received on bits[6:0]. ~ 


6.2.4 PARALLEL PORT EXTERNAL BUFFER | 
CONTROL 


A multi-function signal (GCS#/PPDIR) is provided 
for controlling optional external parallel port data 
buffers. The PPDIR function is only available when 
the 82091AA configuration is in software mother- 
board (SWMB) mode. In this mode, this signal oper- 
ates as a parallel port direction control signal 
(PPDIR). Note that, if any other configuration is used 
(SWAI, HWB, or HWE configuration modes), this 
multi-function signal operates as a game port chip 
select (GCS#). In SWMB, PPDIR is low when 
PD[7:0] are outputs and high when PD[7:0] are in- 
puts. Figure 44 shows an example of external buff- 
ers being used when the parallel port is in ECP 
mode. 


External buffering affects the ability of the port to 
read software security devices. Typically these soft- 
ware secutiry devices are designed to hold the data 
pins of the parallel port connector at either high or 
low logic levels when the pins are not being driven 
by the parallel port. The bit pattern read from the 
parallel port by the security software may not be cor- 
rectly transferred through the external buffer. 


6.2.5 PARALLEL PORT SUMMARY 


Table 18 summarizes the parallel port interrupt, 
DMA, and parallel port signal drive type for the vari- 
ous modes of operation. : 


DMAEN 
DMAEN 
DMAEN 
DMAEN 


EN is enabled in the PCON Register. Otherwise, the IRQ pin is 


2. PPDREQ is enabled whenever the DMAEN bit is enabled in the ECR, independent of the parallel port mode. 


5S 
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7.0 SERIAL PORT 


The two 82091AA serial ports are identical. This 
section describes the serial port registers and FIFO 
operations. 


7.1 Register Description 


The register descriptions in this section apply to both 
serial port A and serial port B and provide a com- 
plete operational description of the serial ports. Ta- 
ble 19 shows the I/O address assignments for the 
serial port registers. The individual register descrip- 
tions follow in the order that they appear in the table. 
Note that serial port interrupt assignments (IRQ3 or 
IRQ4) and the base address assignments are made 
by 82091AA configuration as described in Section 
4.0, AIP Configuration. 


All registers are accessed as byte quantities. The 
base address is determined by hardware configura- 
tion at powerup (or a hard reset) or via software con- 
figuration by programming the 82091AA configura- 
tion registers as described in Section 4.0, AIP Con- 
figuration. Note that access to certain serial port reg- 
isters requires prior programming of the DLAB bit in 
the Line Control Register (LCR). 


82091AA 


During a hard reset (RSTDRV asserted), the 


82091AA registers are set to pre-determined de- 
fault states. The default values are indicated in the 
individual register descriptions. Reserved bits in the 
82091AA’s serial port registers must be _ pro- 
grammed to 0 when writing the register and these 
bits are 0 when read. The following bit notation is 
used for default settings: 


X Default bit position value is determined by 
conditions on an 82091AA signal pin. 


The following nomenclature is used for serial port 
register access attributes: 


RO Read Only. Note that for all registers with 


read only attributes, writes to the I/O address 
access a different register. 


WO Write Only. Note that for all registers with 
write only attributes, reads to the |/O address 
access a different register. 


R/W Read/Write. A register with this attribute can 
be read and written. Note that some read/ 
write registers contain bits that are read only. 


Table 19. Serial Port Registers 


Register Address 
Access (AEN = 0) 


: ADVANGE INFORMATION 


Abbreviation Register Name 


a FIFO Control Register 


[TranemitHoting Register| WO 
[Receiver ButerRegisir | RO 
[Owsortaentss | a 
eee Tae 
[wo 
Aw 
aw 
aw 
Rw 
aw 
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Table 20. Serial Port Register Summary 
Receiver | Transmitter Interrupt Interrupt 


Buffer Holding _ Enable identification. 


FIFO Control Line Control 
Register Register 


Register Register Register Register 


Data Bit 0 | Data Bit 0 Enable Received | 0 if Interrupt FIFO Enable | Word Length 
Data Available Pending Select Bit 0 
Interrupt 
Data Bit 1 | Data Bit 1 Enable XMTR Interrupt ID Bit | RCVRFIFO | WordLength 
, Holding Register Reset Select Bit 1 
Empty Interrupt 
Data Bit 2 | Data Bit 2 Enable RCVR ~— | Interrupt!ID Bit | XMIT FIFO Number of Stop 
Line Status Reset Bits - 
Interrupt : 
Data Bit 3 | Data Bit 3 Enable Modem Interrupt ID Bit | DMA Mode Parity Enable 
Status a aoe (Non-FIFO = aia Se Select 


tte Bit 4 | Data Bit 4 Event Parity Select 
| 5 _| Data Bit 5 | Data Bit 5 ee eee See oe Stick Parity 


Data Bit 6 | Data Bit 6 FIFOs Enabled | RCVR Trigger } Set Broan 
(Non-FIFO=0) } (LSB) 
Data Bit 7 | Data Bit 7 FIFOs Enabled | RCVR Trigger | Divisor etch 
? (Non-FIFO=0) | (MSB) Access Bit (DLAB) 


Table 20. Serial Port Register Summary (Continued) 


Modem Line Status Modem ScratchPad | Divisor Latch | Divisor Latch 
Control Register Status Register 
Register g Register 9 


Data Ready Delta Clear to Bit 0 Bit 8 

(DR) Send | 

Overrun Error Delta Data Set Bit 1 Bit 1 Bit 9 

(OE) Ready 

Parity Error Trailing Edge Bit 2 Bit 2 Bit 10 

(PE) | Ring Indicator 

Framing Error Delta Data Bit 3 Bit 3 Bit 11 
Carrier Detect 

Break Interrupt | Clear to Send Bit 4 Bit 4 Bit 12 
(CTS) 

Transmitter Data Set Bit 5 Bit 13 

Holding Register | Ready (DSR) 

(THRE) 

Transmitter Ring Indicator Bit 6 Bit 6 Bit 14 

Empty (TEMT) (Rl) 

Error in RCVR Data Carrier Bit 7 Bit 7 Bit 15 
Detect (DCD) 


Data Terminal 
Ready (DTR) 
Request to 
Send (RTS) 
- Out 1 Bit 


IRQ Enable 
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7.1.1 THR(A,B)—TRANSMITTER HOLDING REGISTER 


1/O Address: Base +0h (DLAB=0) 
Default Value: 00h 

Attribute: Write Only 

Size: 8 bits 


The THR contains data to be transmitted out on the SOUT[A,B] signal line. Bit 0 is the least significant bit and 
is the first bit serially transmitted. If the serial word length is less than 8 bits (as selected in the LCR), the data 
word must be written to this register right-justified. Bit positions above the number of bits selected for the word 
size are discarded (not transmitted). 


a 
Transmit Data: Bits[7:0] correspond to SD[7:0]. 


7.1.2 RBR(A,B)—RECEIVER BUFFER REGISTER 


|/O Address: Base +0h (DLAB=0) 
Default Value: 00h 

Attribute: Read Only 

Size: 8 bits 


The RRB contains data received from the SIN[A,B] signal line. Bit 0 is the least significant bit and is the first bit 
serially received. If the serial word length is less than 8 bits (as selected in the LCR), the data word in this 
register is right-justified. Bit positions above the number of bits selected for the word size are 0. 


ree] SSOSC~™~SCSCSSSCription 
Receiver Data: Bits[7:0] correspond to SD[7:0]. 


7.1.3 DLL(A,B), DLM(A,B)—DIVISOR LATCHES (LSB AND MSB) REGISTERS 


|/O Address: Base +0h,1h (DLAB= 1) 
Default Value: ooh 

Attribute: Read/Write 

Size: 8 bits 


The 82091AA contains two independently programmable baud rate generators. The 24 MHz crystal oscillator 
frequency input is divided by 13, resulting in a. frequency of 1.8462 MHz. This frequency is the input to each 
baud rate generator and is divided by the divisor of the associated serial port. The output frequency of the 
baud rate generator (BOUT[A,B]) is 16 x the baud rate. 


divisor # = (frequency input)/(baud rate x 16) 


The output of each baud rate generator drives the transmitter and receiver sections of the associated serial 
port. Two 8-bit latches per serial port store the divisor in a 16-bit binary format. These divisor latches must be 
loaded during initialization to ensure proper operation of the baud rate generator. Upon loading either of the 
divisor latches, a 16-bit baud counter is loaded. Table 21 provides decimal divisors to use with crystal frequen- 
cies of 24 MHz. Using a divisor of zero is not recommended. 
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Bit 


Divisor Latch MSB (Most Significant Byte) | Base + 1h (DLAB=1) 


Divisor Latch LSB (Least Significant Byte) | Base + 0h (DLAB=1) 


16-Bit Baud Counter (R/W) 
( 290486-48 


Figure 48. Divisor Latches (LSB and MSB) Registers 


. Description 
0 | Divisor Latch Data: Bits[7:0] correspond to SD[7:0]. 


Table 21. AIP Serial Port A and B Divisors, Baud Rates, and Clock Frequencies 


|| 24 MHz Input Divided to 1.8461 MHz 

Decimal Divisor for 16x Clock 
: san 
: a ee 
r eee 
12 Ne io ce 
ae ses 
— ae on 
2 
aa 
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7.1.4 IER(A,B)—INTERRUPT ENABLE REGISTER 


1/O Address: Base + 1h (DLAB=0) 
Default Value: 00h 

Attribute: Read/Write 

Size: 8 bits 


This register enables/disables interrupts for five types of serial port conditions. If a particular condition occurs 
whose interrupt is disabled in this register, the corresponding interrupt status bit in the IIR will not be set and 
an interrupt request (IRQ3 or IRQ4) will not be generated. 


0 Bit 
Default 


Receiver Data Available and 
Timeout Interrupts Enable (R/W) 
1=Enable 

0=Disable 


Transmitter Holding Register Empty 


interrupt Enable (R/W) 
1=Enable 
0=Disable 


Receiver Line Status Interrupt Enable (R/W) 
1=Enable 
0=Disable 


Modem Status Interrupt Enable (R/W) 
1=Enable 
0=Disable 


Reserved 
290486-49 


Figure 49. Interrupt Enable Register 


[) "aan, 
RESERVED | 


3 MODEM INTERRUPT ENABLE (MIE): When MIE= 1, the Modem Status Interrupt is enabled. When 
MIE = 0, the Modem Status Interrupt is disabled. 


RECEIVER INTERRUPT ENABLE (RIE): When RIE = 1, the Receiver Line Status interrupt is 
enabled. When RIE = 0, the receiver line status interrupt is disabled. 


TRANSMITTER HOLDING REGISTER EMPTY INTERRUPT ENABLE (THEIE): When THREIE=1, 
the Transmitter Holding Register Empty Interrupt is enabled. When THREIE = 0, the Transmitter 
Holding Register Empty Interrupt is disabled. 


RECEIVER DATA AVAILABLE INTERRUPT ENABLE AND TIMEOUT INTERRUPT ENABLE IN 
FIFO MODE (RAVIE): When RAVIE = 1, the Received Data Available Interrupt is Enabled. When 

RAVIE =0, the Received Data Available Interrupt is disabled. In addition, in the FIFO Mode, this bit 
enables the Timeout Interrupt when set to 1 and disables the Timeout Interrupt when set to 0. 
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7.1.5 1IR(A,B)}—INTERRUPT IDENTIFICATION REGISTER 


\/O Address: Base +2h 
Default Value: Oth 
Attribute: Read Only 
Size: 8 bits 


This register provides interrupt status and indicates whether the serial port receive/transmit FIFOs are en- 
abled (FIFO mode) or disabled (non-FIFO mode). In order to provide minimum software overhead during data 
character transfers, the serial port prioritizes interrupts into four levels ‘and records these in the Interrupt 
Identification Register. The four levels of interrupt conditions in order of priority are Receiver Line Status; 
Received Data Ready; Transmitter Holding Register Empty; and Modem Status. When the CPU accesses the 
IIR, the serial port freezes all interrupts and indicates the highest priority pending interrupt to the CPU. While 
this CPU access is occurring, the serial port records new interrupts, but does not change its current indication 
until the current access is complete. 


Default 


Interrupt Pending Status (RO) 
1=No Interrupt Pending 
O=Interrupt Pending 


Highest Priority Pending Interrupt (RO) 
see Text 


Timeout Interrupt Pending (RO) 
(FIFO Mode only) 

1=Pending 

0=Not Pending 


— Reserved 


FIFO Enable Status (RO) 
11=XMIT/RCVR FIFOs Enabled 


- 00=XMIT/RCVR FIFOs Disabled (non-FIFO mode) 
a 290486-50 


Figure 50. Interrupt Identification Register 
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FIFO MODE ENABLE STATUS (FIFOES): This status field indicates whether the serial port is in 
FIFO mode or non-FIFO mode (FIFO/non-FIFO mode is selected via the FCR). When FIFOES = 11, 
the serial port is in FIFO mode (FIFOs enabled). When FIFOES=00, the serial port is in non-FIFO 
mode (FIFOs disabled). The 82091AA never sets this field to either =01 or 10. 


ESERVED 


TIMEOUT INTERRUPT PENDING (TOUTIP)—FIFO MODE ONLY: In the non-FIFO mode, this bit is 
0. In FIFO mode TOUTIP is set to 1 when no characters have been removed from or input to the 


receive FIFO during the last 4 character times and there is at least 1 character in the FIFO during 
this time. When a timeout interrupt is pending, the 82091AA sets this bit along with bit 2 of this 
register. 


HIGHEST PRIORITY INTERRUPT INDICATOR: This field identifies the highest priority interrupt 
pending as indicated in Table 22. 


INTERRUPT PENDING STATUS (IPS): This bit can be used in an interrupt environment to indicate 
whether an interrupt condition is pending. When IPS =0, an interrupt is pending and the IIR contents 
may be used as a pointer to the appropriate interrupt service routine. When IPS = 1, no interrupt i Is 
pending. 


Table 22. Interrupt Priority 


Interrupt 
identification interrupt Set and Reset Functions 
Register 


Priority Interrupt interrupt Reset 


— fore. Line one Error, Parity aa the Line 
Status Error, Framing Error, or | Status Register 
Break Interrupt | 
1 Second | Received Receiver Data Read Receiver Buffer 
Data Available | Available 


1 1 Second | Character No Characters Have Reading the Receiver 
Timeout Been Removed from or | Buffer Register 
Indication Input to the RCVR 
FIFO during the Last 4 | 
Char. Times and there ae 
is at least 1 Char. in it : 
during this time | 
1 i Transmitter Transmitter Holding Reading the IIR 
Holding Register Empty Register (if Source or 
Register Interrupt) or Writing the 
Empty Transmitter Holding 
Register 
Fourth | Modem Status | Clear to Send or Data | Reading the Modem 
Set Ready or Ring Status Register 
Indicator or Data 
Carrier Detect. 
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_ 7.1.6 FCR(A,B)—FIFO CONTROL REGISTER 


1/O Address: ‘Base +2h 


Default Value: 00h 
Attribute: Write Only 


Size: 8 bits 


FCR is a write only register that is located at the same address as the IIR (the IIR is a read only register). FCR 
enables/disables the transmit/receive FIFOs, clears the transmit/receive FIFOs, and sets the receive FIFO 
trigger level. 


Default 


Transmit and Receive FIFOs Enable (WO) 
1=Both FIFOs Enabled 
0=Both FIFOs Disabled 


Receive FIFO Reset (WO) 
(self-clearing) 

1=Clears All Bytes 

0=No Reset 


Transmit FIFO Reset (WO) 
(self-clearing) 

1=Clears All Bytes 

O=No Reset 


Not Used 
Fleserved 


Receive Interrupt Trigger Level (WO) 
00=1 Byte 

01=4 Bytes 

10=8 Bytes 

11=14 Bytes 


290486-51 


Figure 51. FIFO Control Register 
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INTERRUPT TRIGGER LEVEL (ITL): The ITL field indicates the interrupt trigger level. When the 
number of bytes in the receive FIFO equals the interrupt trigger level programmed into this field and 
the Received Data Available Interrupt enabled (via the IER), an interrupt will be generated and the 
appropriate bits set in the IIR. 


Bits [7:6] Trigger Level (Bytes) 
00 01 (default) 
01 04 
: 


rs | mesemvew 


NOT USED: Writing to this bit causes no change in serial port operations. The serial port does not 
support DMA operations. Note that the TXRDY # and RXRDY # pins are not available in the 
B2091AA. 


RESET TRANSMITTER FIFO (RESETTF): When RESETTF is set to a 1, the FIFO counter is set to 
0. The shift register is not cleared. When the FIFO is cleared, the 82091AA sets this bit to 0. 


RESET RECEIVER FIFO (RESETRF): When RESETRF is set to a 1, the FIFO counter is set to 0. 
The shift register is not cleared. When the FIFO is cleared, the 82091AA sets this bit to 0. 


TRANSMIT AND RECEIVE FIFO ENABLE (TRFIFOE): TRFIFOE enables/disables the transmit 
and receive FIFOs. When TRFIFOE = 1, both FIFOs are enabled (FIFO Mode). When TRFIFOE= 0, 
the FIFOs are both disabled (non-FIFO MODE). Writing a 0 to this bit clears all bytes in both FIFOs. 
When changing from FIFO mode to non-FIFO mode and vice versa, data is automatically cleared 
from the FIFOs. This bit must be written with a 1 when other bits in this register are written or the 
other bits will not be programmed. 


ADVANGE INFORMATION — 4-105 


“e2001AA intel. 


7.1.7 LCR(A,B)—LINE CONTROL REGISTER 


1/O Address: Base +3h 
Default Value: 00h | 
Attribute: Read/Write 
Size: 8 bits 


This register specifies the format of the asynchronous data communications exchange. LCR also enables/dis- 
ables access to the Baud Rate Generator Divisor latches or the Transmitter Data Holding Register, Receiver 
Buffer Register, and Interrupt Enable Register. 


Bit 
Default 


Word Length Select (R/W) | 
00=5 Data Bits 
01=6 Data Bits 
10=7 Data Bits 
11=8 Data Bits 


Stop Bit Select (R/W) 
1=1.5 Stop Bits if 5 Data Bits Selected or 

2 Stop Bits if 6, 7, or 8 Data Bits Selected 
0=1 Stop Bit : . 


Parity Enable (R/W) 
1=Enable 
0=Disable 
Even Parity Selected (R/W) 
1=Even Parity 
0=Odd Parity 


0=Disable 


Break Control (R/W) 
1=Enable 
0=Disable 


Divisor |.atch Access Bit (R/W) 
1=Enables Access to Divisor Latches 


0=Enables access to THR, RBR and IER 
290486-52 


Figure 52. Line Control Register 
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DIVISOR LATCH ACCESS BIT (DLAB): DLAB controls access to the Baud Rate Generator Divisor 
Latches (and to the Transmit Holding Register, Receiver Buffer Register and Interrupt Enable 
Register which are located at the same I/O addresses). When DLAB = 1, access to the two Divisor 


Latches is selected and access to the THR, RBR, and IER is disabled. Wnen DLAB=0, access to 
the two Divisor Latches is disabled and access to the THR, RBR, and JER is selected. 


During test mode operations, DLAB must be set to 1 for the BOUT signal to appear on the SOUT 
pin. 

BREAK CONTROL (BRCON): When BRCON = 1, a break condition is transmitted from the 
82091AA serial port to the receiving device. When BRCON = 1, the serial output (SOUT) is forced to 
the ‘spacing’ state (logical 0). BRCON only affects the SOUT signal and has no effect on the 
transmitter logic. Note that this feature permits the CPU to alert a terminal. If the following sequence 
is used, no erroneous characters will be transmitted because of the break. 


1. Wait for the transmitter to be idle (TEMT = 1). 


2. Set break (BRCON = 1) for the appropriate amount of time. If the transmitter will be used to time 
the break duration, then check that TEMT = 1 before clearing the BRCON. 


3. Clear break (BRCON = 0) when normal transmission has to be restored. 


During the break, the transmitter can be used as a character timer to accurately establish the break 
duration by sending characters and monitoring THRE and TEMT. 


STICKY PARITY (STICPAR): STICPAR is the Stick Parity bit. When parity is enabled (PAREN = 1) 
this bit is used in conjunction with EVENPAR to select ‘‘Mark’’ or “Space” Parity. When bits PAREN, 
EVENPAR and STICPAR are 1, the parity bit is transmitted and checked as a 0 (Space Parity). If bits 
PAREN and STICPAR are 1 and EVENPAR is 0, the parity bit is transmitted and checked as a 1 
(Mark Parity). When STICPAR = 0, stick parity is disabled. 


EVEN PARITY SELECT (EVENPAR): EVENPAR selects between even and odd parity. When parity 
is enabled (PAREN = 1) and EVENPAR = 0, an odd number of 1s is transmitted or checked in the 
data word bits and parity bit. When parity is enabled and EVENPAR = 1, an even number of 1s is 
transmitted or checked. 


PARITY ENABLE (PAREN): This bit enables/disables parity generation and checking. When 
PAREN = 1, a parity bit is generated (transmit data) or checked (receive data) between the last data 
bit and stop bit of the serial data. (The Parity bit is used to produce an even or odd number of 1s 


when the data bits and the Parity bit are summed.) When PAREN = 0, parity generation and 
checking is disabled. 


STOP BITS (STOPB): This bit specifies the number of stop bits transmitted with each serial 
character. When STOPB = 0, one stop bit is generated in the transmitted data. When STOPB= 1 
and a 5-bit data length is selected, one and a half stop bits are generated. When STOPB = 1 and 
either a 6-, 7-, or 8-bit data length is selected, two stop bits are generated. The receiver checks the 
first Stop bit only, regardless of the number of Stop bits selected. 


SERIAL DATA BITS (SERIALDB): This field specifies the number of data bits in each transmitted or 
received serial character as follows: 
Bits[1:0] | Data Length 
00 5 Bits - Default 
01 6 Bits 
10 7 Bits 
11 8 Bits 
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7.1.8 MCR(A,B)—MODEM CONTROL REGISTER 


|/O Address: Base + 4h 
Default Value: 00h 
Attribute: ‘Read/Write 
Size: 8 bits — 


This register controls the interface with the modem or data set (or a peripheral device emulating a modem). 


Bit 
| Default 
DTR# Signal Control (R/W) 


1=DTR# Asserted 
0=DTR# Negated 


RTS# Signal Control (R/W) 
1=RTS# Asserted 
0=RTS# Negated 


OUT1 Control (R/W) . 
(AIP Does Not Have An OUT1 Signal) 
See Text For Loopback Operation 


IRQ3 and IRQ4 Enable (R/W) 
See Text for Loopback Operation 
1=Interrupts Enabled 
O=Interrupts Disabled 


Local Loopback Enable (R/W) 
1=Enable 
0=Disable 


Reserved 
290486-53 


Figure 53. Modem Control Register 
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LOOPBACK MODE ENABLE (LME): LME provides a local loopback feature for diagnostic testing of 
the serial port module. When LME = 1, the following occurs: 


. The transmitter Serial Output (SOUT) is set to the Marking (logic 1) state. 
. The receiver Serial Input (SIN) is disconnected. 


. The output of the Transmitter Shift Register is “looped back” (connected) to the Receiver Shift 
Register. 


. The four modem control inputs (DSR#, CTS#, RI and DCD #) are disconnected. 


. The DTRC, RTSC, OUT1C, IE bits in the MCR are internally connected to DSRS, CTSS, RIS, and 
DCDS in MSR, respectively. 

. The modem control output pins are forced to their high (inactive) state. 

. Data that is transmitted is immediately received. 


This feature allows the CPU to verify the transmit and received data paths of the serial port. In the 
loopback mode, the receiver and transmitter interrupts are fully operational. The modem status 
interrupts are fully operational. The modem status interrupts are also operational, but the interrupt 
sources are the lower four bits of MCR instead of the four modem control inputs. Writing a 1 to any 


of these 4 MCR bits (bits[3:0]) causes an interrupt. In Loopback Mode the interrupts are still 
controlled by the Interrupt Enable Register. The IRQ3 and IRQ4 signal pins are tri-stated in the 
loopback mode. 


INTERRUPT ENABLE (IE): When IE = 1, the associated interrupt is enabled (either IRQ3 or IRQ4 as 
selected via the associated serial port configuration register - A or B). ln Local Loopback Mode, this 
bit controls bit 7 of the Modem Status Register. 


OUT1 BIT CONTROL (OUT1C): This bit is the OUT 1 bit. It does not have an output pin associated 
with it. It can be written to and read by the CPU. In Local Loopback Mode, this bit controls bit 6 of the 
Modem Status Register. 


REQUEST TO SEND CONTROL (RTS): This bit controls the Request to Send (RTS #) output. 
When RTSC = 1, the RTS # output is asserted. When RTSC=0, the RTS# output is negated. In 
Local Loopback Mode, this bit controls bit 4 of the Modem Status Register. 


DATA TERMINAL READY CONTROL (DTRC): This bit controls the Data Terminal Ready (DTR #) 
output. When DTRC= 1, the DTR# output is asserted. When DTRC=0, the DTR # output is 
negated. In Local Loopback Mode, this bit controls bit 5 of the Modem Status Register. 

NOTE: 
The DTR# and RTS# outputs of the serial port may be applied to an EIA inverting line driver (such 
as the DS1488) to obtain the proper polarity input at the modem or data set. 


7.1.9 LSR(A,B)—LINE STATUS REGISTER 


1/O Address: Base +5h 
Default Value: 60h 
Attribute: Read/Write 
Size: 8 bits 


This 8-bit register provides data transfer status information to the CPU. Note that the Line Status Register is 
intended for read operations only. Writing to this register is not recommended and could result in unintended 
operations. For this reason, the figure shows these bits as RO (read only). 
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Bit 
Default 


— Receiver Data Ready Status (RO) 
1=Character Ready To Be Read 
=No Character 

Overrun Error Status (RO) 

1=Overrun 

0=No Overrun 
Parity Error Status (RO) 
1=Parity Error 
0=No Parity Error 


Framing Error Status (RO) 


1=Framing Error 
0=No Framing Error 


— Break Interrupt Status (RO) 
1=Received a Break 
O=No Break Received 


Transmitter Holding Register Status (RO) 
1=Ready For A Character . 
0=Not Ready For A Character 


Transmitter Empty Status (RO) 
1=Empty 
0=Not Empty 

FIFO Error Status (RO) 

1=Error 


O=No Error 
; 290486-54 


Figure 54, Line Status Register 


Description 


FIFO ERROR STATUS (FIFOE): In the non-FIFO Mode this is a 0. In the F IFO Mode, FIFOE is set to 
1 when there is at least one parity error, framing error, or break indication in the FIFO. FIFOE is set 
to 0 when the CPU reads the LSR, if there are no subsequent errors in the FIFO. 


TRANSMITTER EMPTY STATUS (TEMT): This bit is the Transmitter Empty (TEMT) indicator. When 
the Transmitter Holding Register (THR) and the Transmitter Shift Register (TSR) are both empty, the 
82091AA sets TEMT to a 1. When either the THR or TSR contains a data character, TEMT is set to 
a 0. The default is 0. In FIFO mode, this bit is set to 1 when the transmitter FIFO and the shift 
register are both empty. 
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5 TRANSMITTER HOLDING REGISTER STATUS (THRE): This bit is the Transmitter Holding 

Register Empty (THRE) indicator. THRE indicates that the serial port module is ready to accept a 
new character for transmission. In addition, this bit causes the serial port module to issue an 
interrupt to the CPU when the Transmit Holding Register Empty Interrupt enable is set toa 1. THRE 
is set to 1 when a character is transferred from the Transmitter Holding Register into the Transmitter 
Shift Register. THRE is set to O when the CPU loads the Transmitter Holding Register. In the FIFO 
mode, this bit is set to a 1 when the transmit FIFO is empty, and is set to 0 when at least 1 byte is 
written to the transmit FIFO. | 


BREAK INTERRUPT STATUS (BI): This bit is the Break Interrupt (BI) indicator. Bl is set to a 1 when 
the received data input is held in the Spacing state (logic 0) for longer than a full word transmission 
time (that is, the total time of Start bit + data bits + Parity + Stop bits). When the CPU reads the 

‘contents of the Line Status Register, Bl is set to 0. 


In FIFO mode, this error is associated with the particular character in the FIFO associated with the 
Break. BI is indicated to the CPU when its associated character is at the top of the FIFO. When 
break occurs only one character is loaded into the FIFO. Restarting after a break is received 
requires the SIN pin to be a logical 1 for at least 14 bit times. 


NOTE: 
Bits[3:0] are the error conditions that produce a Receiver Line Status interrupt whenever any of the 
corresponding conditions are detected and that interrupt is enabled. 


FRAMING ERROR STATUS (FE): This bit is the Framing Error (FE) indicator. FE indicates that the 
received character did not have a valid stop bit. FE is set to a 1 when the stop bit following the last 
data bit or parity bit is O (spacing level). FE is set to O when the CPU reads the contents of the Line 
Status Register. 


In FIFO mode, this error is associated with the particular character in the FIFO that it applies to. This 
error is revealed to the CPU when its associated character is at the top of the FIFO. When a framing 
error is due to the next start bit, the serial port attempts to resynchronize. In this case, the serial port 
module samples this start bit twice and, if no FE exists, then the module takes in the rest of the bits. 


PARITY ERROR STATUS (PE): This bit is the Parity Error (PE) indicator. PE indicates that the 
received data character does not have the correct even or odd parity, as selected by the EVENPAR 
bit in the Line Status Register. When a parity error is detected, PE is set to 1. PE is set to 0 when the 
CPU reads the contents of the Line Status Register. In the FIFO mode, this error is associated with 
the particular character in the FIFO that it applies to. This error is indicated to the CPU when its 
associated character is at the top of the FIFO. 


OVERRUN ERROR STATUS (OE): OE indicates that data in the Receiver Buffer Register was not 
read by the CPU before the next character was transferred into the Receiver Buffer Register. In this 
case, the previous character is overwritten. When an overrun is detected, OE is set to 1. when the 
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CPU reads the Line Status Register, OE is set to 0. This bit is read only. 


If the FIFO mode data continues to fill the FIFO beyond the trigger level, an overrun error will occur 
only after the FIFO is completely full and the next character has been received in the shift register. 

OE is indicated to the CPU as soon as it happens. The character in the shift register is overwritten, 

but it is not transferred to the FIFO. 


RECEIVER DATA READY STATUS (DR): DR is set to 1 when a complete incoming character has 
been received and transferred into the Receiver Buffer Register or the FIFO. When the data in the 
Receiver Buffer Register or FIFO is read, DR is set to 0. This bit is read only. 
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7.1.10 MSR(A,B)—MODEM STATUS REGISTER 


I/O Address: Base +6h 
Default Value: XXXX 0000 
Attribute: Read/Write 


Size: «BB bits 


The MSR provides the current state of the control lines from the Modem (or peripheral device) to the CPU. 
Bits[7:4] provide the status of the DCD#, RI, DSR#, and CTS# Modem signals. In addition to the current- 
state information of the Modem signals, bits[3:0] provide change information for these signals. Bits[3:0] are 
set to a 1 when the corresponding input signal changes state. Bits[3:0] are set to a 0 when the CPU reads the 
Modem Status Register. 


7 6 5 4 3 2 1 0 Bit 
PELE LT ET] 
Delta CTS# Status (RO) 


1=Change 
0=No Change 


Delta DSR# Status (RO) 
(Change Since Last CPU Read) 
1=Change 
0:=No Change 
Trailing Edge of I# Status (RO) 
(Low-to-High Transition Since Last CPU Read) 
1=Change 
0=No Change. 


Delta DCD# Status (RO) 
(Change Since Last CPU Read) 
1=Change 
=No Change 
CTS# Status (RO) 


1=Asserted 
0=Negated 


DSR# Status (RO) 
1=Asserted 
0=Negated 

Ri# Status (RO) 

1=Asserted 

0=Negated 


DCD# Status (RO) 
1=Asserted 
0=Negated 
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NOTE: ) 
X= Value determined by state of the corresponding modem control signal. 


Figure 55. Modem Status Register 
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DATA CARRIER DETECT STATUS: This bit is the compliment of the Data Carrier Detect (DCD #) 
input. If bit 4 of the MCR is set to a 1, this bit is equivalent to IRQ ENABLE in the MCR. 


RING INDICATOR STATUS: This bit is the compliment of the Ring Indicator (RI) input. If bit 4 of the 
MCR is set to a 1, this bit is equivalent to OUT1 in the MCR. 


DATA SET READY STATUS: This bit is the compliment of the Data Set Ready (DSR #) input. If bit 4 
of the MCR is set to a 1, this bit is equivalent to DTR in the MCR. 


CLEAR TO SEND STATUS: This bit is the compliment of the Clear to Send (CTS #) —_ If bit 4 of 
the MCR is set to a 1, this bit is equivalent to RTS in the MCR. 


DELTA DATA CARRIER DETECT STATUS: This bit is the Delta Data Carrier Detect (DDCD) 


indicator. Bit 3 indicates that the DCD # input to the chip has changed state. 
NOTE: 
Whenever bit 0, 1, 2, or 3 is set to logic 1, a Modem Status Interrupt is generated. 


TRAILING EDGE OF RING INDICATOR STATUS: This bit is the Trailing Edge of Ring Indicator 
(TERI) detector. Bit 2 indicates that the RI # input to the chip has changed from a low to a high state. 


DELTA DATA SET READY STATUS: This bit is the Delta Data Set Ready (DDSR) indictor. Bit 1 
indicates that the DSR # input to the chip has changed state since the last time it was read by the 
CPU. 


DELTA CLEAR TO SEND STATUS: This bit is the Delta Clear to Send (DCTS) indicator. Bit 0 
indicates that the CTS # input to the chip has changed state since the last time it was read by the 
CPU. 


7.1.11 SCR(A,B)}—SCRATCHPAD REGISTER 


|/O Address: Base +7h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


This 8-bit read/write register does not control the serial port module in any way. It is intended as a scratchpad 
register to be used by the programmer to hold data temporarily. 


C.-T 
SCRATCHPAD DATA: Bits[7:0] of this register correspond to SD[7:0]. 
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7.2 FIFO Operations 

This section describes the FIFO operations for inter- 
rupt and polled modes. 

7.2.1 FIFO INTERRUPT MODE OPERATION 


When the Receive FIFO and receiver interrupts are 
enabled (FCRO=1 and IERO= 1), receiver interrupts 
occur as follows: 


1. The receive data available interrupt is invoked 


when the FIFO has reached its programmed trig- 
ger level. The interrupt is cleared when the FIFO 
drops below the programmed trigger level. 


2. The IIR receive data available indication also oc- 
curs when the FIFO trigger level is reached, and 
like the interrupt, the bits are cleared when the 
FIFO drops below the trigger level. 


3. The receiver line status interrupt (IIR-O6h), as be- 


fore, has higher priority than the received data 
available (IIR =04h) interrupt. 


4. The data ready bit (LSRO) is set as soon as a 
character is transferred from the shift register to 
the. receive FIFO. This bit is set to 0 when the 
FIFO is empty. 


When receiver FIFO and receiver interrupts are en- 
abled, receiver FIFO timeout interrupts occur as fol- 
lows: 


1. A FIFO timeout interrupt occurs, if the following 
conditions exist: 


a. At least one character is in the FIFO. 


b. The most recent serial character received was 
longer than 4 continous character times ago (if 
2 stop bits are programmed, the second one is 
included in this time delay). 


c. The most recent CPU read of the FIFO was 
longer than 4 continous character times ago. 


The maximum time between a received charac- 
ter and a timeout interrupt is 160 ms at 300 
baud with a 12-bit receive character (i.e., 1 
start, 8 data, 1 parity, and 2 stop bits). 


. 2, Character times are calculated by using the RCLK 
input for a clock signal (this makes the delay pro- 
portional to the baud rate). 
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3. When a timeout interrupt occurs, it is cleared and 
the timer reset when the CPU reads one charac- 
ter from the receiver FIFO. 


4. When a timeout interrupt does not occur, the 
timeout timer is reset after a new character is re- 
ceived or after the CPU reads the receiver FIFO. 


When the transmit FIFO and transmitter interrupts 
are enabled (FCRO = 1, IER1 = 1), transmit interrupts 
occur as follows: 


1. The transmitter holding register interrupt occurs 
when the transmit FIFO is empty. The interrupt is 
cleared as soon as the transmitter holding regis- 
ter is written (1 to 16 characters may be written to 
the transmit FIFO while servicing the interrupt) or 
the IIR is read. 


Character timeout and receiver FIFO trigger level in- 
terrupts have the same priority as the current re- 


ceived data available interrupt. Transmit FIFO empty 


has the same priority as the current transmitter hold- © 
ing register empty interrupt. 


7.2.2 FIFO POLLED MODE OPERATION 


With FIFO = 1, setting IER[3:0] to all Os puts the se- 
rial port in the FIFO polled mode of operation. Since 
the receiver and transmitter are controlled separate- 
ly, either one or both can be in the polled mode of 
operation. 


In this mode, software checks receiver and transmit- 

ter status via the LSR. As stated in the register de- 

scription: 

¢ LSRO is set as long as there is one byte in the 
receiver FIFO. . 

e LSR1 and LSR4 specify which error(s) has oc- 
curred. Character error status is handled the 
same way as interrupt mode. The IIR is not af- 
fected since IER2=0. 

e LSR5 indicates when the transmitter FIFO is 
empty. | 

e LSR6 indicates that both the transmitter FIFO 
and shift register are empty. 

e LSR7 indicates whether there are any errors in 
the receiver FIFO. 
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8.0 FLOPPY DISK CONTROLLER 


The 82091AA’s Floppy Disk Controller (FDC) is 
functionally compatible with 82078/82077SL/ 
82077AA/8272A floppy disk controllers. During 
82091AA configuration, the FDC can be configured 
for either two drive support or four drive support via 
the FCFG1 Register. This section provides a com- 
plete description of the FDC when it is configured for 
two drive support. Additional information on four 
drive support is provided in Appendix A, FDC Four 
Drive Support. 


NOTE: 
For FDC compatibility and programming 
guidelines, refer to the 82078 Floppy Disk 
Controller Data sheet. 


8.1 Floppy Disk Controller Registers 


The FDC contains seven status, control, and data 
registers. Table 23 shows the I/O address assign- 
ments for the FDC registers and the individual regis- 
ter descriptions follow in the order that they appear 
in the table. The registers provide control/status 
information and data paths for transfering data be- 
tween the floppy disk controller interface and the 
8-bit host interface. In some cases, two different reg- 
isters occupy the same |/O address. In these cases, 
one register is read only and the other is write only 
(i.e., a read to the I/O address accesses one regis- 
ter and a write accesses the other register). 


All registers are accessed as byte quantities. The 
base address is determined by hardware configura- 
tion at powerup (or a hard reset) or via software con- 
figuration by programming the 82091AA configura- 
tion registers as described in Section 4.0, AIP Con- 
figuration. 
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During a hard reset (RSTDRV asserted), the 
82091AA registers are set to pre-determined de- 
fault states. The default values are indicated in the 
individual register descriptions. Reserved bits in the 
FDC registers must be programmed to 0 when writ- 
ing the register and these bits are 0 when read. The 
following bit notation is used for default settings: 


xX Default bit position value is determined by 


conditions on an 82091AA signal pin. 


The following nomenclature is used for register ac- 
cess attributes: 


RO Read Only. Note that for registers with read 
only attributes, writes to the I/O address have 
no affect on floppy disk operations. 


WO Write Only. Note that for all FDC registers 
with write only attributes, reads of the I/O ad- 
dress access a different register. 


R/W Read/Write. A register with this attribute can 
be read and written. Note that individual bits in 
some read/write registers may be read only. 


Table 23 lists the register accesses that bring the 
FDC out of a powerdown state. All other registers 
accesses are possible without waking the part from 
a powerdown state and reads from these registers 
reflects the true status as shown in the register de- 
scription. For writes that do not affect the power- 
down state, the FDC retains the data and will subse- 
quently reflect it when the FDC awakens. Note that 
for accesses that do not affect powerdown, the ac- 
cess may cause a temporary increase in FDC power 
consumption. The FDC reverts back to low power 
mode when the access has been completed. None 
of the extended registers effect the behavior of the 
powerdown mode. 
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~ FDC Register 
Address Access 


[stausRegster® dN 

aM 
[Tape Ove Register| Noid 
[Mein SiatusRegister | Yes ~—~«dY RO 


es 

[Reserved | 
Digital Input Register No 
Configuration Control Register fo 


NOTES: 

1. The base address is 3FOh (primary address) or 370 (secondary address). . 

2. While writing to the DOR or DSR does not wake up the FDC, writing any of the motor enable bits in the DOR or invoking 
a software reset (either via DOR or DSR reset bits) will wake up the FDC. 
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8.1.1 SRB—STATUS REGISTER B (EREG EN= 1) 


1/O Address: Base + 1h 
Default Value: RRRR RRXX 
Attribute: Read/Write 
Size: 8 bits 


SRB provides status and control information when auto powerdown is enabled. In the AT/EISA mode the SRB 
is made available whenever the EREG EN bit in the POWERDOWN MODE Command is set to 1. When EREG 
EN bit is set to 0, this register is not accessible. In this case, writes have no affect and reads return indetermi- — 
nate values. 


Default 


IDLE Status (RO) 
1=IDLE State 
0=Not in IDLE State 


Powerdown Status (RO) 
1=Powerdown State 
0=Not in Powerdown State 


Reserve 
© d 290486-—56 


NOTE: 
X= Value is determined by the state of the corresponding signal pin. 


Figure 56. Status Register B 


Cae eee 
RESERVED 


POWERDOWN STATUS (PD): This bit reflects the powerdown state of the FDC module. The 


82091AA sets PD to 1 when the FDC is in the powerdown state. When PD=0, the FDC is not in the 
powerdown state. 


IDLE STATUS (IDLE): This bit reflects the idle state of the FDC module. The 82091AA sets IDLE to 
1 when the FDC is in the idle state. When IDLE =0, the FDC is not in the idle state. 
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8.1.2 DOR—DIGITAL OUTPUT REGISTER 


1/O Address: © Base +2h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


The Digital Output Register enables/disables the floppy. disk drive motors, selects the disk drives, enables/dis- 
ables DMA, and provides a FDC module reset. The DOR reset bit and the motor enable bits have to be 
inactive when the FDC is in powerdown. The DMAGATE# and drive select bits are unchanged. During 
powerdown, writing to the DOR does not wake up the FDC, except for activating any of the motor enable bits. 
Setting the motor enable bits to 1 wakes up the FDC. 


NOTES: | 

1. The descriptions in this section for DOR only apply when two-drive support is selected in the FCFG1 
Register (FDDQTY =0). For four-drive support (FDDQTY = 1), refer to Appendix A, FDC Four Drive 
Support. : 

2. The drive motor can be enabled separately without selecting the drive. This permits the motor to 
come up to speed before selecting the drive. Note also that only one drive can be selected at a time. 
However, the drive should not be selected without enabling the appropriate drive motor via bits[5:4] 
of this register. 


Default 


Drive Select (R/W) 
1=Drive 1 Selected 
0=Drive 0 Selected 
Reserved 
FDC Reset (R/W) 


0=Resets FDC 
1=Does Not Reset FDC 


DMA Enable (R/W) 
1=Enable 
0=Disable 


Motor Enable 0 (R/W) 
(for Drive 0) 
1=Enable 
0=Disable 
Motor Enable 1 (R/W) 
1=Enable 
0=Disable 


Reserved 
290486-57 


Figure 57. Digital Output Register 
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RESERVED: For a two-drive system, these bits are not used and have no affect on FDC operation. 
For a four drive system, see Appendix A, FDC Four Drive Support. 


MOTOR ENABLE 1 (ME1): This bit controls a motor drive enable signal. ME1 directly controls either 
the FDME1 # signal or FDMEO# signal, depending on the state of the BOOTSEL bit in the TDR. 
When ME1 = 1, the selected motor enable signal (FDME1# or FOME0 #) is asserted and when 
ME1=0, the selected motor enable signal is negated. 


MOTOR ENABLE 0 (MEO): This bit controls a motor drive enable signal. ME1 directly controls either 
the FDMEO# signal or FDME1 # signal, depending on the state of the BOOTSEL bit in the TDR. 
When MEO = 1, the selected motor enable signal (FDME0# or FDME1 #) is asserted and when 
MEO = 0, the selected motor enable signal is negated. 


DMA GATE (DMAGATE): This bit enables/disables DMA for the FDC. When DMAGATE = 1, DMA 
for the FDC is enabled. In this mode, FDDREQ, TC, IRQ6, and FDDACK # are enabled. When 
DMAGATE = 0, DMA for the FDC is disabled. In this mode the IRQ6, and DRQ outputs are tri-stated 
and the DACK# and TC inputs are disabled to the FDC. Note that the TC input is only disabled to 
the FDC module. Other functional units in the 82091AA (e.g., parallel port or IDE inleriaes can still 
use the TC input signal for DMA activities. 


FDC RESET (DORRST): DORRST is a software reset for the FDC module. When DORRST is set to 
0, the basic core of the FDC and the FIFO circuits are cleared conditioned by the LOCK bit in the 
CONFIGURE Command. This bit is set to 0 by software or a hard reset (RSTDRV asserted). The 
FDC remains in a reset state until software sets this bit to 1. This bit does not affect the DSR, CCR 
and other bits of the DOR. DORRST must be held active for at least 0.5 ws at 250 Kbps. This is less 
than a typical ISA |/O cycle time. Thus, in most systems consecutive writes to this register to toggle 
this bit allows sufficient time to reset the FDC. 


RESERVED: For a two-drive system, this bit is not used and must be programmed to 0. For a four 
drive system, see Appendix A, FDC Four Drive Support. 


DRIVE SELECT (DS): This selects the floppy drive by controlling the FDSO# and FDS1# output 
signals. DS directly controls FDS1 and FDSO as follows: 
Bit 0 Output Pin Status 
0 FDSO# asserted (FDS1 asserted if BOOTSEL = 1) 
1 FDS1# asserted (FDS1 asserted if BOOTSEL = 1) 


8.1.3 TDR—ENHANCED TAPE DRIVE REGISTER 


1/O Address: Base +3h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


This register allows the user to assign tape support to a particular drive during initialization. Any future refer- 
ences to that drive number automatically invokes tape support. A hardware reset sets all bits in this register to 
O making drive O not available for tape support. A software reset via bit 2 of the DOR does not affect this 
register. Drive 0 is reserved for the floppy boot drive. Bits[7:2] are only available when EREG EN = 1; other- 
wise the bits are tri-stated. EREG EN is a bit in the POWERDOWN Command. 
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_ Default 


Tape Select 0 (R/W) 
(for Drive 0) 
_ 1=Selected 
0=Not Selected 
Tape Select 1 (RW) 
(for Drive 1) 
1=Selected 
_ 0=Not Selected | 
Boot Drive Select (R/W) 
see Table 


Reserved 
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Figure 58. Enhanced Tape Drive Register 


Description 
RESERVED 


BOOT DRIVE SELECT (BOOTSEL): The BOOTSEL bit is used to remap the drive selects and 
_motor enables. The functionality is as described below: 


BOOTSEL Mapping 
— 0 DSO —> FDSO, MEO —> FDMEO (default) 
DS1 — DS1, ME1 —> FDME1 
1 - . DSO — DS1, MEO — FDME1 


~DS1— FDSO, ME1 — FDMEO 


Note that this mapping also applies to a four drive system (FDDQTY = 1 in the FCFG1 Register). Ina 
four drive system, only drive 0 or drive 1 can be selected as the boot drive. 


RESERVED: For a two-drive system, this bit is not used and must be programmed to 0. For a four 
drive system, see Appendix A, FDC Four Drive Support. 


TAPE SELECT (TAPESEL): This bit is used by software to assign logical drive number 1 to be a 
tape drive. Other than adjusting precompensation delays for tape support, this bit does not affect the 
FDC hardware. The bit can be written and read by software as an indication of the tape drive 
assignment. Drive 0 is not available as a tape drive and is reserved as the floppy disk boot drive. The 
| tape drive assignment is as follows: 
Bit 0 Drive Selected 

0 None (all are floppy disk drives) 
1 Drive 1 is a tape drive. 
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8.1.4 MSR—MAIN STATUS REGISTER 


1/O Address: Base + 4h 
Default Value: 00h 
Attribute: Read Only 
Size: 8 bits 


This read only register provides FDC status information. This information is used by software to control the 
flow of data to and from the FIFO (accessed via the FDCFIFO Register). The MSR indicates when the FDC is 
ready to send or receive data through the FIFO. During non-DMA transfers, this register shouid be read before 
each byte is transferred to or from the FIFO. 


After a hard or soft reset or recovery from a powerdown state, the MSR is available to be read by the host. The 
register value is 00h until the oscillator circuit has stabilized and the internal registers have been initialized. 
When the FDC is ready to receive a new command, MSR[7:0] = 80h. The worst case time allowed for the MSR 
to report 80h (i.e., RQM is set to 1) is 2.5 us after a hard or soft reset. | 


Main Status Register is used for controlling command input and result output for all commands. Some example 
values of the MSR are: 


¢ MSR = 80H; The controller is ready to receive a command. 
® MSR = 90H; executing a command or waiting for the host to read status bytes (assume DMA mode). 
¢ MSR= DOH; waiting for the host to write status bytes. 


7 6 5 4 3 2 1 0 Bit 
oTe[o[e[s ef] e] mmm 
Drive 0 Busy (RO) 


see Text 


Drive 1 Busy (RO) 
see Text 


Reserved 


Command Busy (RO) 
1=FDC Command in Progress 
0=No FDC Command in Progress 


Non DMA Mode (RO) 
see Text 


Data I/O Direction (RO) 
1=Host Data Read Required 
0=Host Data Write Required 


Request for Master (RO) 
1=Host can Transfer Data 
0=Data Transfer is not Permitted 
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Figure 59. Main Status Register 
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Description 


REQUEST FOR MASTER (RQM): When RQM = 1, the FDC is ready to send/receive data through 
the FIFO (FDCFIFO Register). The FDC sets this bit to.0 after a byte transfer and then sets the bit to 
1 when it is ready for the next byte. During non-DMA execution phase, RQM indicates the status of 
IRQ6. 


DIRECTION I/O (DIO): When RQM = 1, DIO indicates the direction of a data transfer. When 
DIO= 1, the FDC is requesting a read of the FDCFIFO. When DIO =0, the FDC is requesting a write 
to the FDCFIFO. 


NON-DMA (NONDMA): Non-DMA mode is selected via the SPECIFY Command. In this mode, the 
FDC sets this bit to a 1 during the execution phase of a command. This bit is for polled data 
transfers and helps differentiate between the data transfer phase and the reading of result bytes. 


COMMAND BUSY (CMDBUSY): CMDBUSY indicates when a command is in progress. When the 
first byte of the command phase is written, the FDC sets this bit to 1. CMDBUSY is set to 0 after the 
last byte of the result phase is read. If there is no result phase (e.g., SEEK or RECALIBRATE | 
Commands), CMDBUSY is set to 0 after the last command byte is written. 


~|2 


RESERVED: For a two-drive system, these bits are not used and must be programmed to 0. For a 
four drive system, see Appendix A, FDC Four Drive Support. 


DRIVE 1 BUSY (DRV1BUSY): The FDC module sets this bit to 1 after the last byte of the command 
phase of a SEEK or RECALIBRATE Command is issued for drive 1. This bit is set to 0 after the host 
reads the first byte in the result phase of the SENSE INTERRUPT Command for this drive. 


DRIVE 0 BUSY (DRVOBUSY): The FDC module sets this bit to 1 after the last byte of the command 
phase of a SEEK or RECALIBRATE Command is issued for drive 0. This bit is set to 0 after the host 
reads the first byte in the result phase of the SENSE INTERRUPT Command for this drive. 


8.1.55 DSR—DATA RATE SELECT REGISTER 


1/O Address: Base + 4h 
Default Value: 02h 
Attribute: Write Only 
Size: | 8 bits 


The DSR selects the data rate, amount of write precompenstion, invokes direct powerdown, and invokes a 
FDC software reset. This write only register ensures backward compatibility with the Intel series of floppy disk 
controllers. Changing the data rate changes the timings of the drive control signals. To ensure that drive 
timings are not violated when changing data rates, choose a drive timing such that the fastest data rate will not 
violate the timing. 


In the default state, the PDOSC bit is low and the oscillator is powered up. When this bit is programmed to a 1, | 
the oscillator is shut off. Hardware reset sets this bit to a 0. Neither of the software resets (via DOR or DSR) 
have any effect on this bit. Note that PDOSC should only be set to a 1 when the FDC module is in the 
powerdown state. Otherwise, the FDC will not function correctly and must be hardware reset once the oscilla- . 
tor has turned back on and stabilized. Setting the PDOSC bit has no effect on the clock input to the FDC (the | 
X1 pin). The clock input is separately disabled when the part is powered down. The Save Command checks 
the status of PDOSC. However the Restore Command will not restore this bit to a 1. 


Software resets do not affect the DRATE or PRECOMP bits. 
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Data Rate (R/W) 
00=500 Kbps 
01=300 Kbps 


10=250 Kbps 
11=1 Mbps . 


Precompensation Delays (R/W) 

000=Default Precompensation Delay Values (see Text) 
001=41.67 ns 

010=83.34 ns 

011=125.00 ns 

100=166.67 ns 

101=208.33 ns 

110=250.00 ns 

111=0.00 ns (Disabled) 


Reserved 
Always Write to 0 


Powerdown (R/W) 
1=FDC in Powerdown 
0=FDC Powerdown Not Selected 


Software Reset (R/W) 
(Self-Clearing) 
1=Resets FDC 

0=Does Not Reset FDC 
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Figure 60. Data Rate Select Register 
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Description 


SOFTWARE RESET (DSRRST): DSRRST operates the same as the DORRST bit in the DOR, 
except that this bit is self clearing. 


POWERDOWN (FPD): FPD provides direct powerdown for the FDC module. When FPD = 1, the 
FDC module enters the powerdown state, regardless of the state of the module. The FDC module is 
internally reset and then put into powerdown. No status is saved and any operation in progress is 
aborted. A hardware or software reset causes the 82091AA to exit the FDC module powerdown 
state. 


RESERVED 


PRECOMPENSATION (PRECOMP): Bits[4:2] adjusts the WRDATA output to the disk to 
compensate for magnetic media phenomena known as bit shifting. The data patterns that are 
susceptible to bit shifting are well understood and the FDC compensates the data pattern as it is 
written to the disk. The amount of precompensation depends on the drive and media but in most 
cases the default value is acceptable. The FDC module starts pre-compensating the data pattern 
starting on Track 0. The CONFIGURE Command can change the track where pre-compensating 
originates. 


Bits[4:2] | Precompensation Delays (ns) 


000 Default mode 
001 41.67 

010 83.34 

011 125.00 

100 166.67 

101 208.33 

110 250 

111 0.00 (disabled) 


The default precompensation delay mode provides the following delays: 


Data Rate Default Precompensation Delays (ns) 
1 Mbps 41.67 . 

0.5 Mbps 125.00 

0.3 Mbps 125.00 

0.25 Mbps 125.00 


DATA RATE SELECT (DRATESEL): DRATESEL[1:0] select one of the four data rates as listed 
below. The default value is 250 Kbps. 


Bits[1:0] Date Rate 


11 1 Mbps 

00 ~~ 500 Kbps 

01 300 Kbps 

10 250 Kbps - default 


ADVANCE INFORMATION ; 


intel P | | 82091AA 


8.1.6 FDCFIFO—FDC FIFO (DATA) 


|/O Address: Base + 5h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


All command parameter information and disk data transfers go through the 16-byte FIFO. The FIFO has 
programmable threshold values. Data transfers are governed by the RQM and DIO bits in the MSR. At the start 
of a command, the FIFO action is always disabled and command parameters must be sent based upon the 
RQM and DIO bit settings. At the start of the command execution phase, the FDC clears the FIFO of any data 
to ensure that invalid data is not transferred. An overrun or underrun will terminate the current command and 
the transfer of data. Disk writes complete the current sector by generating a 00 pattern and valid CRC. 


The FIFO defaults to an 8272A compatible mode after a hardware reset (via RSTDRV pin). Software resets 
(via DOR or DSR) can also place the FDC into 8272A compatible mode, if the LOCK bit is set to 0 (see the 
definition of the LOCK bit) maintaining PC-AT hardware compatibility. The default values can be changed 
through the CONFIGURE Command (enable full FIFO operation with threshold control). The FIFO provides 
the system a larger DMA latency without causing a disk error. The following table gives several examples of 
the delays with a FIFO. The data is based upon the formula: Threshold# x< 1/DATA RATE xX 8 — 1.5 
ps = DELAY. 3 


16 Byte FIFO 


/7 o\ Bit 


Data, Commands 
Disk Data or FDC Commands (R/W) 


Figure 61. FDC FIFO 


FIFO DATA: Bits[7:0] correspond to SD[7:0]. 
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8.1.7 DIR—DIGITAL INPUT REGISTER 


|/O Address: Base +7h 
Default Value: 00h 
Attribute: Read Only 
Size: 8 bits 


This register is read only in all modes. In PC-AT mode only bit 7 is driven and all other bits remain tri-stated. 


Bit 


— Not Used (RO) 
_ (Trl-Stated During Reads) 


— Disk Change (RO) 
_ 1=Disk Change (DSKCHG# Signal Asserted) 


0=No Disk Change (DSKCHG# Signal Negated) 
290486-62 


Figure 62. Digital Input Register 


Description , 


DISK CHANGE (DSKCHG): This bit monitors a disk change in the floppy disk drive. DSKCHG is set 
to a 1 when the DSKCHG # signal on the floppy interface is asserted. DSKCHG is set to a 0 when 
the DSKCHG # signal on the floppy interface is negated. During powerdown, this bit is invalid. 


NOT USED: These bits are tri-stated during a read. 
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8.1.8 CCR—CONFIGURATION CONTROL REGISTER 


1/O Address: Base +7h 
Default Value: 02h 
Attribute: Write Only 
Size: 8 bits 


This register sets the data rate. 


Data Rate (WO) 
00=500 Kbps 


01=300 Kbps 
Not Used 10=250 Kbps 


amente ©) 11=1 MBps 
290486-63 


Figure 63. Configuration Control Register 
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8.2 Reset 


There are four sources of FDC reset—a hard reset 
via the RSTDRV signal and three software resets 
(via the FCFG2, DOR, and DSF Registers). At the 


end of the reset, the FDC comes out of the power- — 


down state. Note that the DOR reset condition re- 
mains in effect until software programs the DORRST 
bit to 1 in the DOR. All operations are terminated 
and the FDC enters an idle state. Invoking a reset 
while a disk write activity is in progress will corrupt 
the data and CRC. On exiting the reset state, various 
internal registers are cleared, and the FDC waits for 
a new command. Drive polling will start unless dis- 
- abled by a new CONFIGURE Command. 


8.2.1 HARD RESET AND CONFIGURATION 
REGISTER RESET 


A hard reset (asserting RSTDRV) and a software re- 
set through the FCFG2 Registers have the same af- 
fect on the FDC. These resets clear all FDC regis- 


ters, except those programmed by the SPECIFY 


command. The DOR reset bit is enabled and must 
be set to 0 by the host to exit the reset state. 


8.2.2 DOR RESET vs DSR RESET 


The DOR and DSR resets are functionally the same. 
_ The DSR reset is included to maintain 82072 com- 
. patibility. Both reset the 8272 core, which affects 
drive status information. The FIFO circuits are also 
reset if the LOCK bit is a 0 (see definition of the 
LOCK bit). The DSR reset is self-clearing (exits the 
reset state automatically) while the DOR reset re- 
mains in the reset state until software writes the 
DOR reset bit to 0. DOR reset has precedence over 
‘the DSR reset. The DOR reset is set automatically 
-when a hard reset or configuration reset occurs. 
Software must set the DOR reset bit to 0 to. exit the 
reset state. 


- The AC Specifications gives the minimum amount of 
‘time that the DOR reset must be held active. This 
amount of time that the DOR reset must be held 
active is dependent upon the data rate. FDC re- 
quires that the DOR reset bit must be held active for 
at least 0.5 us at 250 Kbps. This is less than a typi- 
cal ISA 1/O cycle time. 
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8.3 DMA Transfers 


DMA transfers are enabled with the SPECIFY Com- 
mand. When enabled, The FDC initiates DMA trans- 
fers by asserting the FDDREQ signal during a data 
transfer command. The FIFO is enabled directly by 
asserting FDDACK# and addresses need not be 
valid. 


8.4 Controller Phases 


The FDC handles commands in three phases—com- 
mand, execution and result. Each phase is de- 
scribed in the following sections. When not process- 
ing a command, the FDC can be in the idle, drive 
polling or powerdown state. This section describes 
the command, execute and result phases. 


8.4.1 COMMAND PHASE 


After a reset, the FDC enters the command phase 
and is ready to accept a command from the host. 
For each of the commands, a defined set of com- 
mand code bytes and parameter bytes must be writ- 
ten to the FDC (as described in Section 8.8, Com- 
mand Set Description) before the command phase 
is complete. These bytes of data must be trans- 
ferred in the order described. 


Before writing to the FDC, the host must examine 
the RQM and DIO bits of the Main Status Register. 
RQM must be 1 and DIO must be 0, before com- 
mand bytes may be written. The FDC sets RQM to 0 
after each write cycle and keeps the bit at 0 until the 
received byte is processed. After processing the 
byte, the FDC sets RQM to 1 again to request the 
next parameter byte of the command, unless an ille- 
gal command condition is detected. After the last 
parameter byte is received, RQM remains 0, and the 
FDC automatically enters the next phase (execution 
or result phase) as defined by the command defini- 
tion. - 


The FIFO is disabled during the command phase to 
retain compatibility with the 8272A, and to provide 
for the proper handling of the Invalid Command con- 
dition. 
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8.4.2 EXECUTION PHASE 


The following paragraphs detail the operation of the 
FIFO flow control. In these descriptions, threshold is 
defined as the number of bytes available to the FDC 
when service is requested from the host, and ranges 
from 1 to 16. The FIFOTHR parameter, which the 
user programs, is one less and ranges from 0 to 15. 


A low threshold value (e.g., 2) results in longer peri- 
ods of time between service requests but requires 
faster servicing of the request for both read and 
write cases. The host reads (writes) from (to) the 
FIFO until empty (full), then the transfer request 
goes inactive. The host must be very responsive to 
the service request. This is the desired case for use 
with a “fast’’ system. 


A high value of threshold (e.g., 12) is used with a 
“sluggish” system by affording a long latency period 
after a service request, but results in more frequent 
service requests. 


8.4.2.1 Non-DMA Mode Transfers from the FIFO 
to the Host 


The IRQ6 pin and ROM bits in the Main Status Reg- 
ister are activated when the FIFO contains 16 (or set 
threshold) bytes, or the last bytes of a full sector 
transfer have been placed in the FIFO. The IRQ6 pin 
can be used for interrupt driven systems and RQM 
can be used for polled sytems. The host must re- 
spond to the request by reading data from the FIFO. 
This process is repeated until the last byte is trans- 
ferred out of the FIFO, then FDC negates the IRQ6 
pin and RQM bit. 


8.4.2.2 Non-DMA Mode Transfers from the Host 
to the FIFO 


The IRQ6 pin and RQM bit in the Main Status Regis- 
ter are activated upon entering the execution phase 
of data transfer commands. The host must respond 
to the request by writing data into the FIFO. The 
IRQ6 pin and RQM bit remain true until the FIFO 
becomes full. They are set true again when the FIFO 
has (threshold) bytes remaining in the FIFO. The 
IRQ6 pin is also negated if TC and DACK# both go 
inactive. The FDC enters the result phase after the 
last byte is taken by the FDC from the FIFO (i.e. 
FIFO empty condition). 
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8.4.2.3 DMA Mode Transfers from the FIFO to 
the Host 


The FDC asserts the FDDREQ signal when the FIFO 
contains 16 (or set threshold) bytes or the last byte 
of a full sector transfer has been placed in the FIFO. 
The DMA controller must respond to the request by 
reading data from the FIFO. The FDC negates 
FDDREQ when the FIFO is empty. FDDREQ is neg- 
ated after FDDACK # is asserted for the last byte of 
a data transfer (or on the active edge of RD#, on 
the last byte, if no edge is present on FDDACK #). 


NOTE: 
FDDACK# and TC must overlap for at least 
50 ns for proper functionality. A data under- 
run may occur if FDDREQ is not removed in 
time to prevent an unwanted cycle. 


8.4.2.4 DMA Mode Transfers from the Host to 
the FIFO 


The FDC asserts FODDREQ when entering the exe- 
cution phase of data transfer commands. The DMA 
controller must respond by asserting FDDACK # and 
WR#¥ _ signals and placing data in the FIFO. 
FDDREQ remains asserted until the FIFO becomes 
full. FDDREQ is again asserted when the FIFO has 
(threshold) bytes remaining in the FIFO. The FDC 
also negates the FDDREQ when the FIFO becomes 
empty (qualified by DACK# and TC overlapping by 
50 ns) indicating that no more data is required. 
FDDREQ is negated after FDDACK # is asserted for 
the last byte of a data transfer (or on the active edge 
of WR# of the last byte, if no edge is present on 
DACK#). A data overrun may occur if FDDREQ is 
not removed in time to prevent an unwanted cycle. 
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8.4.3 DATA TRANSFER TERMINATION 


The FDC supports terminal courit explicitly through 
the TC signal and implicitly through the underrun/ 
overrun and end-of-track (EOT) functions. For full 
sector transfers, the EOT parameter can define the 
last sector to be transferred in a single or multi-sec- 
tor transfer. If the last sector to be transferred is a 
partial sector, the host can stop transferring the data 
in mid-sector and the FDC will continue to complete 
the sector as if a hardware TC was received. The 
only difference between these implicit functions and 
TC is that they return ‘abnormal termination” result 
status. Such status indications can be ignored if they 
were expected. 


NOTE: 

When the host is sending data to the FIFO, 
the internal sector count will be complete 
when the FDC reads the last byte from its 
side of the FIFO. There may be a delay in 
the removal of the transfer request signal of 
up to the time taken for the FDC to read the 
last 16 bytes from the FIFO. The host must 
be able to tolerate this. In a DMA system, 
FDDREQ is removed (negated) as soon as 
TC is received indicating the termination of 
the transfer. The reception of TC also gener- 
ates an interrupt on IRQ6. However, in a 
non-DMA system the interrupt will not be 
generated until the FIFO is empty. | 
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The generation of IRQ6 determines the beginning of 

the result phase. For each of the commands, a de- 
fined set of result bytes has to be read from the FDC 
before the result phase is complete (refer to Section 
8.5, Command Set/Descriptions). These bytes of 
data must be read out for another command to start. 


RQM and DIO must both be 1 before the result bytes _ 
may be read from the FIFO. After all the result bytes 
have been read, RQM=1, DiIO=0, and CMDBU- 
SY=0 in the MSR. This indicates that the FDC is 
ready to accept the next command. 


8.5 Command Set/Descriptions 


Commands can be written whenever the FDC is in 
the command phase. Each command has a unique 
set of needed parameters and status results. The 
FDC checks to see that the first byte is a valid com- 
mand and, if valid, proceeds with the command. If it 
was invalid, the next time the RQM bit in the MSR 
register is 1 the DIO and CB bits will also be 1, indi- 
cating the FIFO must be read. A result byte of 80h 
will be read out of the FIFO, indicating an invalid 
command was issued. After reading the result byte 


from the FIFO, the FDC returns to the command 


phase. Table 23 shows the FDC Command set. 
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Table 24. FDC Command Set 


Phase | R/W Remarks 


D7 D6 D5 D4 D3 D2 D1 DO 


Read Data ) 


Command| W MT MFM SK 0 1 | 0 | Command Codes 
HDS DS1 DSO | 
Pedckech bbe eg saiaeiida rae Kad ectiies Poima hed a SOClOLID 
“aeseebecaee bas: .fhe §é§ aigtdaacueatadhamens Information Prior to 
sian Ha PRS e Raa See eeeeveceeeeeeee | COmMMand 
Execution 


Cr i cc i i i OO 


Cr i i cc SO i i | 


Cr a 8. cc i 


S=zzz 22=z= 


Ce A | 2 


Execution Data Transfer 
Between the FDD 
and System | 


Sadat etiaiaue-. ... "90. i (Gs eM Beene eee Status Information 
Mieke etek A §§§ aeedsatee Boia salads J After Command 
ua tina eieneae, §§  Eebamet a septecnatn San eee Execution 


Cr i SO 


eaibaeuete tah We  ( ite leeeeeeesseees | S@Ctor ID 

etiunuuatieamhenud. . iF: j.§. | caitd catceate clare pti sot Information After 

Loteuneeneaieenets IN j  ##§  @teet% State b865 5 oon Command 
Execution 


DUVUVVIIAD 


| 0 0 Command Codes 
HDS DSi DSO 


Command 


Sector ID 
Information Prior to 
Command 
Execution 


W 
W 
W 
WwW 
W 
W 
W 
W 
W 


Execution | Data Transfer 
Between the FDD 
and System 


Status Information 
After Command 
Execution 


Result 


Sector ID 
Information After 
Command 
Execution 


DUVIIID 
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| Table 24. FDC Command Set (Continued) 


Data Bus 
Remarks 
D7 D6. D5 D4 D3 D2 D1 DO | 


Write Data 
W MT- MFM 0 0 0 1 0 1 Command Codes 
W 0 0 0 0 0 HDS DS1 DSO | 
W secs mats shinee aad GC <Redeiurecanawsca® Sector ID 
5 ee eee re eee Ee —C«C#E hee penne anne Information Prior to 
MI, spate ats ace cutee tect R Command 
Execution 
WV Cl Bakete dees acdat ieee N. . —i(sé‘é eee 
WY Wi -.2ecwueteeeoe waead EO, . eitadeieleteuusas 
Wes ob inetd be Vers aate saad GPE 4 ~seelsohesiisoutase 
“ WV 1! co cavat ae ewe oora Die Se eetouiensattenies 
| Execution Data Transfer 
Between the FDD 
and System 
Result PAO, Us pain cad tae ert ees ae SEQ 4 -‘é24sneemecdaeows Status Information. 
PA, sociated at ine ata ales | ST 1 . eterna sa ews aeenk After Command 
oa eee Meaney SIZ - | daerimaekaoegaanas Execution 
BYe i dienes ares are eta eata HG ie ea pemanedhens hs 
Fel, Gediette aot re gee A t«(‘(‘é«‘«C ee ee eas Sector ID 
Pe tS chee crate citer eee Te 0”~*~*«C Ce eee ea Information After 
A, dueaee tak tee aed NC 8  0-s aee e ew etion Command 
Execution 
Write Deleted Data 
W MT MFM 0 82026060 1 0 0 1 Command Codes 
W 0 0 0 0 0 HDS DS1 DSO 
WV? Gi) popes elena ae Ci © Bostik hes egies Sector ID 
VV Nill anteater hahha Goes ane re reer errr: Information Prior to 
MG Gens Glen teen etre M> . . }£Syeedeuesteceoenres Command | 
Execution 
WU). 28 Some Baers erates NW. 060 se al eee | 
We'll ate Gee ores hone oue EOQl i gasiuadmrneumeaoas 
We ff! setewuioueeieesess GPo “wu mocievecuucnauins 
We, ccs cuaaetaeat DIL. : .. uses bieaneerenss 
Execution Data Transfer 
Between the FDD 
and System 
Result Faas i} suathaeiutnancvaa era aabasees SIO. - is Rhee dee wae Status Information 
= ee rer rere SVU jg.  ndxeiuidwenoareces After Command 
Fly oaaecesveeliog aaa Siz: aveuedieiccnataaen Execution 
Be tll Zecteeseecne ase esas C0 ete it Lele 
PL), ctacactee tate bee atee PoC ck Reed ote meh aah Sector ID 
Fe li: aan ise erpe cs Me em Anca eee Information After 
Fe ls ae Sat ease dataaesames Ne era hades State ntatee Matai Command 


Execution 
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Table 24. FDC Command Set (Continued) 


D7 D6 D5 D4 D3 D2 D1 DO 


Read Track 


0 1 Q | Command Codes 
HDS DS1 DSO 
Sector ID 
Information Prior to 
Command 
Execution 


Execution . Data Transfer 
Between the FDD 
and System. FDC 
Reads All Sectors 
From Index Hole 
to EOT 


Result | Status Information 
After Command 
Execution 


Sector ID 
Information After 
Command 
Execution 


| Command Codes 
HDS DS1 DSO 

Sector ID 
Information Prior to 
Command 
Execution 

N 

EOT 

GPL 

DTL/SC 

Execution Data Transfer 

Between the FDD 
and System 


Status Information 
After Command 
Execution 


Result 


Sector ID 
Information After 
Command 
Execution 


DDUVVDIVD 
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Command| W 
Result WwW 


Execution 
For Each 
Sector 
Repeat: | 


Execution 
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Table 24. FDC Command Set (Continued) 


Data Bus 
Phase |R/W : 
D7 D6 D5 D4 D3 D2 D1 DO 


0 0 0 
1 0 0 
0 MFM 0O 
0 0 0 
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eseeerte eee see eee eer €& 
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oseveeeeveeveere see eee 
ose seeveveev eo oversee e eee 
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ee 
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eosoeeeeevre ee ee > oO we ww 
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eoereeveeee eee eer eee ee 


Version 


0 0 0 
Format Track 


1 1 0 1 
0 HDS DS1 DSO 


oveeeevreeeseereeeve se seo 


oeoeeveeveet ees ee te te oe + © 


eeoev eve see eveev eset eve 


STO 
ST 1 
ST 2 
Undefined 
Undefined 
Undefined 
Undefined 


Scan Equal 
0 0 0 0 
0 HDS DSi _ DSO 


eeseesevrs eevee see eevee 


eeeee eee ee eo eee es ew @ 


eer eeerer ee eee eee eee 


covers se eee ee vs ee ese 


eee eee eveer eee ee se wo © 
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Command Codes 
Enhanced Controller 


Command Codes 


Bytes/Sector 
Sector/Cylinder 
Gap 3 

Filler Byte 


Input Sector 
Parameters 


FDC Formats an 
Entire Cylinder 


Status information 
after Command 
Execution 


Command Codes 


Sector ID 
Information Prior 
to Command 
Execution 


Data Compared 
Between the FDD 
and Main-System 


Status Information 
After Command 
Execution 


Sector ID 
Information After 
Command 
Execution 
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Table 24. FDC Command Set (Continued) 


Phase | R/W pau Remarks 
. D7. D6 D5 D4 D3. D2 D1 DO . 


Scan Low or Equal 


Command Codes 
HDS DS1 DSO | 

Sector ID 

Information Prior 

to Command 

Execution 


Execution 
Data Compared 

Between the FDD 
and Main-System 


Result Status Information 
After Command 
Execution 


Sector ID 
Information After 
Command | 
Execution — 


Command 1 0 1 Command Codes 
HDS DS1 DSO 
Sector ID 
Information Prior 
to Command 
Execution 


Execution 
Data Compared 
Between the FDD 
and Main-System 


Result Status Information 
After Command 
Execution 


Sector ID 
Information After 
Command 
Execution 


W 1 1 Command Codes 
WwW DSO DS1 | Enhanced 
Controller 


Command 


Head Retracted to 
Track 0 Interrupt 


Execution 
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Table 24. FDC Command Set (Continued) 


. Data Bus 
R/W 3 
. |. D7 D6 D5 D4 D3 D2 D1 DO 
bt 3 7 | Sense Interrupt Status 
Command] W | 0 0 0 0 1 0 0 0 
Result Fit Gh ate oacomee wet ee emes Sl § § .tattv dedi oot 
Ee Sl) Ui taecue eee eee PON:.: . ... «teedebodaniimaairags 
Specify 
.|Command} W 0 0 0 0 . 0 0 1 1 
WS Shh Gat a's SHI <ciiSe¢ece jo j.§ ted: HUT  _ .... . ss 
We ahs Serer tee, wae HE: --. Savceredewsiee cade ND 
Sense Drive Status 
‘Command! W 0 ) 0 0 ) 1 e) ) 
| W 0 0 0 0 0 HDS DS1 DSO 
Result BY cau Guaste aca hb cared SiG  —«-- - Sees eee eas 


Drive Specification Command 


Command WwW 1 0 0 0 1 1 
WwW 0 FD1 FDO PTS DRT1 DRTO 
W DN NRP 0 0 0 0 
Result R | 0 0 0 . PTS _DRT1 DRTO 
| R 0 0 0 PTS DRT1 DRTO 
R 0 0 0 0 0 0 
R 0 0 0 0 0 0 
| Seek 
Command; W 0 0 0 0 1 1 1 1 
| WwW 0. 0 0 0 0 HDS DS1 DSO 
Execution WO a tcdesaew ed ta iews NCN $$ i = —§ wvevdewevecodevers 
| | Configure | 
Command| W 0 0 0 1 0 0 | 1 
WwW 0 0 0 0 0 0 0 0 
WwW 0 EIA EFIFO POLL © ...... FIFOTHR.......... 
WV! 1 fn ere ee esas @ See PRETRK —son_ in. cece eee eens 
Relative Seek | 
Command| W 1 DIR # 0 0 1 1 1 1 
W 0 0 0 0 0 HDS DSi DSO 
WV Uh Seats tee en etn eee RGN: .  - sie Sareea cis 
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Command Codes 


Status Information 
at the End of Each 
Seek Operation 


Command Codes 


Command Codes 


Status Information 
About FDD 


-| Command Code 


0-4 bytes issued 


Drive O 
Drive 1 
RSVD 
-RSVD 


Command Codes 


Head is Positioned 
Over Proper 
Cylinder on Diskette 


Command Code 


Command Code 
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Table 24. FDC Command Set (Continued) 


ee 


Command| W Note: Registers 
Execution placed in FIFO 
Result R PCN-Drive 0 

R PCN-Drive 1 

R PCN-Drive 2 

R PCN-Drive 3 

R 

R 

R GAP WGATE 

R FIFOTHR 

R 
Command; W 0 | 0 Commands 

W HDS DS1 DSO 


The First Correct ID 
Information on the 
Cylinder is Stored in 
Data Register 


Status Information 
After Command 
Execution 


Result 


Disk Status After the 
Command has 
Completed 


Perpendicular Mode . | 


Command! W 0 0 0 1 0 0 1 0 Command Codes 
W OW 0 0 0 D1 DO GAP WGATE 

Command! W |LOCK 0 0 | 0 1 0 0 Command Codes 

Result R 0 0 0 LOCK 0 0 0 0 

Command! W Command Code 

Result R Part ID Number 


| Powerdown Mode 


DUVIIID 


Command; W Command Code 
W 
Result R 
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Table 24. FDC Command Set (Continued) 


R/W 
| D7 D6 D5 D4 D3 D2 D1 DO 
Option 7 
Command| W 0 0 1 1 0 0 1 1 Command Code 
ee eee RSVD.  ahorretdcadweateiate oun Sees ISO . 
| Save . 
0 1 0 1 1 1 
R |RSVD RSVD PD PC2 -PC1 PCO DRATE1 DRATEO|Save Information to 
OSC 
0 () 0 0 0 0 
Tip lewelae awa ese PCN-Drive 0 .. 0... ccc ce cc ee eens 
Sania Bawa Taiaees PGN-Diive 1 o56.ded feae eb ee ei ovens 
baton wena bee aes PCN-Drive 2 ..... 0... cc ccc ce eee eee 
en a rene PON-DING S -cuexceesue pases hunse seus 
SAT: tase 9 - See eies AUT: ..2evstersuto rn’. 
Staak ane tlie ee teseaniaes HLT ee re re 
Ghee Gaara Ne atta and ets SC/EOT mia Lite a haeeatek & desire ate ye 
0 @) 0 . D1 DO GAP 
EIS EFIFO POLL a _ FIFOTHR 
Te re rere PRETRK eae aE Deane 
O EREG 0 RSVD FDI MIN 
EN TRI DLY 
Ba aieie se Macon er DISK/SIATUS winsie in endte tecbueeseds 
dy ie tig teks ster RSVD ee er eee eee 
ee ere ere RSVD ee ae eT ree 
Restore 
1 () 0 1 1 1 0 Command Code 
0 0 PC2 PC1 PCO DRATE1 DRATEO!Restore Original 
0 () 0 0 0 0 ISO | Register Status 
ssihaai ace Mies tue sy a wear PCN-Drive 0 pees bowed a es 
ieee ine eae Soe poedee PCN-Drive 1 Gtasret tiie reiterate oaievemeedy te 
ee Ee Eee Tee PCN-Drive 2 lLateauae omar eeeaue eee 
ee re ee PCN-Drive 3 te Satis Grasse dette 
SRA 25525 ° « eleides es AUT. se teretestnes ants 
er ee eee eee HL. ice tsaa orate meveuiaes- AND 
Paice a Rais eemoned SC/ EO): -si cea dcitS oreeoesekds eae euc 
0 9) 0 D1 DO GAP WGATE 
EIS EFIFO POLL. ...6.245. 04% PIFOPRR asstarsiaeeedss 
wiisarele udaauwacts PRETRK RUealiun te dee ne ae eas 
O EREG 0 RSVD FDI MIN AUTO 
EN TRI DLY PD 
Lice acess Gla outa as DISK/STATUS acs Fewen ceed ovedy eke 
ee ee a oe RSVD iv Bates Sold alone was ee 
iite sce wees RSVD i <-+eeaepbue whine cou gectows 
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Table 24. FDC Command Set (Continued) | 


Data Bus 
Phase |R/W Remarks 
7 D3 D2 D1 DO 


Format and Write 


1 0 1 | Command Code 
HDS DS1 DSO 


Execution Input 
repeated Sector 

for each Parameters 
sector 


Data Transfer Of N Bytes 


FDC Formats and 
Writes Entire Track 
ST 0 
ST 1 

ST 2 
Undefined 
Undefined 
Undefined 
Undefined 


Command] W Invalid Command | 
. Codes (Noop—FDC 
goes into Standby 
State) 


esa | A 
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Parameter Abbreviations 


Symbol 
AUTO PD 


DIR # 


DSO, DS1 


DTL - 


DRATE[O:1] 
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Description 


AUTO POWERDOWN CONTROL: When AUTO PD=0, automatic powerdown is disabled. 
When AUTO PD=1, automatic powerdown is enabled. 

CYLINDER ADDRESS: The currently selected cylinder address, 0 to 255. 

DRIVE SELECT 0-1: Designates which drives are Perpendicular drives. A 1 indicates Per- 
pendicular drive. 

DATA PATTERN: The pattern to be written in each sector data field during formatting. 
DONE: This bit indicates that this is the last byte of the drive specification command. The 
FDC checks to see if this bit is 1 or 0. When DN=0, the FDC expects more bytes. 

DN=0 FDC expects more subsequent bytes. 

DN=1 Terminates the command phase and enters the results phase. An additional benefit 


is that by setting this bit to 1, a direct check of the current drive specifications can be 
done. 


DIRECTION CONTROL: When DIR#=0, the head steps out from the spindle during a 
relative seek. When DIR # = 1, the head steps in toward the spindle. 


DISK DRIVE SELECT: 


*Available wher. FDDQTY =1 in the FCFG1 Register (see Appendix A, FDC Four Drive 
Support) 

SPECIAL SECTOR SIZE: By setting N to zero (00), DTL may be used to control the number 
of bytes transferred in disk read/write commands. The sector size (N=0) is set to 128. If the 
actual sector (on the diskette) is larger than DTL, the remainder of the actual sector is read 
but is not passed to the host during read commands; during write commands, the remainder 
of the actual sector is written with all zero bytes. The CRC check code is calculated with the 
actual sector. When N is not zero, DTL has no meaning and should be set to FFh. 


DATA RATE: Data rate values from the DSR register. 
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Symbol | 
DRTO, DRT1 


DTO,DT1 
EC 
EFIFO 
EIS 

EOT 
EREG EN 


FDI TRI 


82091AA 


Description | 


DATA RATE TABLE SELECT: These two bits select between the different data rate tables. 
The default is the conventional table. These also provide mapping of the data rates selected 
in the DSR and CCR. The table below shows this. 


Bits in DSR | 


SS~*ég in 
fe 

To [ft Kp 
[41 | 0 [esos | 
To [4] Revo [revo [rs [ Rev 
[ef rsvo | revo [revo [revo 
1 [eo [0 [sone [ 
Se a 0 
Se 


DRIVE DENSITY SELECT TYPE: These bits select the outputs on DRVDENO and 
DRVDEN1 (see DRIVE SPECIFICATION Command). 


ENABLE COUNT: When EC=1, the DTL parameter of the Verify Command becomes SC 
(Number of sectors per track). 


Enable FIFO: When EFIFO=0, the FIFO is enabled. EFIFO=1 puts the FDC in the 8272A 
compatible mode where the FIFO is disabled. , 
ENABLE IMPLIED SEEK: When EIS=1, a seek operation is performed before executing 
any read or write command that requires the C parameter in the command phase. EIS=0 
disables the implied seek. 

END OF TRACK: The final sector number of the current track. 


ENHANCED REGISTER ENABLE: When EREG EN=1, the TDR register is extended and 
SRB is made visible to the user. When EREG EN=0, the standard registers are used. 


FLOPPY DRIVE INTERFACE TRI-STATE: When FDI TRI=0, the output pins of the floppy 
disk drive interface are tri-stated. This is also the default state. When FDI TRI= 1, the floppy 
disk drive interface remains unchanged. 
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Symbol 
FDO, FD1 


GAP 
GPL 


H/HDS 
HLT 


HUT 
ISO 
LOCK 


MFM 
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Description 


FLOPPY DRIVE SELECT: These two bits select which physical drive is being specified. The 
FDn corresponds to FDSn and FDMEn on the floppy drive interface. The drive is selected 
independent of the BOOTSEL bit in the TDR. Refer to Section 8.1.3, TDR—Enhanced Tape 
Drive Register, which explains the distinction between physical drives and their virtual map- 
ping as defined by the BOOTSEL bit. | 


*Available if the four floppy drive option is selected in the FCFG1 Register. 

GAP: Alters Gap 2 length when using Perpendicular Mode. 

GAP LENGTH: The gap 3 size. (Gap 3 is the space between sectors excluding the VCO 
synchronization field). 

HEAD ADDRESS: Selected head: 0 or 1 (disk side 0 or 1) as encoded in the sector ID field. 
HEAD LOAD TIME: The time interval that FDC waits after loading the head and before 
initiating a read or write operation. Refer to the SPECIFY Command for actual delays. 


HEAD UNLOAD TIME: The time interval from the end of the execution phase (of a read or 
write command) until the head is unloaded. Refer to the SPECIFY Command for actual 
delays. 

ISO FORMAT: When !SO = 1, the ISO format is used for all data transfer commands. When 
ISO=0, the normal IBM system 34 and perpendicular is used. The default is ISO=0. 


LOCK: Lock defines whether EFIFO, FIFOTHR, and PRETRK parameters of the CONFIG- 
URE Command can be reset to their default values by a software reset (Reset made by | 
setting the proper bit in the DSR or DOR registers). 


MFM MODE: A one selects the double density (MFM) mode. A zero is reserved. 
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Symbol 
MIN DLY 


MT 


NCN 
ND 


NRP 


OW 


82091AA 


Description 

MINIMUM POWERUP TIME CONTROL: This bit is active only if AUTO PD bit is enabled. 
When MIN DLY=0, a 10 ms minimum powerup time is assigned and when MIN DLY=1, a 
0.5 sec. minimum powerup time is assigned. | 
MULTI-TRACK SELECTOR: When MT =1, the multi-track operating mode is selected. In 
this mode, the FDC treats a complete cylinder, under head 0 and 1, as a single track. The 
FDC operates as if this expanded track started at the first sector under head 0 and ended at 
the last sector under head 1. With this flag set, a multitrack read or write operation will 
automatically continue to the first sector under head 1 when the FDC finishes operating on 
the last sector under head 0. 
SECTOR SIZE CODE: This specifies the number of bytes in a sector. When N=O0Oh, the 
sector size is 128 bytes. The number of bytes transferred is determined by the DTL parame- 
ter. Otherwise the sector size is (2 raised to the “N’th’ power) times 128. All values up to 
07h are allowable. A value of 07h equals a sector size of 16 Kbytes. It is the users responsi- 
bility to not select combinations that are not possible with the drive. 


Tw | Sector Size 
Foo | 128 bytes 
roe 


512 bytes 


ee 


NEW CYLINDER NUMBER: The desired cylinder number. 


NON-DMA MODE FLAG: When ND=1, the FDC operates in the non-DMA mode. In this 
mode, the host is interrupted for each data transfer. Wnen ND=0, the FDC operates in DMA 
mode and interfaces to a DMA controller by means of the DRQ and DACK# signals. 


NO RESULTS PHASE: When NRP=1, the result phase is skipped. When NRP=0O, the 
result phase is generated. 


OVERWRITTEN: The bits denoted DO and D1 of the PERPENDICULAR MODE Command 
can only be overwritten when OW = 1. 
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Symbol 

PCN 
-PC2,PC1,PCO 
PDOSC 

PTS 


POLL 


PRETRK 
R 


RCN 
SC 
SK 
SRT 
STO-3 


_WGATE 


a fae 
intel. 
Description 


PRESENT CYLINDER NUMBER: The current position of the head at. ‘the completion of 
SENSE INTERRUPT STATUS Command. 


PRECOMPENSATION VALUES: Precompensation values from the DSR register. 
POWERDOWN OSCILLATOR: When this bit is set, the internal oscillator is turned off. 


PRECOMPENSATION TABLE SELECT: This bit selects whether to enable the precompen- 
sation value programmed in the DSR or not. In the default state, the value programmed in 
DSR will be usec. More information regarding the precompensation is available in Section 
8.1.5. 


PTS=0 DSR programmed precompensation delays 


~ PTS=1No precompensation delay is selected for the corresponding drive. 


POLLING DISABLE: When POLL=1, the internal polling routine is disabled. When 
POLL=0, polling is enabled. 


PRECOMPENSATION START TRACK NUMBER: Programmable from track 00 to FFh. 


SECTOR ADDRESS: The sector number to be read or written. In multi-sector transfers, this 
parameter specifies the sector number of the first sector to be read or written. 

RELATIVE CYLINDER NUMBER: Relative cylinder offset from present cylinder. as used by 
the RELATIVE SEEK Command. | 
NUMBER OF SECTORS: The number of sectors to be initialized by the FORMAT Command. 
The number of sectors to be verified during a Verify Command, when EC= 1. 

SKIP FLAG: When SK = 1, sectors containing a deleted data address mark will automatically 
be skipped during the execution of a READ DATA Command. If a READ DELETED DATA 
Command is executed, only sectors with a deleted address mark will be accessed. When 
SK=0, the sector is read or written the same as the read and write commands. | 

STEP RATE INTERVAL: The time interval between step pulses issued by the FDC. Pro- 
grammable from 0.5 ms to 8 ms, in increments of 0.5 ms at the 1 Mbit data rate. Refer to the 
SPECIFY Command for actual delays. 


STATUS REGISTERS 0-3. Registers within the FDC that store status information after a 


command has been executed. This status information is available to the host during the — 
result phase after command execution. 


WRITE GATE: Write gate alters timing of WE, to allow for pre-erase loads in perpendicular 
drives. 


8.5.1 STATUS REGISTER ENCODING 


The contents of these registers are available only through a command sequence. 
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8.5.1.1 Status Register 0 
Le es eee 
7,6 Interrupt Code 00 Normal termination of command. The specified command 
was properly executed and completed without error. 
01 Abnormal termination of command. Command execution was 
started, but was not successful completed. 
10 Invalid command. The requested command could not be 
executed. 
11 Abnormal termination caused by Polling. 
SE Seek End The 82091AA completed a SEEK or RECALIBRATE command, 
or a READ or WRITE with implied seek command. 
Equipment Check | The TRK pin failed to become a “1” after: 
1. 80 step pulses in the RECALIBRATE COMMAND. 
2. The RELATIVE SEEK command causes the 82078 to step 
outward beyond Track 0. 
rs | — | — _| Unused. This bitis always “0 : 
Head Address The current head address. 
DS1,0 Drive Select The current selected drive. 
8.5.1.2 Status Register 1 
Symbol | Name Description 

End of Cylinder The 82078 tried to access a section beyond the final sector of 
the track (255D). Will be set if TC is not issued after Read or 
Write Data Command. 

Unused. This bit is always “O”’. 

er Error The 82078 detected a CRC error in either the ID field or the data 
field of a sector. 

Overrun/Underrun | Becomes set if the 82078 does not receive CPU or DMA service 
within the required time interval, resulting in data overrun or 
underrun. 

(3 | = | == | Unused. Ths bit is always “0”. | 

2 ND No Data Any one of the following: 

1. READ DATA, READ DELETED DATA command, the 
82091AA did not find the specified sector. 

2. READ ID command, the 82091AA cannot read the ID field 
without an error. 

3. READ TRACK command, the 82091AA cannot find the 
proper sector sequence. 

1 NW Not Writable WP pin became a ‘'1” while the 82091AA is executing a WRITE 
DATA, WRITE DELETED DATA, or FORMAT TRACK 
command. 

MA Missing Any one of the the following: 
Address Mark 1. The 82091AA did not detect an ID address mark at the 
specified track after encountering the index pulse from the 
INDX # pin twice. 
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2. The 82091AA cannot detect a data address mark ora 
deleted data address mark on the specified track. 
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8.5:1.3 Status Register 2 


Unused. This bit is always “O”’. | 


Any one of the following: 
1. READ DATA command, the 82078 encounters a deleted data 
address mark. 


2. READ DELETED DATA command, the 82078 encountered a data 
address mark. 


Data Error in The 82091AA detected a CRC error in the data field. 
Data Field 
Wrong The track address from the sector ID field is different from the track 
Cylinder address maintained inside the 82091AA. 


Unused. This bit is always “0”’. 
Unused. This bit is always “‘O”’. 


Bad Cylinder The track address from the sector ID field is different from the track 
address maintained inside the 82091AA and is equal to FF hex 
which indicates a bad track with a hard error according to the IBM 
soft-sectored format. 


Missing Data 
Address Mark 


Unused. This bit is always “0” _ 
Indicates the status of the HDSEL pin. : 
Indicates the status of the DS1, DSO pins. | ; 
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8.5.2 DATA TRANSFER COMMANDS 


All of the READ DATA, WRITE DATA and VERIFY 
type commands use the same parameter bytes and 
return the same results information. The only differ- 
ence being the coding of bits[4:0] in the first byte. 


An implied seek will be executed if the feature was 
enabled by the CONFIGURE Command. This seek 
is completely transparent to the user. The Drive 
Busy bit for the drive will go active in the Main Status 
Register during the seek portion of the command. A 
seek portion failure is reflected in the results status 
normally returned for a READ/WRITE DATA Com- 
mand. Status Register 0 (STO) contains the error 
code and C contains the cylinder that the seek 
failed. 


8.5.2.1 Read Data 


A set of nine bytes is required to place the FDC into 
the Read Data Mode. After the READ DATA Com- 
mand has been issued, the FDC loads the head (if it 
is in the unloaded state), waits the specified head 
settling time (defined in the SPECIFY Command), 
and begins reading ID address marks and ID fields. 
When the sector address read from the diskette 
matches with the sector address specified in the 
command, the FDC reads the sector’s data field and 
transfers the data to the FIFO. 


After completion of the read operation from the cur- 
rent sector, the sector address is incremented by 
one, and the data from the next logical sector is read 
and output via the FIFO. This continuous read func- 
tion is called ‘‘Multi-Sector Read Operation’. Upon 
receipt of TC or an implied TC (FIFO overrun/under- 
run), the FDC stops sending data. However, the FDC 
will continue to read data from the current sector, 
check the CRC bytes, and, at the end of the sector, 
terminate the READ DATA Command. 


N determines the number of bytes per sector (Table 
25). If N is set to zero, the sector size is set to 128. 
The DTL value determines the number of bytes to 
be transferred. If DTL is less than 128, the FDC 
transfers the specified number of bytes to the host. 
For reads, it continues to read the entire 128 byte 
sector and checks for CRC errors. For writes it com- 
pletes the 128 byte sector by filling in zeroes. If N is 
not set to OOh, DTL should be set to FFh, and has no 
impact on the number of bytes transferred. 
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Table 25. Sector Sizes 


00 ta yt 
256 Bytes 
512 Byte 


1024 Bytes 
16 KBytes 


The amount of data that can be handled with a sin- 
gle command to the FDC depends on MT (multi- 
track) and N (Number of bytes/sector). 


Table 26. Effects of MT and N Bits 
wr) Max. Transfer 
Capacity Read from Disk 
ro [4 | 256% 26 = 656 | 26at sided ort 
Po [2 [62x16 = 7680 | 1Satsidedort 


0 | 3| 1024x 8 = 8192 | 8atsideOor1 
1024 x 16 = 16384 | 16 atside 1 


The Multi-Track function (MT) allows the FDC to 
read data from both sides of the diskette. For a par- 
ticular cylinder, data will be transferred starting at 
sector 1, side 0 and completing at the last sector of 
the same track at side 1. 


If the host terminates a read or write operation in the 
FDC, the ID information in the result phase is depen- 
dent on the state of the MT bit and EOT byte. Refer 
to Table 29. The termination must be normal. 


At the completion of the READ DATA Command, 
the head is not unloaded until after the Head Unload 
Time Interval (specified in the SPECIFY Command) 
has elapsed. If the host issues another command 
before the head unloads, the head settling time may 
be saved between subsequent reads. 


If the FDC detects a pulse on the INDEX # pin twice 
without finding the specified sector (meaning that 
the diskette’s index hole passes through index de- 
tect logic in the drive twice), the FDC sets the IC 
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code in Status Register 0 to 01 (Abnormal termina- 
tion), sets the ND bit in Status Register 1 to 1 indi- 
cating a sector not found and terrninates the READ 
DATA Command. 


After reading the ID and data fields in each sector, 
the FDC checks the CRC bytes. If a CRC error oc- 
curs in the ID or data field, the FDC sets the IC code 
in Status Register 0 to 01 (Abnormal termination), 
sets the DE bit flag in Status Register 1 to 1, sets the 
DD bit in Status Register 2 to 1 if CRC is incorrect in 
the ID field, and terminates the READ DATA Com- 
mand. 


intel. 


Table 27 describes the affect of the SK bit on the 
READ DATA command execution and results. 


8.5.2.2 Read Deleted Data 


This command is the same as the READ DATA 
Command, except that it operates on sectors that 
contain a deleted data address mark at the begin- 
ning of a data field. Table 28 describes the affect of 
the SK bit on the READ DELETED DATA Command. 
execution and results. 


Table 27. Skip Bit vs READ DATA Command 


SK Bit | Data Address Mark 
Value Type Encountered 


Deleted Data 


gs 


Deleted Data 


Results CM Bit oa 
of ST2 Set? Description of Results 


fF NO Normal Termination 
| No Normal Termination 


Address Not Incremented. Next Sector 
Not Searched For. 


Normal Termination Sector Not Read 
(“Skipped’’) 


Except where noted in Table 27, the C or R value of the sector address is automatically incremented (see 


Table 29). : 


Table 28. Skip Bit vs READ DELETED DATA Command 


SK Bit | Data Address Mark 
Value | Type Encountered 
Normal Data 


Deleted Data | 


i 


Deleted Data _ 


Results CM Bit ae 7 
| of ST2 Set? Description of Results | 


Address Not Incremented. Next Sector 
Not Searched For. 

Normal Termination Sector Not Read 
(“Skipped’’) os 


Yes 
Yes 


Normal Termination 


Normal Termination 


Except where noted in Table 28, the C or R value of the sector address is automatically incremented (see 


Table 29). 
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Table 29. Result Phase 


wT | Head Final Sector Transferred to Host _— a0 i<s at Result Phase 


Ss a 
To [| eqattoecor Sic | ne on] Nc 
Tess thanzor nee at [No 
[eaettocor ic ne |r| Nc 
Te [tess thanzor ince at [NC 
a ewaltoeot in| se or | Nc 
[tess thantgor Yn | nc | ava | Nc 


ae eee Equal to EOT 


NOTE: 


1. NC=no change; the same value as the one at the beginning of command execution. 


2. LSB =least significant bit; the LSB of H is complemented. 


8.5.2.3 Read Track 


This command is similar to the READ DATA Com- 
mand except that the entire data field is read contin- 
uously from each of the sectors of a track. Immedi- 
ately after encountering a pulse on the INDEX# pin, 
the FDC starts to read all data fields on the track as 
continuous blocks of data without regard to logical 
sector numbers. If the FDC finds an error in the ID or 
DATA CRC check bytes, it continues to read data 
from the track and sets the appropriate error bits at 
the end of the command. The FDC compares the ID 
information read from each sector with the specified 
value in the command and sets the ND flag to 1 in 
Status Register 1 if there is no comparison. Multi- 
track or skip operations are not allowed with this 
command. The MT and SK bits (Bits D7 and D5 of 
the first command byte respectively) should always 
be set to 0. 


This command terminates when the EOT specified 
number of sectors have been read. If the FDC does 
not find an ID address mark on the diskette after the 
second occurrence of a pulse on the INDEX# pin, 
then it sets the IC code in Status Register 0 to 01 
(Abnormal termination), sets the MA bit in Status 
Register 1 to 1, and terminates the command. 
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8.5.2.4 Write Data 


After the WRITE DATA Command has been issued, 
the FDC loads the head (if it is in the unloaded 
state), waits the specified head load time if unloaded 
(defined in the SPECIFY Command), and begins 
reading ID fields. When the sector address read 
from the diskette matches the sector address speci- 
fied in the command, the FDC reads the data from 
the host via the FIFO, and writes it to the sector’s 
data field. 


After writing data into the current sector, the FDC 
computes the CRC value and writes it into the CRC 
field at the end of the sector transfer. The sector 
number stored in R is incremented by one, and the 
FDC continues writing to the next data field. The 
FDC continues this multi-sector write operation. If a 
terminal count signal is received or a FIFO over/un- 
der run occurs while a data field is being written, the 
remainder of the data field is filled with zeros. 


The FDC reads the ID field of each sector and 
checks the CRC bytes. If the FDC detects a CRC 
error in one of the ID fields, it sets the IC code in 
Status Register 0 to 01 (Abnormal termination), sets 
the DE bit of Status Register 1 to 1, and terminates 
the WRITE DATA Command. 
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The WRITE DATA Command operates in much the 
same manner as the READ DATA Command. The 
following items are the same. Please refer to the 
READ DATA Command for details: , 


e Transfer Capacity 

e EN (End of Cylinder) bit 

e ND (No Data) bit. 

Head Load, Unload Time Interval 


iD information when the host terminates the com- 
mand 

Definition of DTL when N=0 and when N does 
not=0 . 


8.5.2.5 Verify 


The VERIFY Command is used to verify the data 
stored on a disk. This command acts exactly like a 
READ DATA Command except that no data is trans- 
ferred to the host. Data is read from the disk, and 
CRC is computed and checked against the previous- 
ly stored value. 


- agi 

intel. 
Because no data is transferred to the host, the TC 
signal cannot be used to terminate this command. 
By setting the EC bit to 1, an implicit TC will be is- 
sued to the FDC. This implicit TC occurs when the 
SC value has decrement to 0 (a SC value of 0 veri- 
fies 256 sectors). This command can also be termi- 
nated by setting the EC bit to 0 and the EOT value 
equal to the final sector to be checked. When 
EC=0, DTL/SC should be programmed to OFFh. 
Refer to Table 29 and Table 30 for information con- 


cerning the values of MT and EC versus SC and 


EOT value. 

Definitions: 

# Sectors Per Side = Number of formatted 
sectors per each side of 
the disk. 

# Sectors Remaining = Number of formatted 


sectors left that can be 
read, including side 1 of 
the disk when MT = 1. 


Table 30. Verify Command Result Phase 


M SC/EOT Value Termination Result 
SC=DTL Successful Termination 
EOT < # Sectors Per Side Result Phase Valid 
SC=DTL 


NOTE: 


EOT > # Sectors Per Side 


SC < # Sectors Remaining 
AND 
EOT < # Sectors Per Side 


SC > # Sectors Remaining 
OR 
EOT > # Sectors Per Side 
~ SC=DTL 
EOT < # Sectors Per Side 


SC=DTL 


- EOT > # Sectors Per Side 


SC < # Sectors Remaining 
AND 
EOT < # Sectors Per Side 


SC > # Sectors Remaining 
OR 
EOT > # Sectors Per Side 


Unsuccessful Termination 
Result Phase Invalid 


Unsuccessful Termination 
Result Phase Invalid 


Successful Termination 


Result Phase Valid 


Unsuccessful Termination 
Result Phase Invalid 


Successful Termination 
| Result Phase Valid 


Unsuccessful Termination 
Result Phase Invalid — 


~ Successful Termination 


Result Phase Valid 


When MT=1 and the SC value is greater than the number of remaining formatted sectors on Side 0, verification continues 


on Side 1 of the disk. 
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8.5.2.6 Format Track 


The FORMAT TRACK Command allows an entire 
track to be formatted. After a pulse from the 
INDEX # pin is detected, the FDC starts writing data 
on the disk including gaps, address marks, ID fields 
and data fields, per the IBM* System 34 (MFM). The 
particular values written to the gap and data field are 
controlled by the values programmed into N, SC, 
GPL, and D which are specified by the host during 
the command phase. The data field of the sector is 
filled with the data byte specified by D. The ID field 
for each sector is supplied by the host. That is, four 
data bytes per sector are needed by the FDC for C, 
H, R, and N (cylinder, head, sector number, and sec- 
tor size, respectively). 


82091AA 


After formatting each sector, the host must send 
new values for C, H, R, and N to the FDC for the 
next sector on the track. The R value (sector num- 
ber) is the only value that must be changed by the 
host after each sector is formatted. This allows the 
disk to be formatted with nonsequential sector ad- 
dresses (inter-leaving). This incrementing and for- 
matting continues for the whole track until the FDC 
encounters a pulse on the INDEX# pin again and it 
terminates the command. 


Table 31 contains typical values for gap fields that 
are dependent on the size of the sector and the 
number of sectors on each track. Actual values can 
vary due to drive electronics. 


Table 31. Typical PC/AT Values for Formatting 


ss ee 


NOTES: 
1. All values are in hex, except sector size. 


2. Gap3 is programmable during reads, writes, and formats. 


3. GPL1=suggested Gap3 values in read and write commands to avoid splice point between data field and !D field of 


contiguous sections. 


4. GPL2 = suggested Gap3 value in FORMAT TRACK Command. 
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8.5.2.7 Format Field 


System 34 Format Double Density 


ot ae GAP 1 cd ee SYNC caiman AM | DATA 
50x 12x bes 
4E 00 es 
F8 


iso Format 


SYNC basen AM | DATA 
= 
: ince 
F8 


Perpendicular Format 


GAP 4a es 1 4 eas GAP 2 | SYNC | AM | DATA 
ne 50x ; 41x i 
4E 4E ioe 
F8 


- Figure 64. System 34, ISO and Perpendicular Formats 


-290486-64 
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8.5.3 CONTROL COMMANDS 


Control commands differ from the other commands 
in that no data transfer takes place. Three com- 
mands generate an interrupt when complete; READ 
ID, RECALIBRATE and SEEK. The other control 
commands do not generate an interrupt. 


8.5.3.1 READ ID Command 


The READ ID Command is used to find the present 
position of the recording heads. The FDC stores the 
values from the first ID field it is able to read into its 
registers. If the FDC does not find an ID address 
mark on the diskette after the second occurrence of 
a pulse on the INDEX # pin, it then sets the IC code 
in Status Register 0 to 01 (Abnormal termination), 
sets the MA bit in Status Register 1 to 1, and termi- 
nates the command. 


The following commands will generate an interrupt 
upon completion. They do not return any result 
bytes. It is recommended that control commands be 
followed by the SENSE INTERRUPT STATUS Com- 
mand. Otherwise, valuable interrupt status informa- 
tion will be lost. 


8.5.3.2 RECALIBRATE Command 


This command causes the read/write head within 
the FDC to retract to the track 0: position. The FDC 
clears the contents of the PCN counter, and checks 
the status of the TRKO pin from the FDD. As long as 
the TRKO pin is low, the DIR# pin remains 0 and 
step pulses are issued. When the TRKO pin goes 
high, the SE bit in Status Register 0 is set to 1, and 
the command is terminated. If the TRKO pin is still 
low after 79 step pulses have been issued, the FDC 
sets the SE and the EC bits of Status Register 0 to 1 
and terminates the command. Disks capable of han- 
dling more than 80 tracks per side may require more 
than one RECALIBRATE Command to return the 
head back to physical Track 0. 


The RECALIBRATE Command does not have a re- 


sult phase. The SENSE INTERRUPT STATUS Com- 
mand must be issued after the RECALIBRATE Com- 
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mand to effectively terminate it and to provide verifi- 
cation of the head position (PCN). During the com- 
mand phase of the recalibrate operation, the FDC is 
in the busy state, but during the execution phase it is 
in a non-busy state. At this time another RECALI- 
BRATE Command may be issued, and in this man- 
ner, parallel RECALIBRATE operations may be 
done on up to 2 drives simultaneously. 


After powerup, software must issue a RECALI- 
BRATE Command to properly initialize all drives and 
the controller. 


8.5.3.3 DRIVE SPECIFICATION Command 


The FDC uses two pins, DRVDENO and DRVDEN1 
to select the density for modern drives. These sig- 
nals inform the drive of the type of diskette in the 
drive. The DRIVE SPECIFICATION Command speci- 
fies the polarity of the DRVDENO and DRVDEN1 
pins. It also enables/disables DSR programmed pre- 
compensation. 


This command removes the need for a hardware 
work-around to accommodate differing specifica- 
tions among drives. By programming this command 
during BIOS’s POST routine, the floppy disk control- 
ler internally configures the correct values for 
DRVDENO and DRVDEN1 with corresponding pre- 
compensation value and data rate table enabled for 
the particular type of drive. 


This command is protected from software resets. Af- 
ter executing the DRIVE SPECIFICATION Com- 
mand, subsequent software resets will not clear the 
programmed parameters. Only another DRIVE 
SPECIFICATION Command or hard reset can reset 
it to default values. The 6 LSBs of the last byte of 
this command are reserved for future use. 


The DRATEO and DRATE1 are values as pro- 
grammed in the DSR register. See Table 32 for pin 
decoding at different data rates. 


Table 32 describes the drives that are supported 


with the DTO, DT1 bits of the DRIVE SPECIFICA- 
TION Command: 
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- Table 32. DRVDENn Polarities | 


NOTE: 
(*) Denotes the default setting 


8.5.3.4 SEEK Command 


The read/write head within the drive is moved from 
track to track under the control of the SEEK Com- 
mand. The FDC compares the PCN which is the cur- 
rent head position with the NCN and performs the 
following operation if there is a difference: 


PCN < NCN: Direction signal to drive set to 1 (step 


in), and issues step pulses. 


PCN > NCN: Direction signal to drive set to 0 (step 
out), and issues step pulses. 


_ The rate at which step pulses are issued is con- 
trolled by SRT (Stepping Rate Time) in the SPECIFY 
Command. After each step pulse is issued, NCN is 
compared against PCN, and when NCN=PCN, then 
the SE bit in Status Register 0 is set to 1, and the 
command is terminated. 
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es 
[soo kts | 
ee 
ee ee 
i 
[$00 Kips 
[soo Kips ft 
[250 Kips 
1 wtps 
[~~ s00 Kips PS 
—— 
ie 
a 
a 
ar aa 
—— 


300 Kbps 
250 Kbps 


During the command phase of the seek or recaili- 


brate operation, the FDC is in the busy state, but 
during the execution phase it is in the non-busy 
state. 


Note that if implied seek is not enabled, the read and 
write commands should be preceded by: 


1. SEEK Command; 
Step to the proper track 

2. SENSE INTERRUPT STATUS Command; 
Terminate the SEEK Command 

3. READ ID. 
Verify head is on proper track 

4. Issue READ/WRITE Command. 
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The SEEK Command does not have a result phase. 
Therefore, it is highly recommended that the SENSE 
INTERRUPT STATUS Command be issued after the 
SEEK Command to terminate it and to provide verifi- 
cation of the head position (PCN). The H bit (Head 
Address) in STO will always return a 0. When exiting 
DSR Powerdown mode, the FDC clears the PCN 
value and the status information to zero. Prior to is- 
suing the DSR POWERDOWN Command, it is highly 
recommended that the user service all pending in- 
terrupts through the SENSE INTERRUPT STATUS 
Command. 


8.5.3.5 SENSE INTERRUPT STATUS Command 
An interrupt signal on the INT pin is generated by the 
FDC for one of the following reasons: 
1. Upon entering the Result Phase of: 
a. READ DATA Command 
b. READ TRACK Command 
c. READ ID Command 
d. READ DELETED DATA Command 
e. WRITE DATA Command 
f. FORMAT TRACK Command 
g. WRITE DELETED DATA Command 
h. VERIFY Command 
2. End of SEEK, RELATIVE SEEK or 
RECALIBRATE Command 


3. FDC requires a data transfer during the execution 
phase in the non-DMA Mode 


The SENSE INTERRUPT STATUS Command resets 
the interrupt signal and via the IC code and SE bit of 
Status Register 0, identifies the cause of the inter- 
rupt. If aa SENSE INTERRUPT STATUS Command is 
issued when no active interrupt condition is present, 
the status register STO will return a value of 80h 
(invalid command). 
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The SEEK, RELATIVE SEEK and the RECALI- 
BRATE Commands have no result phase. The 
SENSE INTERRUPT STATUS Command must be 
issued immediately after these commands to termi- 
nate them and to provide verification of the head 
position (PCN). The H (Head Address) bit in STO will 
always return a 0. If aSENSE INTERRUPT STATUS 
is not issued, the drive, will continue to be busy and 
may effect the operation of the next command. 


8.5.3.6 SENSE DRIVE STATUS Command 


The SENSE DRIVE STATUS Command obtains 
drive status information. It has no execution phase 
and goes directly to the result phase from the com- 
mand phase. STATUS REGISTER 3 contains the 
drive status information. 


8.5.3.7 SPECIFY Command 


The SPECIFY Command sets the initial values for 
each of the three internal timers. The HUT (Head 
Unload Time) defines the time from the end of the 
execution phase of one of the read/write commands 
to the head unload state. The SRT (Step Rate Time) 
defines the time interval between adjacent step 
pulses. Note that. the spacing between the first and 
second step pulses may be shorter than the remain- 
ing step pulses. The HLT (Head Load Time) defines 
the time between the command phase to the execu- 
tion phase of a READ DATA or Write Data Com- 
mand. The Head Unload Time (HUT) timer goes 
from the end of the execution phase to the begining 
of the result phase of a READ Data or Write Data 
Command. The values change with the data rate 
speed selection and are documented in Table 34. 


Table 33. Interrupt Identification 
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[1 [00 [Normal Terinaton of SEEK or REGALIGRATE Command 
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_ Table 34. Drive Control Delays (ms) 


Table 35. Head Load Time (ms) 


HLT 
500K 300K : 250K 
256 426 512 
2 3.3 4 
4 6.7 8 
252 420 ~ 504 
254 423 - 508 


The choice of DMA or non-DMA operations is made 
by the ND bit. When ND=1, the non-DMA mode is 
selected, and when ND=0, the DMA mode is se- 
lected. In DMA mode, data transfers are signalled by 


the DRQ pin. Non-DMA mode uses the RQM bit and — 


the IRQ6 pin to signal data transfers. 


8.5.3.8 CONFIGURE Command 


Issue the configure command to enable features like 
the programmable FIFO and_set the begining track 
for precompensation. A CONFIGURE Command 
need not be issued if the default values of the FDC 
meets the system requirements. 


CONFIGURE DEFAULT VALUES: 


EFIFO—Enable FIFO. When EFIFO= 1, the FIFO is 


disabled (8272A compatible mode). This means — 


data transfers are asked for on a byte by byte basis. 
The default value is 1 (FIFO disabled). The threshold 
defaults to one. 


POLL—Disable Polling. When POLL=1, polling of 


the drives is disabled. POLL Defaults to 0 (polling 
enabled). When enabled, a single interrupt is gener- 
ated after a reset. No polling is performed while the 
drive head is loaded and the head unload delay has 

not expired. ; 


FIFOTHR—The FIFO threshold in the execution 
phase of a read/write command. This is programma- 
ble from 1 to 16 bytes. FIFOTHR defaults to one 
byte. A 00 selects one byte and a OF selects 


EIS No Implied Seeks 16 bytes. 
EFIFO FIFO Disabled . 7 : 
POLL Polling Enabled PRETRK—Precompensation start track number. 


FIFOTHR FIFO Threshold Set to 1 Byte 
PRETRK Pre-Compensation Set to Track 0 


ElS—Enable Implied Seek. When EIS=1, the FDC 
will perform a SEEK operation before executing a 
read/write command. The default value is 0 (no im- 
plied seek). 
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Programmable from track 0 to 255. PRETRK de- 
faults to track 0. A 00h selects track 0 and a FFh 
selects 255. 
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8.5.3.9 VERSION Command 


The VERSION Command checks to see if the con- 
troller is an enhanced type (82077, 82077AA, 
82077SL) or the older type (8272A/765A). A value 
of 90h is returned as the result byte, defining an en- 
hanced FDD controller is in use. No interrupts are 
generated. 


8.5.3.10 RELATIVE SEEK Command 


The RELATIVE SEEK Command is coded the same 
as for the SEEK Command, except for the MSB of 
the first byte and the DIR # bit. 


DIR# Head Step Direction Control 


ACTION 
fo Step Head Out 
Step Head In 


RCN Relative Cylinder Number that determines 
how many tracks to step the head in or out 
from the current track number. | 


The RELATIVE SEEK Command differs from the 
SEEK Command in that it steps the head the abso- 
lute number of tracks specified in the command in- 
stead of making a comparison against an internal 
register. The SEEK Command is good for drives that 
support a maximum of 256 tracks. RELATIVE 
SEEKs cannot be overlapped with other RELATIVE 
SEEKs. Only one RELATIVE SEEK can be active at 
a time. Bit 4 of Status Register 0 (EC) will be set to 1 
if RELATIVE SEEK attempts to step outward beyond 
Track 0. 


As an example, assume that a floppy drive has 300 
useable tracks and that the host needs to read track 
300 and the head is on any track (0-255). If a SEEK 
Command is issued, the head stops at track 255. Ifa 
RELATIVE SEEK Command is issued, the FDC 
moves the head the specified number of tracks, re- 
gardiess of the internal cylinder position register (but 
increments the register). If the head had been on 
track 40 (D), the maximum track that the FDC could 
position the head on using RELATIVE SEEK, is 296 
(D), the initial track, +256 (D). The maximum count 
that the head can be moved with a single RELATIVE 
SEEK Command is 256 (D). 


The internal register, PCN, would overflow as the 
cylinder number crossed track 255 and would con- 
tain 40 (D). The resulting PCN value is thus (NCN + 
PCN) mod 256. Functionally, the FDC starts count- 
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ing from 0 again as the track number goes above 
255(D). It is the users responsibility to compensate 
FDC functions (precompensation track number) 
when accessing tracks greater than 255. The FDC 
does not keep track that it is working in an “extend- 
ed track area” (greater than 255). Any command is- 
sued uses the current PCN value, except for the RE- 
CALIBRATE Command that only looks for the 
TRACKO signal. RECALIBRATE returns an error if 
the head is farther than 79 due to its limitation of 
issuing a maximum 80 step pulses. The user simply 
needs to issue a second RECALIBRATE Command. 
The SEEK Command and implied seeks function 
correctly within the 44 (D) track (299-255) area of 
the extended track area. It is the users responsibility 
not to issue a new track position that exceeds the 
maximum track that is present in the extended area. 


To return to the standard floppy range (0-255) of 
tracks, a RELATIVE SEEK is issued to cross the 
track 255 boundary. 


A RELATIVE SEEK Command can be used instead 
of the normal SEEK Command but the host is re- 
quired to calculate the difference between the cur- 
rent head location and the new (target) head loca- 
tion. This may require the host to issue a READ ID 
Command to ensure that the head is physically on 
the track that software assumes it to be. Different 
FDC commands return different cylinder results 
which may be difficult to keep track of with software 
without the READ ID Command. 


8.5.3.11 DUMPREG Command 


The DUMPREG Command is designed to support 
system run-time diagnostics and application soft- 
ware development and debug. The command re- 
turns pertinent information regarding the status of 
many of the programmed fields in the FDC. This can 
be used to verify the values initialized in the FDC. 


8.5.3.12 PERPENDICULAR MODE Command 


An added capability of the FDC is the ability to inter- 
face directly to perpendicular recording floppy 
drives. Perpendicular recording differs from the tradi- 
tional longitudinal method by orienting the magnetic 


. bits vertically. This scheme packs in more data bits 


for the same area. 
The PERPENDICULAR MODE Command allows the 


system designers to designate specific drives as 
Perpendicular recording drives. Data transfers be- 
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tween Conventional and Perpendicular drives are al- 
lowed without having to issue PERPENDICULAR 
MODE Commands between the accesses of the two 
different drives, nor having to change write precom- 
pensation values. 


With this command, the length of the Gap2 field and 
VCO enable timing can be altered to accommodate 
the unique requirements of these drives. Table 36 
describes the effects of the WGATE and GAP bits 
for the PERPENDICULAR MODE Command. 


When both GAP and WGATE equal 0 the PERPEN- 
DICULAR MODE Command will have the following 
effect on the FDC: 


1. If any of the new bits DO and D1 are pro- 
grammed to 1, the corresponding drive is auto- 
matically programmed for Perpendicular mode 
(ie: GAP2 being written during a write operation, 
the programmed Data Rate will determine the 
length of GAP2), and data will be written with 0 
ns write precompensation. 


2. Any of the new bits (DO/D1) are programmed for 
0, the designated drive is programmed for Con- 
ventional Mode and data will be written with the 
currently programmed write precompensation 
value. 


3. Bits DO and D1 can only be over-written when 
the OW bit is 1. The status of these bits can be 
determined by interpreting the eighth result byte 
of the DUMPREG Command. (Note: if either 
the GAP or WGATE bit is 1, bits DO and D1 are 
ignored.) 
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Software and Hardware reset have the following ef- 
fects on the enhanced PERPENDICULAR MODE 
Command: 


1. A software reset (Reset via DOR or DSR regis- 
ters) only sets GAP and WGATE bits to 0; 
DO and D1 retain their previously programmed 
values. 


2. A hardware reset (Reset via pin 32) sets all bits 
(GAP, Wgate, DO, and D1) to 0 (All Drives Con- 
ventional Mode). 


8.5.3.13 POWERDOWN MODE Command 


The POWERDOWN MODE Command allows the 
automatic power management and enables the en- 
hanced registers (EREG EN) of the FDC. The use of 
the command can extend the battery life in portable 
PC applications. To enable auto powerdown the 
command may be issued during the BIOS power on 
self test (POST). 


This command includes the ability to configure the 
FDC into the enhanced mode extending the SRB 
and TDR registers. These extended registers ac- 
commodate bits that give more information about 
floppy drive interface, allow for boot drive selection, 
and identify the values of the PD and IDLE status. 


As soon as the command is enabled, a 10 ms ora 
0.5 sec minimum powerup timer is initiated, depend- 
ing on whether the MIN DLY bit is set to 0 or 1. This | 
timer is one of the required conditions that has to be 
satisfied before the FDC will enter auto powerdown. 


Table 36. Effects of WGATE and GAP Bits 


Portion of 


Gap2 VCO 


VCO Low Length of Gap2 Low Time for 
Time after | Gap2 Format] Written by 
: 3 Read 
Index Pulse Field Write Data : 
: Operations 
Operation 


33 Bytes 22 Bytes 24 Bytes 


33 Bytes 22 Bytes 19 Bytes 24 Bytes 


33 Bytes 22 Bytes 0 Bytes 24 Bytes 


18 Bytes | 41 Bytes 38 Bytes 43 Bytes 
NOTE: 


When either GAP or WGATE bit is set, the current value of precompensation in the DSR is used. 


Conventional Mode 


Perpendicular Mode 
(500 Kbps and lL_ower Data Rates) 


Reserved (Conventional) 


Perpendicular Mode 
(1 Mbps Data Rate) 


4-158 ADVANGE INFORMATION 


intel. 


Any software reset will re-initialize the timer. The tim- 
er countdown is also extended by up to 10 ms if the 
data rate is changed during the timer’s countdown. 
Without this timer, the FDC would have been put to 
sleep immediately after FDC is idle. The minimum 
delay gives software a chance to interact with the 
FDC without incurring an additional overhead due to 
recovery time. 


The command also allows the output pins of the 
floppy disk drive interface to be tri-stated or left unal- 
tered during auto powerdown. This is done by the 
FDI TRI bit. In the default condition (FDI TRI=0) the 
output pins of the floppy disk drive are tri-stated. 
Setting this bit leaves the interface unchanged from 
the normal state. 


The results phase returns the values programmed 
for MIN DLY, FDI TRI and AUTO PD. The auto pow- 
erdown mode is disabled by a hardware reset. Soft- 
ware results have no effect on the POWERDOWN 
MODE Command parameters. 


8.5.3.14 PART ID Command 


This command can be used to identify the floppy 
disk controller as an enhanced controller. The first 
stepping of the FDC (all versions) will yield Ox02 in 
the result phase of this command. Any future en- 
hancements on these parts will be denoted by the 5 
LSBs (0x01 to Ox1F). 


8.5.3.15 OPTION Command 


The standard IBM format includes an index address 
field consisting of 80 bytes of GAP 4a, 12 bytes of 
the sync field, four bytes identifying the IAM and 
50 bytes of GAP 1. Under the ISO format most of 
this preamble is not used. The ISO format allows 
only 32 bytes of GAP 1 after the index mark. The 
ISO bit in this command allows the FDC to configure 
the data transfer commands to recognize this for- 
mat. The MSBs in this command are reserved for 
any other enhancements made available to the user 
in the future. 


8.5.3.16 SAVE Command 


The first byte corresponds to the values pro- 
grammed in the DSR with the exception of CLKSEL. 
The DRATE1, DRATEO used here are unmapped. 
The second byte is used for configuring the bits from 
the OPTION Command. All future enhancements to 
the OPTION Command will be reflected in this byte 
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as well. The next nine result bytes are explained in 
the Parameter Abbreviations section after the com- 
mand summary. The 13th byte is the value associat- 
ed with the POWERDOWN MODE Command. The 
disk status is used internally by the FDC. There are 
two reserved bytes at the end of this command for 
future use. 


This command is similar to the DUMPREG Com- 
mand but it additionally allows the user to read back 
the precompensation values as well as the pro- 
grammed data rate. It also allows the user to read 
the values programmed in the POWERDOWN 
MODE Command. The precompensation values will 
be returned as programmed in the DSR register. 
This command, used in conjunction with the RE- 
STORE Command, should prove very useful for 
SMM power management. This command reserves 
the last two bytes for future enhancements. 


8.5.3.17 RESTORE Command 


Using the RESTORE Command with the SAVE 
Command, allows the SMM power management to 
restore the FDC to its original state after a system 
powerdown. It also serves as a succinct way to pro- 
vide most of the initialization requirements normally 
handled by the system. The sequence of initializing 
the FDC after a reset occurred and assuming a 
SAVE Command was issued follows: | 


e Issue the DRIVE SPECIFICATION Command (if 
the design utilizes this command) 


e Issue the RESTORE Command (pass the 
16 bytes retrieved previously during SAVE) 


The RESTORE Command programs the data rate 
and precompensation value via the DSR. It then re- 
stores the values normally programmed through the 
CONFIGURE, SPECIFY, and PERPENDICULAR 
Commands. It also enables the previously selected 
values for the POWERDOWN Mode Command. The 
PCN values are set restored to their previous values 
and the user is responsible for issuing the SEEK and 
RECALIBRATE Commands to restore the head to 
the proper location. There are some drives that do 
not recalibrate in which case the RESTORE Com- 
mand restores the previous state completely. The 
PDOSC bit is retrievable using the SAVE Command, 
however, the system designer must set it correctly. 
The software must allow at least 20 ys to execute 
the RESTORE Command. When using the BOOT- 
SEL bits in the TDR, the user must restore or reini- 
tialize these bits to their proper values. 
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The FORMAT AND WRITE Command is capable of 
simultaneously formatting and writing data to the 


The D value is ignored. This command formats the 
entire track. High speed floppy diskette duplication 
can be done fast and efficiently with this command. 


in a single pass. This is very useful for software du- 
plication applications by reducing the time required 
to format and copy diskettes. 


Primary Secondary 
_ Address Address 


10h 170h 
{Fh 171h 
4 FAh een valie 
1F2h 172h 
41F3h — 173h 
1F4h 174h 
~4F5h 475h 
1F6h 176h 
1F7h 177h 
—4F7h 17h 
3F6h 376h 
3F6h 376h 
3F7h 37h 
3F7h 377h 
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Error Register 


IDECSO# - Write Precomp/Features Register 
Sector Count Register 
R/ 


Cylinder High Register | | R/W 
_ IDECSO# | Drive/Head Register | R/W 
IDECSO# i | | 
_ IDECSO# i WO | 
i | wO 


IDECS1 # 


3 aw 
: aw 
[statusReister ——~SCS~C~idtCS 
[command Register =i WO 
[Atemate Status Register ‘| RO 
[Digital Output Register «two 
[bive dress Renister «tO 
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and lower and upper data byte controls. DMA and 
16-bit data transfers are supported: Minimal external 
logic is required to complete the optional 16-bit IDE 


|/O and DMA interfaces. With external logic, a fully 
buffered interface is also supported. 


9.1 IDE Registers 


The 82091AA does not contain IDE registers. All of 
the IDE device registers are located in the IDE de- 
vice, except bit 7 of the Drive Address Register 
which is the Floppy Controller Disk Change status bit 
and is driven by the 82091AA. : 


The IDE interface contains two chip selects 
(IDECSO# and IDECS1#). These signals are as- 
serted for accesses to the Command and Control 
Block registers located at 01Fxh and 03Fxh, respec- 
tively (Table 37). , 


Data Register 


Sector Number Register 
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9.2 IDE Interface Operation 


The 82091AA implements the chip select signals for 
the IDE interface and decodes the standard PC/AT 
primary and secondary |/O locations. 


The 82091AA provides a data buffer enable signal 
(DEN #) to control the lower data byte path for buff- 
ered designs. Buffering the lower data byte path is 
an application option that requires an external trans- 
ceiver/buffer. For buffered applications, DEN # con- 
trols an external transceiver and enables data bits 
IDED[7:0] onto the system data bus SD[7:0]. For 
non-buffered applications (typically the X-Bus con- 
figuration), IDED[7:0] are connected directly to the 
bus and DEN # is not used and becomes a no-con- 
nect. For 16-bit applications the upper data byte 
path (IDED[15:8]) is controlled by the HEN# signal. 


ISA Bus 
SA[9:0] 
lIORC# 
1OWC# lOWC# 
RSTDRV RSTDRV 
1016# |< 1016# 


SA[9:0] 
lIORC# 


SD[7:0] 


SD[15:8] 
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Figure 65 shows an example IDE interface without 
DMA capability. In this case all IDE accesses for set- 
ting up the IDE registers and transferring data is pro- 
grammed via I/O. The 82091AA generates the chip 
selects (IDECSO# and IDECS1#). The 82091AA 
also generates the DEN# and HEN# signals to en- 
able the data buffers. 


Figure 66 shows an example DMA IDE interface for 
type “F’’ DMA cycles. To set up the IDE interface, 


- the host accesses the IDE registers on the IDE de- 


vice. For programmed I/O accesses, the 82091AA 
generates the chip selects (IDECSO# and 
IDECS1#) to access the IDE registers and the 


-_ DEN# and HEN# signals to control the data buff- 


ers. During DMA transfers the DMA handshake is 
between the DMA controller and IDE device via the 
DREQ and DACK# signals. The DACK# signal is 
ORed with the DEN# and HEN# signals to control 
the upper and lower byte buffers during DMA trans- 
fers. 


DB[15:8] 


A[2:0] 
low# 
lOR# 

BALE 


1016# 
RST# 
290486-65 


Figure 65. IDE Interface Example (without DMA) 
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ISA Bus 


lORC# 


SD[15:8] 


SA[2:0] 
lOWC# 
lIORC# 


1016# 
RSTDRV 


290486-—66 


Figure 66. IDE Interface Example (with DMA) 
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10.0 POWER MANAGEMENT 


The 82091AA provides power management capabili- 
ties for its primary functional modules (parallel port, 
floppy disk controller, serial port A, and serial port 
B). For each module, the 82091AA implements two 
types of power management—direct powerdown 
and auto powerdown. Direct powerdown, enabled 
via control bits in the 82091AA configuration regis- 
ters, immediately places the module in a powerdown 
mode by turning off the clock to the associated mod- 
ule. Direct powerdown removes the clock regardless 
of the activity or status of the module. By contrast, 
when auto powerdown is enabled (via control bits in 
the 82091AA configuration registers), the associated 
module only enters a powerdown mode if it is in an 
idle state. 


NOTE: 
The entire 82091AA can be placed in direct 
powerdown by writing to the CLKOFF bit in 
the AIPCFG1 Register. 


10.1 Power Management Registers 


The floppy disk controller, parallel port, serial port A, 
and serial port B each have two 82091AA configura- 
tion registers. For each module, three configuration 
register bits control power management—xDPDN, 
xIDLE, and xAPDN. 


e xAPDN: auto-powerdown, shuts off the oscillator 
to the module when the module is idle. 


e xIDLE: idle status, a read only pin that indicates 
idle status. 


e xDPDN: direct powerdown, shuts off module os- 
cillator when active regardless of module status. 


The 82091AA exits any powerdown mode after a 
hardware reset (RSTDRV asserted) or reset via the 
XRESET bit in the 82091AA configuration registers. 
Direct powerdown can also be exited by writing the 
corresponding xPDN bit in the configuration register 
to 0. Auto powerdown is exited by events at the 
module (e.g., CPU read/write or module interface 
activity). 


NOTE: 

The configuration registers also contain the 
xEN bit. This bit is used to completely dis- 
able an unused module. Enabling a disabled 
module takes much longer than restoring a 
module from powerdown. Therefore, this bit 
is not recommend for temporarily disabling a 
module as a powerdown scheme. 
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10.2 Clock Power Management 


The internal clock circuitry of the 82091AA can be 
turned on or off as part of a power management 
scheme. The clock circuitry is controlled via the 
CLKOFF bit in the AIPCFG1 Register. If an external 
clock source exists, the user may want to turn off the 
internal oscillator to save power and provide mini- 
mum recovery time. 


Auto powerdown and direct powerdown (in each 
module) have no effect on the state of internal oscil- 
lator. | 


10.3 FDC Power Management 


This section describes the FDC direct and auto pow- 
erdown modes and recovery from the powerdown 
modes. 


Auto Powerdown 


Automatic powerdown (APDN) has an advantage 
over direct powerdown (PDN) since the register con- 
tents are not lost under APDN. Automatic power- 
down is invoked by either the Auto Powerdown com- 
mand, or by enabling the FAPDN bit in the FDC con- 
figuration register. There are four conditions required 
before the FDC will enter powerdown: 


1. The motor enable pins ME[3:0] must be inactive. 


2. The FDC must be in an idle state. FDC idle is 
indicated by MSR=80h and the IRQ6 signal is 
negated (IRQ6 may be asserted even if 
MSR = 80h due to polling interrupt). 


3. The head unload timer (HUT, explained in the 
SPECIFY Command) must have expired. 


4. The auto powerdown timer must have timed out. 


An internal timer is initiated when the POWER- 
DOWN MODE Command is executed. The amount 
of time can be set by the user via the MIN DLY bits 
in the POWERDOWN MODE Command. The mod- 
ule is then powered down, provided all the remaining 
conditions are met. A software reset reinitializes the 
timer. When using the FDC FAPDN bit to enable the 
automatic powerdown feature, the MIN DLY bit is set 
to the default condition. 


Recovery from Auto Powerdown 
When the FDC is in auto powerdown, the module is 
awakened by a reset or access to the DOR, MSR or 


FIFO registers. The module remains in auto power- 
down mode after a software reset (i.e., it will power- 
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down again after being idle for the time specified by 
MIN DLY). However, the FDC does not remain in 
auto powerdown mode after a hardware reset or 
DSR reset. | 


Direct Powerdown 


Direct powerdown is invoked via the Powerdown bit 
in the Data Rate Select Register (bit 6), or the 
FDPDN bit in the FCFG2 Register. Setting FDPDN 
_ to 1 will powerdown the FDC. All status is lost when 
this type of powerdown mode is used. The FDC exits 
powerdown mode after any hardware or software re- 
set. Direct powerdown overrides automatic power- 
down. 


~ Recovery from Direct Powerdown 


The FDC exits the direct powerdown state by setting _ 


the FDPDN bit to 0 followed by a software or hard- 
ware reset. 


After reset, the FDC goes through a normal se- 
quence. The drive status is initialized. The FIFO 
mode is set to default mode on a hardware or soft- 
ware reset if the LOCK Command has not blocked it. 
Finally, after a delay, the polling interrupt is issued. 


10.4 Serial Port Power Management 


This section describes the serial port direct and auto 
powerdown modes and recovery from the power- 
down modes. 


Auto Powerdown 


When auto powerdown is enabled in the SxCFG2 
Register (SxAPDN bit is 1), the serial port enters 
auto powerdown based on monitoring line interface 
activity. During auto powerdown, the status of the 
serial port is maintained (the FIFO and registers are 
not reset). Access to any seria! port register is al- 
lowed during auto powerdown. The transmitter and 
the receiver enter powerdown individually, depend- 
ing on certain conditions. When there are no charac- 
~ ters to transmit (TEMPTY =1 in the LSR), the trans- 
mitter clock is shut off placing the transmitter in auto 
powerdown. In the case of the receiver, when serial 
input signal is inactive for approximately 5 character 
times, indicating that no character is being received, 
the receiver goes into auto powerdown. 


Recovery from Auto Powerdown 


The serial port recovers from auto powerdown when 
either the transmitter or receiver are active. If data is 
written to the transmitter or data is present at the 
receiver, the serial port exits from auto powerdown. 
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Direct Powerdown is invoked via the SxCFG2 Regis- 
ter (setting the SxDPDN bit to 1). When in direct 
powerdown, the clock to the module is shut off. All 
registers are accessible while in direct powerdown. 
A host read of the Receiver Buffer Register or a 
write to the Transmitter Holding Register should not 
be performed during powerdown. The SINx input 
should remain static. | 


Direct Powerdown 


When direct powerdown is invoked, the transmit and 
receive sections of the serial port are reset, includ- 
ing the transmit and receive FIFOs. Thus, to prevent 
possible data loss when the FIFOs are reset, soft- 
ware should not invoke direct powerdown until the 
serial port is in the idle state as indicated by the 
SxIDLE bit in the SxCFG2 Register. | 


Recovery from Direct Powerdown 


Recovery from direct powerdown is accomplished 
by writing the SxDPDN bit in the configuration regis- 
ter to 0 or by a module reset. 


10.5 Parallel Port Power Management 


Auto Powerdown 


Auto powerdown is enabled via the PAPDN bit in the 
PCFG2 Register. When enabled, the parallel port 
enters auto powerdown when the module is in an 
idle state. If the parallel port FIFO is being used to 
transfer data, the parallel port is in an idle state 
when the FIFO is empty. 


Recovery from Auto Powerdown 

Recovery from auto powerdown occurs when the 
FIFO is written or as a result of parallel port interface 
activity. 

Direct Powerdown 

Direct powerdown is invoked via the PCFG2 Regis- 


ter (setting the PDPDN bit to 1). When PDPDN= 1, 
the clock to the printer state machine is disabled 


~ and the state machine goes into an idle state. 


Recovery from Direct Powerdown 


Recovery from direct powerdown is accomplished | 
by setting the PDPDN bit to 0 or the PRESET bit toa 
1 in the PCFG2 Register. An 8209TAA hard reset 
(RSTDRV asserted) also brings the part out of direct 
powerdown. 
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11.0 ELECTRICAL NOTICE: This data sheet contains information on 
CHARACTERISTICS : products in the sampling and initial production phases 
A of development. The specifications are subject to 
change without notice. Verify with your local Intel 
11.1 Absolute Maximum Ratings Sales office that you have the latest data sheet be- 
fore finalizing a design. 
Storage Temperature .......... — 65°C to + 150°C * WARNING: Stressing the device beyond the “Absolute 
Supply Voltage...............6.. —0.5V to + 8.0V Maximum Ratings’”’ may cause permanent damage. 


33 These are stress ratings only. Operation beyond the 
VONABO ON ANY NDUR irene SNe Se oe” “Operating Conditions’ is not recommended and ex- 
Voltage on Any Output ...GND-0.5V toVcocg +0.5V tended exposure beyond the “Operating Conditions” 
Power Dissipation ..............0.eceeeeeeeee 1w _ May affect device reliability. 


11.2 DC Characteristics 
Table 38. DC Specifications (Voc =5V +10%, Tamp = 0°C to 70°C) 


Symbol Paramet bag 
ita esas [ Max(v)_[Notes|MinV)|_Max(¥) [Notes 
ue 


Vue inputlowVonags x1 ——=S~*~‘idt 0s ows | 08] os | 
2.0 


[reap oe [ 


lcc  |Voc Supply Current — 1 Mbps FDC Data Rate 


he Input Load Current +10nA +10 pA 
(all input pins) —10 pA —10 pA 
lOFL Data Bus Output +10 pA 7 +10 pA | 
Float Leakage —10 pA —10 pA 
IBPL Parallel Port Back-Power Leakage | 
feu (All Parallel Port Signals) ae meee te Se 


NOTES: 

1. Test Conditions: Only the data bus inputs may float. All outputs are open. . 

2. Test Conditions: Tested while reading a sync field of “‘O0’’. Outputs not connected to DC loads. This specification reflects 
the supply current when all modules within the 82091AA are active. 

. Test Conditions: Vi_=Vss, ViH= Vcc; Outputs not connected to DC loads. 

. Test Conditions: Typical value with the oscillator off. 

. Test Conditions: All 82091AA modules are in their powerdown state. 

. Test Conditions: 10 wA (Vin=Voc), — 10HA (Vin = OV) 

. Test Conditions: OV<VoH<Vcc 

. Test Conditions: 0.45V<VoH<Vcc 

. Test Conditions: Device in Circuit Voc = OV, Vin=5.5V max. 


OONOD AH WD 
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~F=1 MHz, Ta= 25°C | 
Sampled, not 100% Tested | 


NOTE: 
All pins except pins under test are tiecl to AC ground. 


The following pin groupings are used in Table 40 and Table 41. 


DMA FDDREQ, PPDREQ 

IRQXx IRQ3, IRQ4, IRQS, IRQ6, IRQ7 | 

Serial Port SOUTA, SOUTB, DTRA#, DTRB#, RTSA#, RTSB# 

Parallel Port PD[7:0], STROBE#, AUTOFD#, INIT#, SELECTIN# 
WRDATA, HDSEL#, STEP#, DIR#, WE#, FDMEO#, FDME1#, FDSO#, FDS1#, 
DRVDEN[1:0] | | 


Table 40. Vo, Specifications (Voc =5V +10%, Tamb =0°C to 70°C) 


a 
=e [easy | aema |_| oasv | 12ma 


“Tl 
Oo 
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Fao 
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Table 41. Vow Specifications (Vcc =5V £10%, Tamp = 0°C to 70°C) 


ee ee 
Twin [wax [tow | win | Max | ton 


[vou | soor | zev | [ema [| ev [| ema 
[Vou | owainGe [| zav [| ama | 2av [| ema 
[Vou [SerialPort [zav | | tma [2a [| tm 
[Vou | Paraloleon | zav | [ma | 2av [| Soya 
[Vou | ppomacs* | 2ev [| 1ma | 2av [| tma_ 
[Vou | FoCimerace [| 2v | | ma | 2a | [| 2ma 
[Vou | DENe Hens [2av [| tma_[ 2a [| tm 
[Vou | woecsitoe [ zev [| ema | 2a [| 2m 


VSS 


C load = 50 pF for all logic outputs and 


100 pF for the data bus. 
290486-67 


Figure 67. Load Circuit 


1.5V <+———— Test Points ———- > 1.5V 


Figure 68. AC Testing Input, Output 


290486-68 
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11.3 Oscillator 


The 24 MHz clock can be supplied either by a crystal (Figure 69) or a MOS level square wave. All internal 
timings are referenced to this clock or a scaled count that is data rate dependent. The crystal oscillator must 
be allowed to run for 10 ms after Voc has reached 4.5V or exiting the POWERDOWN mode to guarantee that 
it is stable. 
Crystal Specifications: 


Freq: 24 MHz + 0.1% 


Mode: Parallel Resonant 
a Fundamental Mode 

Series Resistance: < 400 

Shunt Capacitance: <5 pF 

C1, C2: 20 pF-25 pF 


24 MHz ~ 
CRYSTAL 


82091AA 


290486-69 


Figure 69. Crystal Connections 


82091AA 


290486-70 


Figure 70. Oscillator Connections 
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11.4 AC Characteristics | 
Table 42. AC Specifications (Vcc =5V + 10%, Tamb = 0°C to 70°C) 


Clock Rise and Fall Time 


NOTES: 

1. Clock input high level test points for clock high time and clock rise/fall times are 3.5V with Voc at 5V +10% and 2.0V 
with Vcc at 3.3V Voc +10%. Clock input low level test point for clock low time and clock rise/fall time is 0.8V. 

2. Clock input test point for clock period is 0.8V. 

3. Certain Floppy Disk Controller module timings are a function of the selected data rate. The nominal values for the internal 
clock period (tie) for the various data rates are: 


internal Clock Period 
(“nominal values) 


a 


All information contained in () in the following tables represents 3.3V specifications. 


Table 43. AC Specifications (Vcc = 5V + 10%, or [3.3V + 0.3V] Tamb = 0°C to 70°C) 


| Symbol | Parameter =| Min’ | Max | Units | Notes | Figure. 
ee 
a eee tae ae ee 
a" omc! 6 fe tae 
SD[7:0] 
ies en Teron yeas aaa oe a arm aa Aa 
SS a 
fe" | sotpaliaatenionGetaake a [me 


Disk Drive 
Disk Rate 


: ADVANCE INFORMATION | 7 4-169 


—82091AA _ a intel. 


Table 43. AC Specifications (Voc = 5V + 10%, or (3.3V + 0.3V) Tamp = 0°C to 70°C) (Continued) 


Symbol Parameter [Min| Max | Units| Notes | Figure | 


lIOCHRDY 


t4a IOCHRDY Propagation Delay from IORC #/ 55 el 82, 83 
lOWC # 


‘4b _| IOCHRDY Propagation Delay fromBUSY | | 34(65) jms | cpp | 22.09 | 82, 83 | 


IORC # 


t5a__| IORC# Active Pulse Width CO 
t5b —_| IORC# Recovery Time —_ feo] [rf $= fn 


lOWC # 


t6éa__| IOWC# Active Pulse Width Poof df ns | ts—s—~iYSCiB 
t6b lIOWC # Recovery Time | feo; |r] st 3 | 


AEN 


t7a__| AEN Setup to IORC#/IOWC# Active eee 
tro | AENHoldfromIORC#/IOWC# Inactive | 0 | | ns | | 72,78 


NOWS # 


NOWS# Delay from IORC# /IOWG# aE) 
TC Active Pulse Width [a> «3s 


RESET 
RSTDRV 


t10a__| RSTDRV Active Pulse Width eet Tet 


t1 Ob Hardware Configuration Input 2etNP to All Configuration 
RSTDRV Inactive Modes 

t10c Hardware Configuration Input Hold from All Configuration 
RSTDRV Inactive — Modes . 


INTERRUPTS 
RQ([4,3] (Serial Ports) 


t11b IRQ([4,3] Inactive Delay from IORC #/ — 100 THR wr, 90, 91 
lIOWC # Active | RBR rd, 
MSR rd 
tlic IRQ([4,3] Inactive Delay from |IORC# | 100 IIR rd, 
| Inactive. LSR rd 
tlid IRQ([4,3] Active Delay from DCD #/DSR #/ 91 
CTS#/RIi# 
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Table 43. AC Specifications (Voc = 5V + 10%, or (3.3V + 0.3V) Tampb = 0°C to 70°C) (Continued) 


Teymbot| Parameter | Min | Max [Units] Notes | Figure 


IRQ([7,5] (Parallel Port) 


t12b IRQ(7,5] Inactive Delay from IORC # / 70 (90) ECP rev, 
IOWC # Active fwd to FIFO 

t12c IRQ([7,5] Inactive Delay from IOWC # 70 (95) ECP fwd to ECR 
Inactive 


joa TO See 
[re [imorstosayrenraunre [| ron) [oe [coe [or 


IRQ6 (FDC) 
t13b IRQ6 Inactive Delay from IORC # / tle +125 
lIOWC # Active 


FDDREQ, PPDREQ 


tt 4a xDREQ Inactive Delay from xDACK # 75 (100) 
Active 


t14c xDREQ Active from lIORC # /IOWC # 
Inactive 


xDREQ Setup IORC #/IOWC# 


tl4e xDREQ Delay from IORC # /IOWC # 75 a mE 
Active 

t14f FDREQ Inactive Delay from TC Active 110 
PPDREQ Inactive peer aaa from TC Active 80 caine 


t1 Hti4g | | xDREQtoxDACK# Inactive to xDACK # Inactive 2 tte. tle 
FDDACK #, ive] 


tid5a xDACK # Active Delay from xDREQ 74 
Active 

t15b xDACK # Setup to |ORC #/IOWC# 18 74 
Active 

t15c xDACK # Hold from IORC # /IOWC # 74 
Inactive 
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Table 43. AC Specifications (Vcc= 5V + 10%, or (3.3V + 0.3V) Tamb = 0°C to 70°C) (Continued) 


PARALLEL PORT 


| PD[7:0] | 
16a, | PDI7:0] Delay from IOWG# Inactive | 60(90) | ns | ISAPS/2wr| 87 
t16b _| PD[7:0] Delay from IOWC# Active | | 70(100)| ns | EPPwr | 62 __ 
t16c PD[7:0] Float Delay from IOWC # Inactive /50} | ns | EPPwr | 82 _ 
t16f PD[7:0] Setup to STROBE # Active 450} ss {sds ISAFIFO | 84 | 
t16g PD[7:0] Hold from STROBE # Inactive 450} | «| ISAFIFO | 84 _— 
tt6h —_| PDI7:0] Hold from BUSY Inactive fo} | | ECP fwd | 85 | 
t16i | PD[7:0] Setup to ACK# High __ fo | | | EcPrev | 86 |» 
t16, | PD[7:0] Hold from AUTOFD# Low fo} | | EcPrev | 86 | 
STROBE # | 
t17a | STROBE# Delay from |OWG# Inactive | —«| 60/90 | ~—s|_‘ISA,PS/2 | 87 | 
t17b | STROBE# Delay fromionc#/iowc# —|_—s:| 60/90 | Ss |_~—s EPP: 82, 88. | 
t17¢ | STROBE# Active from BUSY Inactive _ 500; | —s|sISAFIFO | 84 | 
t17d | STROBE Active Pulse Width — f450f || ISAFIFO ] 84 | 
t17e STROBE # Active frorn BUSY Inactive Fo] | | ECP fwd | 85 | 
t17f. | STROBE# Inactive Delay fromBUSY Active | 0 | | | ECPfwd | 85 | 
AUTOFD# 
t18a_ | AUTOFD# Delay from IOWG# Inactive | | 6o(go) | ns | ISAPS/2 | 87 | 
t18b | AUTOFD# Delay from IORC#/IOWC# | | 60(@0) | ns | EPP | 82,83 | 
t18c | AUTOFD# Hold from BUSY Inactive /s0| | ns | ECPfwd | 85 __ 
t18d AUTOFD# Low Delay from ACK # Inactive /o | | ns | ECPrev | 86 | 
t18e AUTOFD# High Delay from ACK # Active 1 o |} | ns | EcPrev | 86 _ 


INIT # 


t19a__| INIT# Delay from IOWC¢ Inactive | 60(90) | ns | All Modes 


# f/ . - 
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Table 43. AC Specifications (Voc= 5V + 10%, or (3.3V + 0.3V) Tamp =0°C to 70°C) (Continued) 


Symbol] _____Parameter_————_—([Min| Max [Unite] Notes | Figure 
aoa [SELECTING Belay fom iOWGe nacivew | [0000] ne | WAPSa | a7 
ib [SELECTIN® Belay romiOWG#/ORC® Acie [eo (eo| | eer | a2 
i@ia [BUSY Acive belay tom STROSE® Acive [| 500 [| Warsa | ea 
ib [BUSY Actve Delay fom STROBE® Actve fo] | | _——+dt es 
ic [BUSY inactive Delay rom STROBE nactve [| | | e0Pmad | 05 
id |BUSYSeuptoAcke ace === of | (| ecrry | 86 
‘zit [BUSY Hold rom AUTOFD# hacive [oo] | | ecprv | 06 
ACK# 

(aa [AGK# heive tom AUTORDe High [0 [ [| sory | 06 
22 [AGKe Inactive Holdirom AUTOFD# Low [o| | | ecPrv | 06 
jaa [60S Deaytonsarioo ———SS«|=Cidt@ eo] SC*dSCi 
29 | PPOIR Delay om iOWG# inacivo ———~| ‘(a0 e0)| | SA PS72, £08 a7 
i232 [PPOIRDeleytomiowce Acive | —ieoeo) | eer | a2 
aia [DES Doayronsarool «Yio Od 
asa [BEND Delton sation ——~SC~*~*~dSC‘idr a] YCSSCSC*«*dr RT 
ase [DEN# Delaytromsbacke ————S*d itor] | Sid 
iaéa [HENe Delytomiorse ——SSSS=«dSCidi ws] ~<YSSC~idSCia 
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Table 43. AC Specifications (Vcc= 5V + 10%, or (3.3V + 0.3V) Tamb = 0°C to 70°C) (Continued) 


SERIAL PORTS _ 
DTRx#, RTSx#, DCDx # 
DTRx#/RTSx#/DCDx# Active Delay fromiowc#| | 55(70)| ns |MCRwr| 91. | 
FLOPPY DISK CONTROLLER | 
RDDATA# 


WRDATA# 


aoe [Data war ecoroie|soonow] | 7? | 7 
. HDSEL# . 
| STEP # 


| e | DIR# 
32a _| DIR# Setup to STEP # Active a en oe ee ee 
132b | DIR# Hold from STEP # Inactive | 10 | | ops || 78 | 


WE # 
133a WE # Inactive Delay from RSTDRV Inactive Edge a ee eee 
INDEX# _ | 


NOTES: 

1. The FDC Status Register’s status bits which are not latched may be updated during a host read operation. 

2. The timing t13b is specified for the FDC interrupt signal in the polling mode only. These timings in case of the result 
phase of the read and write commands are microcode dependent. 

3. This timing is for FDC FIFO threshoid=1. When FIFO threshold is N bytes, the value should be multiplied by N and 
subtract 1.5 ws. The value shown is for 1 Mbps, scales linearly with data rate. 

4. This timing is a function of the internal clock period (t1e) and is given as (24) tie. The values of tle are shown in Note 3. 

5. If DACK# transitions before RD#, then this specification is ignored. If there is no transition on DACK#, then this be- 
comes the DRQ inactive delay. 

6. TC width is defined as the time that both TC and DACK# are active. Note that TC and DACK# must overlap at least 
50 ns. 
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NOTES: (Continued) 
7. Based on the internal clock period (tie). For various data rates, the read and write data width minimum values are: 


[sma [oe 
Data Rate 
| 500Kbps_— | 60ns_ 


8.This timing is a function of the selected data rate as follows: 


9. This value can range from 0.5 ms to 8.0 ms and is dependent upon data rate and the Specify Command value. 
10. The minimum MFM values for WE# to HDSEL# change for the various data rates are: 


== 
250 Kbps 


GPL is the size of gap 3 defined in the sixth byte of a Write Command. 


11. Based on internal clock period. 

12. Jitter tolerance is defined as: 
(Maximum bit shift from nominal position + 1% period of nominal data rate) x 100 percent is a measure of the allowable 
bit jitter that may be present and still be correctly detected. The data separator jitter tolerance is measured under 
dynamic conditions that jitters the bit stream according to a reverse precompensation algorithm. 

13. The minimum reset active period for a software reset is dependent on the data rate, after the FDC module has been 
properly reset using the t10a spec. The minimum software reset period then becomes: 

Disk Drive 


Minimum Software Reset 
Active Period 
Data Rate 


PRAM 
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11.4.1 CLOCK TIMINGS 
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Figure 71. Clock Timing 


11.4.2 HOST TIMINGS 
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Figure 72. Host Read 
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Figure 73. Host Write 
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xDREQ 


FDDACK#, 
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IORC#, 
lOWC# 


SD[7:0] 


as ae Valid Data 


t9a 
TC 
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Figure 74. DMA Timing 
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NOTE: 


-FDDREQ, IRQ6 depicts the FDC enabled condition under hardware configuration. Otherwise, these ls tri-state with 
the same timing as IRQ[7,5,4,3]. 


Figure 75. Reset Timing 
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RSTDRV 


HW Config Pins 
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Figure 76. Reset Timing (Hardware Extended Configuration Mode) 


11.4.3 FDC TIMINGS 


WE# 
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Figure 77. Write Data Timing 
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{32a t31a 
STEP# 


t34a : 
INDEX# = one ee 


t30a 


HDSEL 


WE# 
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NOTE: 


For overlapped seeks, only one step pulse per drive selection is issued. Non-overlapped seeks will issue all programmed 
step pulses. 


Figure 78. FDC Drive Control/Timing 
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RDDATA# 
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Figure 79. FDC Internal PLL Timing 
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Figure 80. Floppy Disk Controller Interrupts 


11.4.4 PARALLEL PORT TIMINGS 
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Figure 81. Parallel Port Interrupt Timing 
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Figure 82. EPP Write Timing 
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Figure 83. EPP Read Timing 
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Figure 84. ISA-Compatible FIFO Timing 


/PDI7:0] | Valid Data 


AUTOFD# 


STROBE# 


BUSY 
290486-85 


Figure 85. ECP Write Timing (Forward Direction) 
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Figure 86. ECP Read Timing (Reverse Direction) 
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Figure 87. ISA-Compatible Write Timing 
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11.4.5 IDE TIMINGS 
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Figure 88. IDE Timing 
11.4.6 GAME PORT TIMINGS 
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Figure 89. Game Port Timing 
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11.4.7 SERIAL PORT TIMINGS 
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Figure 90. Serial Port Interrupt Timing 
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Figure 91. Modem Control Timing 
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12.0 PINOUT AND PACKAGE INFORMATION 


12.1 Pin Assignment 
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‘Figure 92. 82091AA Pin Diagram 
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Table 44. Alphabetical 82091AA Pin Assignment 
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Signal Name 
STROBE # 
TRKO# 


= 


Signal Name Pin # Pin # 
SA10 17 
24 
SD1 25 
26 
27 
D 29 
D 30 
SD6 31 
SD7 32 
SINA 37 
45 
SELECT 51 
SELECTIN# 61 
SOUTA 39 
SOUTB 47 
STEP # 81 
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Table 45. Numerical 82091AA Pin Assignment (Continued) 
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12.2 Package Characteristics | 
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Figure 93. 100-Pin Quad Flat Pack (QFP) Dimensions 
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13.0 DATA SEPARATOR CHARACTERISTICS FOR FLOPPY DISK MODE 
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Figure 95. Typical Jitter Tolerance vs Data Rate (Capture Range 300 Kbps) | 
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Figure 97. Typical Jitter Tolerance vs Data Range (Capture Range 1 Mbps) 
Jitter Tolerance measured in percent. Capture range expressed as a percent of data rate, i.e., +3% percent. 
e=Test Points: 250 Kbps, 300 Kbps, 500 Kbps and 1 Mbps are center, +5 percent @ 60 percent jitter. — 


Test points are tested at temparture and Vcc limits. Refer to the datasheet. Typical conditions are: room 
temperature, nominal Vcc. 
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13.1 Write Data Timing 


WRDATA 
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NOTE: 
Invert high. 


13.2 Drive Control 


290486-99 


NOTE: 
For overlapped seeks, only one step pulse per drive selection is issued. Non-overlapped seeks will issue all programmed 
step pulses. Invert high. 
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13.3 Internal PLL 


RDDATA 
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NOTE: 
Invert high. 
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APPENDIX A 
FDC FOUR DRIVE SUPPORT 


Section 8.0 of this document completely describes the FDC when the module is configured for two drive 
support. In addition, the FDC commands in Section 8.0 provide four drive support information. This appendix 
provides additional information concerning four drive support. The signal pins that are affected by four drive 
support are described in Section A.1. Note that the FDC signals not discussed in this appendix operate the 
same for both two and four drive systems. The following registers are described in this appendix; Digital Output 
Register (DOR), Enhanced Tape Drive Register (TDR), and the Main Status Register (MSR). Some bits in 
these registers operate differently in a four drive configuration than a two drive configuration. 


; NOTES: 


-@ The descriptions in this appendix assume that four floppy drive support has been selected by setting 
FDDQTY to 1 in the AIPCFG1 Register. 


e Only drive 0 or drive 1 can be selected as the boot drive. 


A.1 Floppy Disk Controller Interface Signals 


These signal descriptions are for a four drive system (FDDQTY = 1 in the AIPCFG1 Register). See Section 2.0 
for two drive system signal descriptions. 


Signal Name | 


FDME1 #/DSEN # (1) 


Description 


FLOPPY DRIVE MOTOR ENABLE 1, or DRIVE SELECT ENABLE: In a 
_ four drive system, this signal functions as a drive select enable 

| (DSEN#). When DSEN # is asserted, MDS1 and MDSO reflect the 
selection of the drive. 


FLOPPY DRIVE SELECT1, or MOTOR DRIVE SELECT 1: In a four 
drive system, this signal functions as a motor drive select (MDS1). 

MDS1, together with MDSO, indicate which of the four drives | is selected, 
as shown i in note 1. 


FLOPPY DRIVE MOTOR ENABLE 0 or MOTOR ENABLE ENABLE: In 
a four drive system, this signal functions as a motor enable enable 

(MEEN #). MEEN # is asserted to enable the external decoding of MDS1 
and MDSO for the appropriate motor enable (see note 1). 


FLOPPY DRIVE SELECT 0 or MOTOR DRIVE SELECT 0: In a four 
drive system, this signal functions as motor drive select (MDSO). MDSO, 
together with MDS1, indicate which of the four drives is selected as 
shown in note 1. 


FDS1 #/MDS1(1) 


FDMEO # /MEEN #(1) 


FDSO#/MDSO() 


NOTE: 
1. These signal pins are used to control an external decoder for four floppy disk dives as shown below. Refer to the DOR 
Register Description in Section A.2 for details. 


MDS1 MDSO DSEN # =0 MEEN# =0 


0 0 Drive 0 MEO 
0 1 Drive 1 ME1 
1 0 Drive 2 ME2 
1 1 Drive 3 ME3 
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A.2. DOR—Digital Output Register 


1/O Address: = Base +2h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


The Digital Output Register enables/disables the floppy disk drive motors, selects the disk drives, enables/dis- 
ables DMA, and provides a FDC module reset. The DOR reset bit and the Motor Enable bits have to be 
inactive when the 82091AA’s FDC is in powerdown. The DMAGATE # and Drive Select bits are unchanged. 
During powerdown, writing to the DOR does not wake up the 82091AA’s FDC, except for activating any of the 
motor enable bits. Setting the motor enable bits to 1 will wake up the module. The four internal drive select and 
four internal motor enable signals are encoded to a total of four output pins as described in Table 47. Figure 99 
shows an example of how these four output pins can be decoded to provide four drive select and four motor 
enable signals. Note that only drive O or drive 1 can be used as the boot drive when four disk drives are 
enabled. 


7 6 5 4 3 2 1 0.—sC—BBit 
po} o fo} of eo] of o| o| eau 
Drive 0 Select (R/W) 
1=Selected 
0=Not Selected 
Drive 1 Select (R/W) 
1=Selected 
0=Not Selected 
FDC Reset (R/W) 
1=Resets FDC 
0=Does Not Reset FDC 
DMA Enable (R/W) 
1=Enable 
0=Disable 
Motor Enable 0 (R/W) 
(for Drive 0) 
1=Enable 
0=Disable 
Motor Enable 1 (R/W) 
1=Enable 
0=Disable 


Motor Enable 2 (R/W) 
1=Enable 
0=Disable 

Motor Enable 3 (R/W) 


1=Enable 
0=Disable 
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Figure 98. Digital Output Register 
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Motor Enable 3 (ME3): This bit controls a motor drive enable output signal and provides the signal . 
output for the floppy drive 3 motor (via external decoding) as shown in Table 46. | 


Description 


Motor Enable 2 (ME2): This bit controls a motor drive enable output signal and provides the signal 
output for the floppy drive 2 motor (via external decoding) as shown in Table 46. 


Motor Enable 1 (ME1): This bit controls a motor drive enable signal and provides the signal output 
- for the floppy drive 1 motor (via external decoding) as shown in Table 46. 


aS 


Motor Enable 0 (MEO): This bit controls a motor drive enable signal and provides the signal output 
for the floppy drive 0 motor (via external decoding) as shown in Table 46. 


DMA Gate (DMAGATE): This bit enables/disables DMA for the FDC. When DMAGATE = 1, DMA 

for the FDC is enabled. In this mode FDDREQ, TC, IRQ6, and FDDACK# are enabled. When | 
DMAGATE = 0, DMA for the FDC is disabled. In this mode, the IRQ6 and DRQ outputs are tri-stated 
and the DACK # and TC inputs are disabled to the FDC. Note that the TC input is only disabled to 

the FDC module. Other functional units in the 82091AA (e.g., parallel port or IDE interface) can still 
use the TC input signal for DMA activities. 


FDC Reset (DORRST): DORRST is a software reset for the FDC module. Wnen DORRST is set to 
0, the basic core of the 822091 AA’s FDC and the FIFO circuits are cleared conditioned by the LOCK 
bit in the Configure Command. This bit is set to 0 by software or a hard reset (RSTDRV asserted). 

_ The FDC remains in a reset state until software sets this bit to 1. This bit does not affect the DSR, 
CCR and other bits of the DOR. DORRST must be held active for at least 0.5 ws at 250 Kbps. This is 
less than a typical ISA I/O cycle time. Thus, in most systems consecutive writes to this register to 
toggle this bit allows sufficient time to reset the FDC. 


0 | Drive Select (DS[1:0]): This field provides the output signals to select a particular floppy drive (via 
external decoding) as shown in Table 47. Note that the drive motor can be enabled separately 

without selecting the drive. This permits the motor to come up to speed before selecting the drive. 
Note also that only one cirive can be selected at a time. However, the drive should not be selected 
without enabling the appropriate drive motor via bits[7:4] of this register. 


—h 
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Table 46. Output Pin Status for Four Disk Drives 


roa DOR Register Bits 


MEO and 

DSO enable 

ME1 and 

DS1 enable 

ME2 and xX 1 X X 1 1 

DS2 enable 

MES3 and 1 X 4 xX 1 1 1 1 
DS3 enable . 

MEO enable X X X | DS[1:0] #00 

only 


ME1 enable DS[1:0] #01 

only 

ME2 enable DS[1:0] +10 

only 

MES enable 1 DS[1:0] #11 1 1 
only 

No ME or x Xx 1 1 
DS enable 


NOTE: 

To enable a particular drive motor and select the drive, the value for DS[1:0] must match the appropriate motor enable bit 
selected as indicated in the first four rows of the table. For example, to enable the drive 0 motor and select the drive, MEO is 
set to 1 and DS[1:0] must be set to 00. To enable the drive motor and keep the drive de-selected the value for DS[1:0] 
must not match the particular motor enable as shown in the first four rows. For example, to enable the motor for drive 0 
while the drive remains de-selected, MEO is set to 1 and DS[1:0] is set to 01, 10, or 11. 
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2to4 
Decoder 


MDSO# 
_ MDS1# 


DSEN# 


MEEN# 2to4 
Decoder 
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Figure 99. Example External Decoder (Four Drive System) 


A.3. TDR—Enhanced Tape Drive Register 


1/O Address: Base +3h 
Default Value: 00h 
Attribute: Read/Write 
Size: 8 bits 


This register allows the user to assign tape support to a particular drive during initialization. Any future refer- 
ences to that drive number automatically invokes tape support. A hardware reset sets all bits in this register to 
0 making drive 0 not available for tape support. A software reset via bit 2 of the DOR does not affect this 
register. Drive 0 is reserved for the floppy boot drive. Bits[7:2] are only available wnen EREG EN = 1; other- 
wise the bits are tri-stated. 
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Default 


Tape Select (R/W) 
00=None (All are Floppy Drives) 


01=1 
10=2 
11=3 


Boot Drive Select (R/W) 
see Table 


Reserved 
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Figure 100. Enhanced Tape Drive Register 


a 


Boot Drive Select (BOOTSEL): The BOOTSEL bit is used to remap the drive selects and motor | 


enables. The functionality is shown below: 
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BOOTSEL Mapping 


0 DSO —» FDSO, MEO — FDMEO (default) 
DS1 — DS1, ME1 — FDME?1 
1 DSO — DS1, MEO — FDME1 


DS1 — FDSO, ME1.— FDMEO 
Only drive 0 or drive 1 can be selected as the boot drive. 


Tape Select (TAPESEL[1:0]): These two bits are used by software to assign a logical drive number 
to be a tape drive. Other than adjusting precompensation delays for tape support, these two bits do 
not affect the FDC hardware. They can be written and read by software as an indication of the tape 
drive assignment. Drive 0 is not available as a tape drive and is reserved as the floppy disk boot 
drive. The tape drive assignments are as follows: 


Bits[1:0] Drive Selected 


00 None (all are floppy disk drives) 
01 1 
10 2 
174 3 
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A.4 MSR—Main Status Register 


1/O Address: Base + 4h 
Default Value: 00h 
Attribute: Read Only 
Size: 8 bits 


This read only register provides FDC status information. This information is used by software to control the 
flow of data to and from the FIFO (accessed via the FDCFIFO Register). The MSR indicates when the FDC is 
ready to send or receive data through the FIFO. During non-DMA transfers, this register should be read before 
each byte is transferred to or from the FIFO. 


After a hard or soft reset or recovery from a powerdown state, the MSR is available to be read by the host. The 
register value is 00h until the oscillator circuit has stabilized and the internal registers have been initialized. | 
When the FDC is ready to receive a new command, MSR[7:0] = 80h. The worst case time allowed for the MSR 

to report 80h (i.e., RQM is set to ‘) is 2.5 us after a hard or soft reset. 


Main Status Register is used for controlling command input and result output for all commands. Some example 
values of the MSR are: 


@ MSR= 80H; The controller is ready to receive a command. 
¢ MSR=90H; Executing a command or waiting for the host to read status bytes (assume DMA mode). 
~@ MSR = DOH; Waiting for the host to write status bytes. 


Bit 
Default 


Drive 0 Busy (RO) 
see Text 


Drive 1 Busy (RO) 
see Text 


-—— Reserved 
Command Busy (RO) 


1=FDC Command in Progress 
0=No FDC Command in Progress 


Non DMA Mode (RO) 
see Text 


— Data I/O Direction (RO) 
1=Host Data Read Required 
0=Host Data Write Required 


Request for Master (RO) 
1=Host can Transfer Data 


0=Data Transfer is not Permitted 
290486-A4 


Figure 101. Main Status Register 
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Request For Master (RQM): When RQM = 1, the FDC is ready to send/receive data through the 
FIFO (FDCFIFO Register). The FDC sets this bit to 0 after a byte transfer and then sets the bit to 1 
when it is ready for the next byte. During non-DMA execution phase, RQM indicates the status of 
IRQ6. 


Direction I/O (DIO): When RQM = 1, DIO indicates the direction of a data transfer. When DIO= 1, 
the FDC is requesting a read of the FDCFIFO. When DIO=0, the FDC is requesting a write to the 
FDCFIFO. 


NON-DMA (NONDMA): Non-DMA mode is selected via the SPECIFY Command. In this mode, the 
FDC sets this bit to a 1 during the execution phase of a command. This bit is for polied data transfers 
and helps differentiate between the data transfer phase and the reading of result bytes. 


Command Busy (CMDBUSY): CMDBUSY indicates when a command is in progress. When the first 
byte of the command phase is written, the FDC sets this bit to 1. CMDBUSY is set to 0 after the last 


byte of the result phase is read. If there is no result phase (e.g., SEEK or RECALIBRATE 
Commands), CMDBUSY is set to 0 after the last command byte is written. 


Drive 3 Busy (DRV1BUSY): The FDC module sets this bit to 1 after the last byte of the command 
phase of a SEEK or RECALIBRATE Command is issued for drive 3. This bit is set to 0 after the host 
reads the first byte in the result phase of the SENSE INTERRUPT Command for this drive. 


Drive 2 Busy (DRV1BUSY): The FDC module sets this bit to 1 after the last byte of the command 
phase of a SEEK or RECALIBRATE Command is issued for drive 2. This bit is set to 0 after the host 
reads the first byte in the result phase of the SENSE INTERRUPT Command for this drive. | 


Drive 1 Busy (DRV1BUSY): The FDC module sets this bit to 1 after the last byte of the command 
phase of a SEEK or RECALIBRATE Command is issued for drive 1. This bit is set to 0 after the host 
reads the first byte in the result phase of the SENSE INTERRUPT Command for this drive. 


Drive 0 Busy (DRVOBUSY): The FDC module sets this bit to 1 after the last byte of the command 
phase of a SEEK or RECALIBRATE Command is issued for drive 0. This bit is set to 0 after the host 
reads the first byte in the result phase of the SENSE INTERRUPT Command for this drive. 
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82078 CHMOS SINGLE-CHIP 
FLOPPY DISK CONTROLLER 


Small Footprint and Low Height. 
Packages 


Supports Standard 5.0V as Well as ow 


Voltage 3.3V Platforms. 

— Selectable 3.3V and 5.0V 
Configuration 

-—5.0V Tolerant Drive Interface 


Enhanced Power Management 
— Application Software Transparency 


_ — Programmable Powercown 


- Command 

— Save and Restore Commands for OV 
Powerdown | 

— Auto Powerdown and Wakeup 
Modes 

— Two External Power Management 
Pins 

— Consumes No Power While in 
Powerdown 


Programmable Internal Oscillator 


Floppy Drive Support Features 

— Drive Specification Command 

— Media ID Capability Provides Media 
Recognition | 

— Drive ID Capability Allows the User 
to Recognize the Type of Drive 

— Selectable Boot Drive 

— Standard IBM and ISO Format 
Features 

— Format with Write Cornmand for 
High Performance in Mass Floppy 
Duplication 


Integrated Host/Disk Interface Drivers 


m Integrated Analog Data Separator 


— 250 Kbits/sec 
— 300 Kbits/ sec 
— 500 Kbits/ sec 
— 1 Mbits/sec 
— 2 Mbits/sec 


Integrated Tape Drive Support 

— Standard 1 Mbps/500 Kbps/ 
250 Kbps Tape Drives 

— New 2 Mbps Tape Drive Mode 


Perpendicular Recording Support for 
4 MB Drives 


Fully Decoded Drive Select and Motor 
Signals 


‘Programmable Write Precompensation 


Delays 


Addresses 256 Tracks Directly, 
Supports Unlimited Tracks 


16 Byte FIFO 


Single-Chip Floppy Disk Controller 

Solution for Portables and Desktops 

— 100% PC-AT* Compatible 

— 100% PS/2* Compatible 

— 100% PS/2 Model 30 Compatible 

— Fully Compatible with Intel’s 386SL 
Microprocessor SuperSet 

— Integrated Drive and Data Bus 
Buffers 


Available in 64 Pin QFP and 44 Pin QFP 


Package 
(See Package Specification Order Number 240800, Package 
Type S) 


The 82078 Product Family brings a set of enhanced floppy disk controllers. These include several features 
that allow for easy implementation in both the portable and desktop market. The current family includes a 64 
pin and a 44 pin part in the smaller form factor QFP package. The 3.3V version of the 64 pin part provides an 
ideal solution for the rapidly emerging 3.3V piatforms. It also allows for a 5.0V tolerant floppy drive interface 
that lets the users retain their normal 5.0V drives. Another version of the 64 pin part provides support for 2 
Mbps data rate tape drives. 


*Other brands and names are the property of their respective owners. 
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f ntel , 82078 


Table 1-0. 64 Pin Part Versions 


The 44 pin is targeted for platforms that are operated at 3.3V or 5.0V and do not require more than two drive 
support. The 82078-5 is designed for price sensitive 5.0V designs which do not include 4 MB drive support. 


Table 2-0. 44 Pin Part Versions 
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Both parts can be operated at 1 Mbps/500 Kbps/300 Kbps/250 Kbps. Additionally, one version of the 64 pin 
part provides 2 Mbps data rate operation specific for the new tape drives. 


The 82078 is fabricated with Intel’s advanced CHMOS Ill technology. 
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82078 44 PIN 
CHMOS SINGLE-CHIP FLOPPY DISK CONTROLLER 
= Small Footprint and Low Height m Integrated Tape Drive Support 
Package — Standard 1 Mbps/500 Kbps/ 
m Enhanced Power Management 250 Kbps Tape Drives 
— Application Software Transparency m Perpendicular Recording Support for 
— Programmable Powerdown 4 MB Drives 
Command : ; 
i Gajg.and Rosters Commande ior m Integrated Host/Disk Interface Drivers 
Zero-Volt Powerdown m Fully Decoded Drive Select and Motor 
— Auto Powerdown and Wakeup Signals | 
Modes m Programmable Write Precompensation 
— Two External Power Management Delays 
eee No Power While in m Addresses 256 Tracks Directly, 
Powerdown Supports Unlimited Tracks 
m Integrated Analog Data Separator m 16 Byte FIFO 
— 250 Kbps m Single-Chip Floppy Disk Controller 
— 300 Kbps Solution for Portables and Desktops 
— 500 Kbps — 100% PC/AT* Compatible 
— 1 Mbps — Fully Compatible with Intel386™ SL 
m Programmable Internal Oscillator = hal Drive and Data Bus 
25 See ee baci m Separate 5.0V and 3.3V Versions of the 
— Selectable Boot Drive 44 Pin part are Available 
— Standard IBM and ISO Format m@ Available in a 44 Pin QFP Package 
Features 


— Format with Write Command for 
High Performance in Mass Floppy 
Duplication 


The 82078, a 24 MHz crystal, a resistor package, and a device chip select implements a complete solution. All 
programmable options default to 82078 compatible values. The dual PLL data separator has better perform- 
ance than most board level/discrete PLL implementations. The FIFO allows better system performance in 
multi-master (e.g., Microchannel, EISA). 


The 82078 maintains complete software compatibility with the 82077SL/82077AA/8272A floppy disk control- 
lers. It contains programmable power management features while integrating all of the logic required for floppy 
disk control. The power management features are transparent to any application software. 


The 82078 is fabricated with Intel’s advanced CHMOS Ill technology and is also available in a 64-lead QFP 
package. 


*Other brands and names are the property of their respective owners. 


Refer to the 1995 Peripheral Components Handbook for the complete data sheet on this device. 

The complete document for this product is available on Intel’s “Data-on-Demand”’ CD-ROM product. Contact 
your local Intel field sales office, Intel technical distributor, or call 1-800-548-4725. 
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82078 64 PIN 
CHMOS SINGLE-CHIP FLOPPY DISK CONTROLLER 
‘Small Footprint and Low Height — Selectable Boot Drive 
Packages | ' —- Standard IBM and ISO Format 
Features 
m Supports Standard 5.0V as well as Low — Format with Write Command for 
Voltage 3.3V Platforms 
— Selectable 3.3V and 5.0V | High Performance in Mass Floppy 
Configuration Duplication 
— 5.0V Tolerant Drive Interface w Integrated Tape Drive Sipser 


m Enhanced Power Management — Standard 1 Mbps/500 Kbps/ 


snati 250 Kbps Tape Drives 
— Application Software Transparency : 
— Programmable Powerdown — New 2 Mbps Tape Drive Mode 
Command | m Perpendicular Recording Support for 
-—- Save and Restore Commands for : 4 MB Drives 


Zero-Volt Powerdown 
— Auto Powerdown and Wakeup m Integrated Host/Disk Interface Drivers 


Modes m Fully Decoded Drive Select and Motor 
— Two External Power Management Signals 
Pins m Programmable Write Precompensation 
— Consumes no Power when in Delays 
dasha tadesk m Addresses 256 Tracks Directly, 
m Integrated Analog Data Separator Supports Unlimited Tracks 
— 250 Kbps 
— 300 Kbps | - @ 16 Byte FIFO 
— 500 Kbps _ | m Single-Chip Floppy Disk Controller 
— 1 Mbps : Solution for Portables and Desktops 
—2 Mbps — 100% PC AT* Compatible 
i — 100% PS/2* Compatible 
m Programmable Internal Oscillator — 100% PS/2 Model 30 Compatible 
m Floppy Drive Support Features — Fully Compatible with Intel386™ SL 
— Drive Specification Command Microprocessor SuperSet 
 eecee ace a as m Integrated Drive and Data Bus Buffers 


— Drive ID Capability Allows the User m Available in 64 Pin QFP Package 
to Recognize the Type of Drive 


The 82078, a 24 MHz crystal, a resistor package, and a device chip select implements a complete solution. All 
_ programmable options default to 82078 compatible values. The dual PLL data separator has better perform- 
ance than most board level/discrete PLL implementations. The FIFO allows better system performance in 
multi-master (é.g., Microchannel, FFISA). 


The 82078 maintains complete software compatibility with the 82077SL/82077AA/8272A floppy disk control- 
lers. lt contains programmable power management features while integrating all of the logic required for floppy 
disk control. The power managernent features are transparent to any application software. There are two 
versions of 82078 floppy disk controllers, the 82078SL and 82078-1. 


The 82078 is fabricated with Intel's advanced CHMOS III technology and is also available in a A4-lead QFP 
package. 


*Other brands and names are the property of their respective owner. 


Refer to the 1995 Peripheral Components Handbook for the complete data sheet on this device. 
The complete document for this product is available on Intel’s “Data-on-Demand’’ CD-ROM product. Contact 
your local Intel field sales office, Intel technical distributor, or call 1-800-548-4725. 
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82077SL 
CHMOS SINGLE-CHIP FLOPPY DISK CONTROLLER 
m Completely Compatible with Industry m Integrated Analog Data Separator 
Standard 82077AA — 250 Kbits/sec 
m Single-Chip Laptop Desktop Floppy — 300 Kbits/sec 
Disk Controller Solution — 500 Kbits/sec 
— 100% PC AT* Compatible — 1 Mbits/sec 
— 100% PS/2* Compatible m Programmable Crystai Oscillator for On 
— 100% PS/2 Model 30 Compatible or Off 
= Sicerccee a ten 386SL = _Integrated Tape Drive Support 
icr u : : , 
— Integrated Drive and Data Bus m Perpendicular Recording Support 
Buffers m@ 12 mA Host interface Drivers, 40 mA 
m Power Management Features pis Utvers 
— Application Software Transparency m@ Four Fully Decoded Drive Select and 
— Programmable Powerdown Motor Signals 
Command m Programmable Write Precompensation 
— Auto Powerdown and Wakeup Delays 
Modes m Addresses 256 Tracks Directly, 
— Two External Power Management Supports Unlimited Tracks 
ene — m 16 Byte FIFO 
— Typical Power Consumption in -m 68-Pin PLCC 
: Power Down: 10 as A (See Packaging Handbook Order Number #240800, Package 
m High Speed Processor Interface Type N) 


The 82077SL, a 24 MHz crystal, a resistor package, and a device chip select implements a complete laptop 
solution. All programmable options default to 82077AA compatible values. The dual PLL data separator has 
better performance than most board level/discrete PLL implementations. The FIFO allows better system 
performance in multi-master systems (e.g., Microchannel, EISA). 


The 82077SL is a superset of 82077AA. The 82077SL incorporates power management features while main- 
taining complete compatibility with the 82077AA/8272A floppy disk controllers. It contains programmable 
power management features while integrating all of the logic required for floppy disk control. The power 
management features are transparent to any application software. The 82077SL is available in three ver- 
sions—82077SL-5, 82077SL and 82077SL-1. 82077SL-1 has all features listed in this data sheet. It supports 
both tape drives and 4 MB floppy drives. The 82077SL supports 4 MB floppy drives and is capable of operation 
at all data rates through 1 Mbps. The 82077SL-5 supports 500/300/250 Kbps data rates for high and low 
density floppy drives. 


The 82077SL is fabricated with Intel’s advanced CHMOS III technology and is available in a 68-lead PLCC 
(plastic) package. 


820775SL 
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Figure 1. 82077SL Pinout 
*PS/2-and PC AT are trademarks of IBM. 


Refer to the 1995 Peripheral Components Handbook for the complete data sheet on this device. 

The complete document for this product is available on Intel's “Data-on-Demand”’ CD-ROM product. Contact 
your local Intel field sales office, Intel technical distributor, or call 1-800-548-4725. 
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intel PRELIMINARY 
82595 
ISA/PCMCIA HIGH INTEGRATION 
ETHERNET CONTROLLER 
= Optimal Integration for Lowest Cost m Ease of Use - 


Solution 

— Glueless 8-Bit/16-Bit ISA/PCMCIA 2.0 

Bus Interface 

— Provides Fully 802.3 Compliant AUL 
and TPE Serial Interface 

— Local DRAM Support up to 
64 Kbytes 

— FLASH/EPROM Boot Support 

— Hardware and Software Portable 
between Motherboard, Adapter, and 
PCMCIA IO Card Solution 


High Performance Networking 

Functions 

— 16-Bit |O Accesses to Local DRAM 
with Zero Added Wait-States 

— Ring Buffer Structure for Continuous 

- Frame Reception and Transmit 
Chaining 

— Automatic Retransmission on 
Collision 

-— Automatically Corrects TPE poy 
Switching Problems 


Low Power CHMOS IV Technology 


— Design Time Reduced by High 
Integration  — 

— EEPROM Interface to Support 
Jumperless Design 

— Software Structures Optimized to 
Reduce Processing Steps 

— Automatically Maps into Unused PC 
IO Location to Help Eliminate LAN 
Setup Problems 

— All Software Structures Contained in 
One 16-Byte IO Space 

— Automatic or Manual Switching 
between TPE and AUI Ports 


— JTAG Port for Reduced Board 


Testing Times 


Power Management | 

— SL Compatible SMOUT Power Down 
Input 

— Software Power Down Command for 
non-SL Systems 


144-Lead tQFP Package Provides 


Smallest Available Form Factor 
(See Packaging Spec., Order No. 240800) 


A0-9 TPE 
— At4=19 eteouea TPE Serial Link I/F 
DO-15 / Interface 
Bus CSMA/CD LED 
Interface Control 

_ Control 
SMOUT 

AUI Serial AUI 

Local Memory Interface Link I/F 


Interface (DMA) 


MADDRO-8 
MDATAO-3 
Control 


ee 
Local DRAM !/F 


FLASH/BT PROM \/F 
1A PROM I/F (Opt) 
Serial EEPROM I/F 


20 MHz XTAL 
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Figure 1. 82595 Block Diagram 


For the complete data sheet on this product, refer to the 1995 Networking handbook. 
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82593 
CSMA/CD CORE LAN CONTROLLER 
m Supports Industry Standard LANs m High Speed, 5V CHMOS IV (P648.8) 
— IEEE 10BASE5 (Ethernet*) Technology 
— IEEE 10BASE2 (Cheapernet) wal ER } 
—|EEE 10BASE-T (TPE) re segs _— up to 20 Mb/s 
= Simple, High-Performance Control and — Direct Interface to Intel 82C501AD 
Data Interface fe ee ESI or AMD 7992 SIA 
— Control and Status via RD, WR, and — Conforms to 802.3 CSMA/CD 
CS Lines Standard 


— Data Transfers via DMA Interface 
— Two Clocks per DMA Transfer 
— Programmable Bus Throttle Timer 


m On-Board Diagnostics 
— Internal and External Loopback 


Operation 
m High-Performance Networking — Internal Register Dump 

Functions — TDR Functionality 

ee eee from m 44-Lead PLCC Package Type N 
nterna (82593SX), 44-Lead QFP Package 

— Back-to-Back Frame Reception with Type S (82593SX) or 28-Pin PDIP 
No CPU Intervention — 82593SX (8/16-Bit) System Clock up 

— Receive Ring Buffer Memory to 20 MHz | 
Structure _ — 82593SX Package Pin Compatible | 

— Transmit Frame Chaining with Intel 82592 PLCC 

— 96-Byte Transmit FIFO and 96-Byte (See Packaging Spec., Order No. 240800-001 Package Type N, S 
Receive FIFO and P) | 


Port 0 
Command & 
Status Regs 


Bus Throttle Timer 


Control 
interface 


Port 1 
Command & 
Status Regs 


Receive 
; FIFO J 
=. Transmit 
FIFO 


Data 
Input/Output 


Data Interface 
(8 or 16 Bit) 


Back-to-Back 
RCV & Ring 
Buffer Logic 

(RFP & Stop Reg) 


Two Channel 
DMA Interface 
Logic 


Internal 
Retransmit 
Control 


Configuration 
Registers 
& IA 


DMA Interface 


Parallel FIFO Serial 
Subsystem Subsystem Subsystem 


290411-1 
Figure 1. 82593 Block Diagram 


*Ethernet is a registered trademark of Xerox Corporation. 


For the complete data sheet on this product, refer to the 1995 Networking handbook. 
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intel. 
82503 
DUAL SERIAL TRANSCEIVER (DST) 


82503 PRODUCT FEATURE SET OVERVIEW 


m Single Component Ethernet* Interface — m Diagnostic Loopback 


m Automatic or Manual Port Selection Network Status Indicators 
m Manchester Encoder/Decoder and Defeatable Jabber Timer 


rai 
"i 
im 
Clock Recovery ‘ - 
m No Glue Interface to Industry-Standard m User Test Modes 
m 10 MHz Transmit Clock Generator 
a 


LAN Controllers 
— Intel 82586, 82590, 82593 and 82596 One Micron CHMOS** IV vee! 


— National Semiconductor 8390 and 
83932 (SONIC*) m Single 5-V Supply 


— Western Digital 83C690 
— Fujitsu 86950 (Etherstar*) 


INTERFACE FEATURES 


TPE oe AUI 
m Complies with 10BASE-T, IEEE Std. m= Complies with IEEE 802.3 AUI Standard 


802.3i-1990 for Twisted Pair Ethernet = Direct Interface to AUI Transformers 
Selectable Polarity Switching m On-Chip AUI Squelch 
Direct Interface to TPE Analog Filters 


On-Chip TPE Squelch 
Defeatable Link Integrity (LI) 
m= Support of Cable Lengths > 100m 


A block diagram of a typical application is shown in Figure 1. The 82503 Dual Serial Transceiver is a high-inte- 
gration CMOS device designed to simplify interfacing industry standard Ethernet LAN Controllers to IEEE 
802.3 local area network applications (1{0OBASE5, 10BASE2, and 10BASE-T). The component supports both 
an attachment unit interface (AUI) and a Twisted Pair Ethernet interface (TPE). It allows OEMs to design a 
state-of-the-art media interface that is jumperless and fully automatic. The 82503 includes on-chip AUI and 
TPE drivers and receivers; it offers i eae a cost-effective, integrated SOON for interfacing LAN conte: 
lers to the wire medium. > 8 


**CHMOS is a patented process of Intel Corporation. 
*Ethernet is a registered trademark of Xerox Corporation. 
LANCE is a registered trademark of Advanced Micro Devices. 
Etherstar is a registered trademark of Fujitsu Electronics. 
Sonic is a registered trademark of National Semiconductor Corporation. 


For the complete data sheet on this product, refer to the 1995 Networking handbook. 
November 1992 
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Ethernet* LAN Card Product Brief 


Product @ Complete plug and play 


Highlights 


PCMCIA LAN solution. Comes 
with all drivers, installation, 
card and socket services and 
card management software 
necessary for reliable operation 
ina PCMCIA slot. 


@ Drivers for all major network 
operating systems. 


@ Industry-recognized Intel 
SoftSet installation software. 
Easy to operate and manage. 


@ Based on highly integrated Intel 
82595 Ethernet Controller 


@ Complies with PCMCIA 2.0/ 
JEIDA 4.1 68-pin standard. 


@ 5 mm-thick PCMCIA Type II 
card. 


@ Detachable line adapter module 
(LAM) for multiple media 
attachment. 


@ Ethernet IEEE 802.3 
compatibility (1OBASE-T/TPE, 
10BASE- 2/BNC). 


@ Activity and link integrity LEDs. 


The Intel PCMCIA Ethernet LAN Card brings high performance and ease of use to 
PCMCIA networking. It lets you put networking capabilities into laptop computers 
without a lot of hassle, headache or expense. 


It's a plug and play solution, providing PCMCIA network- readiness right out of 
the box. All the software you need comes with the card: drivers for the most popular 
network operating systems (Novell NetWare* 2.2, 3.11, 4.0 and Lite 1.0; Microsoft® 
LAN Manager* 2.X; IBM* LAN Server® 2.X; Banyan Vines* 4.x; and Microsoft 
Window's for Workgroup* 3.1), PCMCIA- compliant card and socket services from 
SystemSoft*, and Intel's own card manager and Softset auto-configuration, auto- 
installation software that gets users on the network fast. 


The Intel PCMCIA Ethernet LAN Card is based on the highly integrated Intel 
82595 ISA/PCMCIA Ethernet Controller, giving you 16-bit desktop performance in 
a PCMCIA form factor. As we develop additional software functionality for our 
82595 line, you'll be able to leverage it across your entire 82595-based product 
line, from chips to NICs to PCMCIA form factor products. You decrease your time 
to market by taking advantage of Intel's product development efforts, while 
increasing the value of your products. 


The card is standard PCMCIA 2.0 68-pin form factor and only 5 mm thick. It's 
passed Intel's extensive quality and reliability testing to ensure that it stands up to 
the rigors of mobile users on the go. These include PCMCIA mechanical 
qualification testing such as torque, bend, shock, vibration and environmental 
testing across extreme temperatures and voltages. Our CMOS technology and 
highly integrated 82595 are very power efficient, letting mobile users stay 
connected to the network for a long time. Maximum power draw is 85 mA; in idle 
mode, power consumption drops to 20 mA. 


Finally, we've made it easy to customize our card and its accessories to vour ow? 
OEM marketing needs. Manuals, software diskettes and the cards themselves can 
all be tailored to reflect your company’s look. 


4-213 


Product Description 
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The Intel PCMCIA Ethernet LAN Card is the fastest, easiest way to deliver network- 
ready laptops to your customers. It snaps in and installs in minutes, not hours, thanks 
to a disk-full of software to make your job easier. Our industry-standard SoftSet 
installation utility automatically configures the card and sets up the software with a 
single command. Built-in card and socket services software provides card recogni- 
tion and compatibility. Built-in card rianagement software performs IRQ manage- 
ment and allows the card to be installed even when the system is running, so you 
don’t have to reboot. There's even built-in driver support for popular network 
operating systems from Novell, Microsoft, IBM and Banyan. There are no jumpers 
or switches to set. no IRQ addresses to labor over. The card’s installed and working 
in five minutes. Once installed, it’s easy to operate too, improving customer 
satisfaction and decreasing the number of support calls you receive. 


The card is fully PCMCIA 2.0/JEIDA 4.1 compliant with a standard 68-pin form 
factor. It’s also fully compliant with Ethernet IEEE 802.3 standard for |OBASE-T 
and IOBASE-2 wiring. , 


There’s a detachable line adapter module (LAM) for attachment to multiple media 
and LEDs to indicate active and link integrity. 


Features Benefits 


— Installation, card manager, and card — A complete solution; no need for any other pieces 
and socket services software Easy to install, easy to configure, easy to use 


— Broad client driver support | — Meets broad target market. Usable on industry- 
standard networks like Novell, 
LAN Manager and Banyan 


— Complies with PCMCIA 2.0/JEIDA 4.1 — No jumpers 
standards Easy to transpert 
Small form factor 


— Intel-based 82595 | — Great performance; 8 & 16-bit data path 
. Glueless interface to PCMCIA Bus — 


— Activity and link integrity LEDs — Indicates card status, improves diagnostics 


— Supports both twisted pair Ethernet (1OBASE-T) — Flexibility to connect to multiple network media 
and ThinCoax (BNC |0BASE-2) 


Product Codes 


MBLA8110 


MBLA8120US 
MBLA8120EU 


Additional Literature 


Local Area Networking Family Product Brief 
EtherExpress Family Brochure 
TokenExpress Family Brochure 

Intel 82595 Data Sheet 

Intel 82595 User's Manual 


* Other brands and names are the property of their respective owners, 


Order Number: 297 120-003 
©Inte! Corporation, 1993 


TPE all geographies 


BNC US 
BNC Europe 


297085-002 
D413.01 
D414.02 
290458-003 
TBD 
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Product 
Highlights 


DataFax 14.4 Card Product Brief 


iB LAM-less design (Integrated 
DAA) 


I Complies with PCIMCIA 2.0 and 
JEIDA 4.1 standards . 


I@ PCMCIA Type II card — 
5.0 mm thick 


if Automatic power clown mode 
tf DITMF AND PULSE dialing 


18 Fully compliant with CCITT 
T.30 and T.4 (Group 3 fax) 


{8 Compliant with CCITT V.17 
(14.4 Kbps fax) 


§8 Provides both Send and Receive 
fax capability 


HB 8-bit I/O bus interface 


H8 Includes Ring Detect notification 
to host computer in the power 
down mode 


§8 FCC Class B, UL and DOC/ 
UL (Canada) 


il Compatible with ELA/TIA 602 
(AT command set) 


fi Data modem complies with 
CCITT V.32bis, V.32, V.22bis, 
V.22, V.21, Bell* 2112a, 103 


Hi Supports CCITT V.42 error 
detection and V.42iis data 
compression 


EB Provides MNP/5* clata compres- 
sion for backward compatibility 


#8 Requires no external power 
aa MNP/10 
Gi Hayes AutoSync 


With the design of the DataFax 14.4, Intel introduces a high speed fax card 
with no external line adapter module. Lightweight. Easy to carry. And less 
chance of damage or loss. Not only does the DataFax 14.4 feature inte- 
grated DAA, but its high speed transmission helps your customers reduce 
the telephone expenses associated with faxes and modems. 


The DataFax 14.4 fax card is clso Group 3 compliant, assuring worldwide 
compatibility with most fax machines operating today. It sends and receives 
faxes and transfers files to or from notebook or hand-held computers over 
the public telephone network. 


The Intel DataFax 14.4 also conforms to PCMCIA 2.0 AND JEIDA 4.1 
physical and electrical standards for portable computers. The size of a 
credit card, the DataFax 14.4 slides into an external slot in the notebook 
computer, and connects easily into the public switched telephone network. 


The card features four power management modes — on-line, active, power 


save, and power down — to ensure the lowest possible power consumption. 
No external power is required. 


Product Description 


The DataFax 14.4 combines high speed with an integrated DAA design to provide 
optimal connectivity for notebooks and sub-notebooks. PCMCIA cards enhance the 
multiple functionality of the notebook computer. Intel's integrated DAA design 
incorporates the telephone interface circuitry directly onto the card.. This includes a 
ring detector and telephone line coupling transformer. A six-foot cable connects the 
DataFax 14.4 card to a standard RJ-11 modular telephone jack for connectivity to 
the telephone network. 


The DataFax 14.4 supports V.42 error correction which ensures that errors caused 
by the phone system are automatically corrected. The DataFax 14.4 increases data 
throughput with V.42bis data compression. This detects redundant characters, 
character sequences, and uses fewer bits to send more frequently occurring 
sequences. DataFax !4.4 users enjoy an effective throughput of up to 57,600 bits 
per second. 


Unlike other card manufacturers, Intel provides a full-service program for card 
labeling, custom kitting with third party applications, and fulfillment. The Intel 
DataFax 14.4 is the only fax card designed with recessed covers in order to accept 
adhesive labels, front and back, insuring quick turnaround for custom orders. 


Features ' Benefits 


— Integrated DAA — No external circuitry 
— Compact and lightweight 
— Easy to carry 
— Fits in briefcase with notebook computer 


— Exchangeable with other cards — Single slot serves multiple functions 


— CCITT V.32bis (14.4 Kbps) — Ensures connectivity worldwide 
— Faster transmission means less costly phone bills 


— Supports V.42/V.42bis — Provides high throughput (57.6 Kbps) 


— MNP/I0 — Enhanced data throughput with cellular connections 
(required phone adapter) 


— Hayes AutoSync — Provides synchronous communications capability 
(software required) 


*Other brands and names are the property of their respective owners. 


Order Number: 29731 1-002 
© Intel Corporation, 1993 
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Product 


Highlights — 


Faxmodem 24/96 Card Product Brief 


i Integrated DAA on the card for 
U.S./Canada 


{8 Complies with PC/MCIA 2.0 and. 
JEIDA 4.1 standards 


1@ PCMCIA Type II card 
@ 5mm thick 


{8 Fully compliant with CCITT 
T.30 and T.4 (group 3 fax) 


18 Provides both Send and Receive 
fax capability 


{8 Fully compliant with EIA/TIA 
578 (fax class 1 connmand set) 


18 Data modem complies with 
CCITT V.22bis, V.22, V.21, 
Bell* 212a and 103 


IB Supports CCITT V.42 error 
detection and V.42bis data 
compression 


if Provides MNPS5 data 
compression for backward 
compatibility 


8 Includes Ring Detect notification 
to host computer in the power 
down mode 


ff. Multiple power conservation 
modes 


§8 Requires no external power 


The Faxmodem 24/96 enables you to send or receive faxes and transfer files 
to or from notebook computers over the public telephone network. It is the 
latest member of Intel’s 1/0 card “plug and play” family. 


The Intel Faxmodem 24/96 also conforms to PCMCIA 2.0 AND JEIDA 4.1 
physical and electrical standards for portable computers. The card's light 
weight and low power consumption combine to provide high-performance, 
low-cost connectivity for notebooks and sub-notebooks. 


Approximately the size of a credit card, the Faxmodem 24/96 slides into 

an external slot in the notebook computer, connecting easily into the public 
switched telephone network. The card features four power management 
modes — on-line, active, powe: save, and power down — to ensure the 
lowest possible power consumption. No external power is required. 


Product Description 


The Faxmodem 24/96 card provides convenient, high-performance mobile fax and 
data communications capability for the notebook computer user. Adherence to 
accepted national and international standards provides the user with worldwide 
connectivity for both fax (approximately 30 million machines in use worldwide) 
and data transfer (V.22bis is the most widely used communications standard in 
the world). Its exchangeability with other PCMCIA cards enhances the multiple 
functionality of the notebook computer. 


The Faxmodem 24/96 card contains the Intel 89C124FX integrated data-fax modem 
chipset, a UART, a microcontroller, an analog front-end and other supporting 
devices. The 16450-type UART provides an 8-bit data bus interface. This, together 
with the Class | AT command set support, provides compatibility with all the major 
fax application software. 


For U.S. and Canada, the DAA is integrated on the card. For rest of world, the 
detachable line adapter module incorporates country-specific telephone interface 
circuitry. This consists of the ring detector and telephone line coupling transformer. 
A standard RJ-11 modular telephone jack provides connectivity to the telephone 
network. A short cable connects the line adapter module to the Faxmodem 24/96 
card. 


The Faxmodem 24/96 supports V.42 error correction which ensures that errors 
caused by the phone system are automatically corrected. The Faxmodem 24/96 also 
supports V.42bis data compression. This increases data throughput by detecting 
redundant characters, character sequences, and using fewer bits to send more 
frequently occurring sequences. Throughput for file transfer operations is increased 
by as much as 400 percent, providing the user with an effective 9600 bits per second. 


Features Benefits 
— Integrated DAA — No external circuitry 
— Compact and lightweight 
— Easy to carry 
— Fits in briefcase with notebook computer 


| — Exchangeable with other cards — Single slot serves multiple functions 


— Group 3 fax — Compatible with worldwide installed base of 
fax machines 


— Class | fax command set — Compatible with standard communications packages 
— CCITT V.22bis, V.22, V.21, Bell 212 and 103 — Ensures connectivity worldwide 
— Supports V.42/V.42bis — Faster; up to 4 to | data compression providing an 


effective throughput of 9600 bits per second for file 
transfers 


— Modem supports AT command set —— Compatible with standard communications packages 


— Multiple power conservation modes — Prolong system battery life 


— Factory Configuration Option for Cellular Network — Ease of use 


* Other brands and names are the property of their respective owners. 


Order Number: 297390-001 
©Intel Corporation, 1993 
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82489DX FEATURES OVERVIEW 


m Advanced Interrupt Controller for 
32-Bit Operating Systems 


@ Solution for Multiprocessor Interrupt 


Management 


mg Dynamic Interrupt Distribution for Load 


Balancing in IMP Systems 


mw Separate Nibble Bus (Interrupt 
Controller Communications (iCC) ove 


for Interrupt Messages 


ExtINTA < 


A[ 10:3] _ 


TCK = 
TRST = 
TDI = 
TMS = 
TDO : 


System 
Decode 
& 
Control 
Unit 


JTAG 
Compatible 
Boundary 
Scan TAP 
Controller 


ADVANCED PROGRAMMABLE 
INTERRUPT CONTROLLER 


m Inter-Processor Interrupts 


m Various Addressing Schemes— 
Broadcast, Fixed, Lowest Priority, etc. 


Compatibility Mode with 8259A 
32-Bit Internal Registers 
Integrated Timer Support 

33 MHz Operation 


132-Lead PQFP Package, Package 
Type KU 


(See Packaging Specification. Order Number: 240800) 


82489DX Block Diagram 


< 
e 
< 
a 
Li 
Oo 
<<. 
iL 
~ 
lad 
— 
z 
= 
a. 
I 

N 
ae] 


PNM: 


LintO 


: Lint 
Local Unit 


ICC Bus 


IntinO 
Intin1 


Intin15 


Timer Int 


32-Bit Timer . TMBASE 


290446-1 


Refer to Application Note AP-388: 82489DX User’s Manual (Order Number 292116) when evaluating your 


design needs. 
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1.0 INTRODUCTION 


The 82489DX Advanced Programmable Interrupt | 


Controller provides multiprocessor interrupt man- 
- agement, providing both static and dynamic sym- 
metrical Interrupt distribution across all processors. 


The main function of the 82489DxX is to provide in- 
terrupt management across all processors. This dy- 
namic interrupt distribution includes routing of the in- 
terrupt to the lowest-priority processor. The 
82489DX works in systems with multiple 1/O subsys- 
tems, where each subsystem can have its own set 
of interrupts. This chip also provides inter-processor 
interrupts, allowing any processor to interrupt any 


processor or set of processors. Each 82489DX I/O. 


unit Interrupt Input pin is individually programmable 
by software as either edge or level triggered. The 
interrupt vector and interrupt steering information 


DATA/ADDR Bus 
To Processor /MBC 


Interrupt 
Managemsent Logic 


Local Bus 
Interface 


Internal DATA/ ADDR 
Bus 


34 
Entry 0 


Redirection Table 
16 Entries 


interrupt Lines 
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Entry 15 


Entry 15 


32 ISR Bits : 
32 TMR Bits i 


32 IRR Bits 


0 31 


| 

: intel ® 
can be specified per pin. A 32-bit wide timer is pro- 
vided that can be programmed to interrupt the local 
processor. The timer can be used as a counter to 
provide a tirne base to software running on the proc- 
essor, or to generate time slice interrupts locally to 
. that processor. The 82489DX provides 32-bit soft- 
ware access to its internal registers. Since no 
82489DX register reads have any side effects, the 
82489DX registers can be aliased to a user read- 
only page for fast user access (e.g., performance 

monitoring timers). | 


The 82489DX supports a generalized naming/ad- 
dressing scheme that can be tailored by software to 
fit a variety of system architectures and usage mod- 
els. It also supports 8259A compatibility by becom- 
ing virtually transparent with regard to an externally 
connected 8259A style controller, making the 8259A 
visible to software. | 
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Figure 1. 82489DX Architecture 
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2.0 FUNCTIONAL OVERVIEW 


82489DX Functional Blocks 


82489DX contains one Local Unit, one I/O unit and 
a timer. The ICC bus is used to pass interrupt mes- 
sages. | 3 


ICC BUS 


The ICC bus is a 5-wire synchronous bus connecting 
all 82489DxXs (all |/O Untis and all Local Units). The 
Local Units and !/O Units communicate over this 
ICC bus. Four of these five wires are used for data 
transmissions and arbitration, and one wire is a 
clock. 


LOCAL UNIT 


The Local Unit contains the necessary intelligence 
to determine whether or not its processor should ac- 
cept interrupt messages sent on the ICC bus by oth- 
er Local Units and I/O Units. The Local Unit also 
provides local pending of interrupts, nesting and 
masking of interrupts, and handles all interactions 
with its local processor such as the INT/INTA/EOI 
protocol. The Local Unit further provides inter-proc- 
essor interrupt functionality and a timer to its local 
processor. The interface of a processor to its 
82489DX Local Unit is identical for every processor. 


1/O UNIT 


The !/O Unit provides the interrupt input pins on 
which I/O devices inject interrupts into the system in 
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the form of an edge or a level. The !/O unit also | 
contains a Redirection Table for the interrupt input 
pins. Each entry in the Redirection Table can be in- 
dividually programmed to indicate whether an inter- 
rupt on the pin is recognized as either an edge or a 
level; what vector and also what priority the interrupt 
has; and which of all possible processors should 
service the interrupt and how to select that proces- 
sor (statically or dynamically). The information in the 
table is used to send interrupt messages to all 
82489DX Units via the ICC bus. 


TIMER 


The 82489DxX provides a 32-bit wide timer that can. 
be programmed to interrupt the local processor. The 
timer can be used as a counter to provide a time- 
base to software running on the processor, or to 
generate time-slice interrupts local to that proces- 
sor. 


3.0 PIN DESCRIPTION 


The 82489DxX pin description is organized in a small 
number of functional groups. Pin definitions and pro- 
tocols have been designed to minimize interface is- 
sues. In particular, they support the notion of inde- 
pendently controlled address and data phases. The 
primary host interface is synchronous in nature. 


In the following pin definition table if the signal name 
has (__) over it, the signal is in its active state when it 
has a low level. The signal direction column identi- 
fies output only signals as a continuous drive (O), 
tristate (T/S), or open drain (O/D). All bi-directional 
(BI-D) signals have tri-stating outputs. 
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Pin Definition Table 
FEE Pin No. Function 
SYSTEM PINS | 
RESET The RESET INPUT forces 82489DxX to enter its initial state. The 82489DX 
l.ocal Unit in turn asserts it PRST (Processor Reset) output. All tri-state 
Outputs remain in high impedance until explicitly enabled. 


ExtINTA 


The EXTERNAL INTERRUPT ACKNOWLEDGE output is asserted (high) 
when an external interrupt controller (e.g., 8259) is expected to respond to 
the current INTA cycle. If deasserted (low), 82489DxX will respond, and the 
INTA cycle must not be delivered to the external controller. 


CLKIN 


CLOCK INPUT provides reference timing for most of the bus signals. | 


TEST RESET is the JTAG compatible boundary scan TAP conten reset 
pin. A weak pull-up keeps the pin high if not driven. 


ee CLOCK is the clock input for the JTAG compatible boundary scan 
* ioe and latches. 


TEST DATA INPUT is the test data input pin for the JTAG compatible 
boundary scan chain and TAP controller. A weak pull-up keeps this pin high 
if not driven. 


TEST DATA OUTPUT is the test data output for the JTAG compatible - 
boundary scan chain. 


TEST MODE SELECT is the test mode select pin for the JTAG boundary 
scan TAP controller. A weak pull-up keeps this pin high if not driven. 


TIMER PINS 


TMBASE The TIME BASE input provides a standard frequency that is only used by 
the 82489DX timer and that is the 82489DxX timer and that is independent of the system clock. 


INTERRUPT PINS 


INTIN[15:0] | 82-97 These 16 INTERRUPT INPUT pins accept edge or level sensitive interrupt 
requests from |/O or other devices. The pin numbers are specified 
respectively. INTIN15 corresponds to pin number 82, INTIN14 corresponds 
to pin number 83 etc., and INTINO corresponds to pin number 97. These 
pins are active ‘Pinsareactivehigh, . 

80 
81 


LINTIN(1] Two LOCAL INTERRUPT INPUT pins accept edge or level sensitive 
LINTIN[O] interrupt requests that can only be delivered to the connected processor. 
| These _These pins are active high. are active high. 


REGISTER ACCESS PINS 


ADDRESS "ADDRESS STROBE signal indica! signal indicating the start of a bus cycle. 82489DX 
does not commit to start the cycle internally until BUS GRANT is detected 
active. 
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Pin Definition Table (Continued) 


[Smmvor] Pn [tye[_——————~neton 
[REGISTER ACCESSPINS (Continued) 


REGISTER ACCESS PINS (Continued) 


63 Bus cycle definition signals. Note that since the > 82489DxX registers can be 
61 mapped in either memory or I/O space, the M/IO pin is not used for register 
62 access cycles; it is only used to decode interrupt acknowledge cycles. _ 


82489DX does not respond to code read cycles. 


The BUS GRANT input is optional and is used to indicate the address phase of 
a bus cycle in configurations where address timing cannot be inferred from 
ADS. This signal is really used as an address latch enable, but is named as it is 
to indicate that it can normally be connected to the Intel Cache Controller 
generated signal of the same name. Must be tied low if not used. 


BI-D | The address pins are used as inputs in addressing internal register space. 
BI-D | Output function is reserved. They are also used to latch local unit ID on reset. 


DATA LATCH/ENABLE is optional and is used to indicate committing the data | 
phase of a bus cycle in configurations where data timing cannot be inferred 
from other cycle timings. Must be tied low if not used. 


BI-D | The DATA BUS is for all register accesses and interrupt vectoring. 
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Pin Definition Table (Continued) 


| Symbol | Pin No. Type | | Function 


REGISTER ACCESS PINS (Continued) 


One Data Parity pin for each byte on the data bus. EVEN parity is generated 
any time the data bus is driven by the 82489DxX. 


READY output indicates that the current bus cycle is complete. In the case of 
a read cycle, valid data and the return to inactive state after going active low 
may be delayed till DLE goes active. 


The PROCESSOR INTERRUPT OUTPUT indicates to the processor that one 
or rnore maskable interrupts are pending. This pin is tri-stated at reset, and has 
an internal pull-down resistor to preverit false signaling to the processor until 
the 82489DX Local Unit is enabled and this pin is actively driven. 


The PROCESSOR RESET OUTPUT is asserted/de-asserted upon 82489DX 
reset, and also in response to ICC bus messages with “RESET” delivery 
‘mode. This pin should be used with care. 


The NON-MASKABLE INTERRUPT output is signaled in respone to ICC bus 

messages with “NMI” delivery mode. This pin is tri-stated at reset, and has an 

internal puil-down resistor to prevent false signaling to the processor until the 
Local Unit is enabled and this pin is actively driven. 


ICC BUS PINS 


The ICC BUS CLOCK input provides synchronous operation of the ICC bus. 


The four ICC BUS IN inputs are used for incoming ICC bus messages. In 
smaller configurations the ICC bus input and outputs may be tied directly 
together at the pins. Pin number for MBI3 is 76, MBI2 is 77, MBI1 is 78 and 
MBIO is 79. 


The four ICC BUS OUT outputs are used for outgoing ICC bus messages. The 
current capacity is only 4 mA. So external bufferes will be needed. 
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Pin Definition Table (Continued) 


| Symbol | Pino. | Type | Function 


RESERVED PINS 


34,42 | NC | These pins MUST BE LEFT OPEN. 
70, 72, 75 ae Reserved by intel. These pins should be strapped to Vcc. 
71, 19, 20 — Reserved by Intel. These pins should be strapped to GND. 


POWER AND GROUND PINS 


Voc 1,32,69,98 | POWER | Nominally + 5V. These pins along with Vss and Vss; should be 
separately bypassed. 
Vocp 6, 15, 25,100, | POWER | Nominally + 5V. These pins along with Vssp should be separately 
108, 117, 126 bypassed. 
Vcocpo 39, 46 POWER | Nominally + 5V. These pins along with Vsspo should be separately 
bypassed. 
5, 33, 67, Nominally OV. These pins along with Vcc should be separately 
68, 99 bypassed. 


Vssp 10, 17, 23, 30, Nominally OV. These pins along with Vccp should be separately 
bypassed. 


106, 113, 120, 
127, 132, 


Vsspo 36, 40, 44, Nominally OV. These pins along with Vccpo should be separately 
47,50 bypassed. 
Vss Nominally OV. These pins along with Vcc should be separately 
bypassed. 


NOTE: 
Voc: Vecp and Vocpo should be of same voltage. Vss, Vssp, Vsspo and Vsg; should be OV. 


4.0 FUNCTIONAL DESCRIPTION — I/O Unit 


As far as interrupt management is concerned, the The I/O Unit consists of a set of Interrupt Input pins, 
82489Dx’s interrupt control function spans over two an Interrupt Redirection Table, and a message unit 
functional units, the |/O Unit of which there is one for sending and receiving messages from the ICC 
per 1/O subsystem, and the Local Unit of which bus. The I/O Unit is where I/O devices inject their 
there is one per processor. 82489DX has one I/O interrupts, the I/O Unit selects the corresponding 
unit and one Local Unit in a single package. This entry in the Redirection Table and uses the informa- 
section takes a detailed look at both local and I/O tion in that entry to format an interrupt request 
Units. message. The message unit then broadcasts this 
message over the ICC bus. The content of the Redi- 
rection Table is under software control and is as- 
signed benign defaults upon reset. The masks in the 
Redirectional Table entries are set to 1 at hardware 
reset to disable the interrupts. 
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Figure 2. 82489DxX I/O Unit Block Diagram 
Local Unit | | mode of the interrupt, zero, one or more units can 
accept an interrupt. A Local Unit accepts an inter- 
Interrupt Management of the Local Unit is responsi- __rupt only if it will deliver the interrupt to its processor. 
ble for local interrupt sources, interrupt acceptance, Accepting an interrupt is purely an inter-82489DX 


dispensing interrupts to the processor, and sending _—- Matter; dispensing an interrupt to the local proces- 
inter-processor interrupts. Depending on the delivery sor only involves a 82489DX and its local processor. 
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Figure 3. 82489DX Local Unit Block Diagram 
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5.0 INTERRUPT CONTROL 
MECHANISM 


This section describes briefly the interrupt control 
~mechanism in the 82489DX. | 


5.1 Interrupts 


The interrupt control function of all 82489DXs are 
collectively responsible for delivering interrupts from 
interrupt sources to interrupt destinations in the mul- 
tiprocossor system. When a processor accepts an 
interrupt, it uses the vector to locate the entry point 
of the handler in its interrupt table. The 82489DX 
architecture allows for 16 possible interrupt priori- 
ties; zero being the lowest priority and 15 being the 
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highest. Priority of interrupt A “is higher than” the 


Priority of interrupt B if servicing A is more urgent — 


than servicing B. An interrupt’s priority is implied by 
its vector; namely priority = vector/16. 


With 256 vectors and 16 different priorities, this im- 


plies that 16 different interrupt vectors can share a 
single interrupt priority. 


TOTAL ALLOWED INTERRUPT VECTORS 


Out of 256 vectors, interrupt vectors 0 to 15 should 


not be used in the 82489DxX. Only 240 interrupt vec- 


tors (vectors from 16 to 255) are supported in the 
82489DxX. 


Processor 
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Figure 4. 1/O Units and Local Units. 
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INTERRUPT SOURCES 


Interrupts are generated by a number of different in- 
terrupt sources in the system. 


Possible interrupt sources are: 


e Externally connected (I/O) devices. Interrupts 
from these external sources manifest themselves 
as edges or levels on interrupt input pins and can 
be redirected to any processor. 


e Locally connected devices. These originate as 
edges or levels on interrupt pins, but they are al 
ways directed to the local processor only. 


¢ 82489DX timer generated interrupts. Like locally 
connected devices, 82489DX timer can only in- 
' terrupt its local processor. 


e Processors. A processor can interrupt any indi- 
vidual processor or sets of processors. This sup- 
ports software self-interrupts, preemptive sched- 
uling, TLB flushing, and interrupt forwarding. A 
processor generates interrupts by writing to the 
interrupt command register in its Local Unit. 


INTERRUPT DESTINATIONS 


I/O Units can only source interrupts whereas Local 
Units can both source and accept interrupts, so 
whenever “interrupt destination’ is discussed, it is 
implied that the Local Unit is the destination of the 
interrupt. In physical mode the destination processor 
is specified by a unique 8-bit 82489DxX local ID. Only 
a single destination or a broadcast to all (LOCAL ID 
of all ones) can be specified in physical destination 
mode. 


In logical mode destinations are specified using a 
32-bit destination field. All Local Units contain a 
32-bit Logical Destination register against which the 
destination field of the interrupt is matched to deter- 
mine if the receiver is being targeted by the interrupt. 


An additional 32-bit Destination Format register in | 


each Local Unit enables the logical mode address- 
ing. 


INTERRUPT DELIVERY 


The description of interrupt delivery makes frequent 
use of the following terms: 


e Each processor has a processor priority that re- 
flects the relative importance of the code the 
processor is currently executing. This code can 
be part of a process or thread, or can be an inter- 
rupt handler. A processor’s priority fluctuates as a 
processor switches threads, a thread or handler 
raises and lowers its priority level to mask out 
interrupt, and the processor enters an interrupt 
handler and returns from an interrupt handler to 
previously interrupted activity. 
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e A processor is lowest priority within a given group 
of processors if its processor priority is the lowest 
of all processors in the group. Note that more 
than one processor can be the lowest priority in a 
given group. 


e A processor is the focus of an interrupt if it is 
currently servicing that interrupt, or if it currently 
has a request pending for the interrupt. 


Interrupt delivery begins with an interrupt source in- 
jecting its interrupt into the interrupt system at one of 
the 82489DxX. Delivery is complete only when the 
servicing processor tells its 82489DX Local Unit it is 
complete by issuing an end-of-interrupt (EOI) com- 
mand to its 82489DX Local Unit. Only then has all 
(relevant) internal state regarding that occurrence of 
the interrupt been erased. The interrupt system 
guarantees exactly-once delivery semantics of inter- 
rupts to the specified destinations. Exactly-once 
guaranteed delivery implies a number of things: 


e The interrupt system never rejects interrupts; it 
never NAKs interrupt injection, interrupts are nev- 
er lost, and the same interrupt (occurrence) is 
never delivered more than once. 


Clearly a single edge interrupt or level interrupt 
counts as a single occurrence of an interrupt. In uni- 
processor systems, an occurrence of an interrupt 
that is already pending (IRR) cannot be distin- 
guished from the previous occurrence. All occur- 
rences are recorded in the same IRR bit. They are 
therefore treated as “the same” interrupt occur- 
rence. 


For lowest-priority delivery mode, by delivering an 
interrupt first to its focus processor (if it currently has 
one), the identical behavior can be achieved in a MP 
(Multiprocessor) system. If an interrupt has a focus 
processor then the interrupt will be delivered to the 
interrupt’s focus processor independent of priority 
information. This means that even if there is a lower 
priority processor compared to the focus processor, 
the interrupt still gets delivered to the focus proces- 
sor. 


Each edge occurring on an edge triggered interrupt 
input pin is clearly a one-shot event; each occur- 
rence of an edge is delivered. An active level on a 
level triggered interrupt input pin represents more of 
a “continuous event’. Repeatedly broadcasting an 
interrupt message while the level is active would 
cause flooding of the ICC bus, and in effect trans- 
mits very little useful information since the same 
processor (the focus) would have to be the target. 


Instead, for level triggered interrupts the 82489DX 
merely recreate the state of the interrupt input pin at 
the destination . The source 82489DX accomplishes 
this by tracking the state of the appropriate destina- 
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tion 82489DX’s Interrupt Request Register (or pend- 
ing bit) and only sending inter-82489DX messages 
when the state of the interrupt input pin and the des- 
tination’s interrupt request enter a disagreement. 
Unlike edge triggered interrupts, when a level inter- 
rupt goes into service, the interrupt request at the 
servicing 82489DX is not automatically removed. If 
the handler of a level sensitive interrupt executes an 
EOI then that interrupt will immediately be raised to 
the processor again, unless the processor has ex- 
plicitly raised its task priority, or the source of that 
interrupt has been removed. 


5.2 Interrupt Redirection 


This section specifically talks about how a processor 
is picked during interrupt delivery. The 82489DX 
supports two modes for selecting the destination 
processor: Fixed arid Lowest Priority. 


e Fixed Dellvery Mode 
In fixed delivery mode, the interrupt is uncondi- 
tionally delivered to all local 82489DXs_ that 
match in the destination information supplied with 


the interrupt. Note that for 1/O device interrupts — 


typically only a single 82489DX would be listed in 
the destination. Priority and focus information are 
ignored. If the priority of a destination processor 
equal to or higher than the priority of the interrupt, 
then the interrupt is held pending locally in the 
destination processor’s Local Unit, until the proc- 
essor priority becomes low enough at which time 
the interrupt is cispensed to the processor. More 
than one processor can be the destination in 
fixed-delivery mode. 


e Lowest Priority Dellvery Mode 

Under the lowest priority delivery mode, the proc- 
essor to handle the interrupt is the one in the 
specified destination with the lowest processor 
priority value. If more than one processor is at the 
lowest priority, then a unique arbitration ID is 
used to break ties. For lowest priority dynamic 
delivery, the interrupt will always be taken by its 
focus processor if it has one. The lowest priority 
delivery methocl assures minimum interruption of 
high priority tasks. Since each Local Unit only 
knows its own processor priority, determining the 
lowest priority processor is dorie by arbitration on 
the.ICC bus. Only one processor can be the des- 
tination in lowest-priority delivery mode. 


| INTER-82489DX (COMMUNICATION | 


All |/O and Local Units communicate during interrupt 
delivery. Interrupt information is exchanged between 
different units on a. dedicated five wire ICC bus in the 
form of broadcast messages. A 82489DX Unit’s 8-bit 
ID is used as its name for the purpose of using the 
ICC bus, and all 82489DX units using one ICC bus 
should be assigned a different |D. The Arbitration 
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ID of the Local Units used to resolve tles during low- 


est priority arbitration is also derived from the Local 
Unit’s ID. 


6.0 82489DX LOCAL UNIT 
REGISTERS DESCRIPTION 


6.1 Local Unit ID Register | 


Each 824891)X Local Unit has a register that con- 
tains the Local Unit’s 8-bit ID. The Local Unit ID 
serves as a physical name of the 82489DX Local 
Unit. It can be used in specifying destination infor- 
mation and is also used for accessing the ICC bus. | 
Eight address lines A[10]-A[3] are sampled on ev- 
ery clock ecige while RESET is asserted. The last 
sample remains in the Local Unit ID register after 
reset. Alternatively, the ID can be loaded with a reg- 
ister write as part of software initialization. The Local. 
Unit ID is read-write by software. 


82489DX Local Unit ID Register 


Bits [31..24] Local Unit ID: The Local Unit ID 
serves as the physical “name” of the 
Local Unit used for addressing the 
82489Dx in physical destination mode 
and for the ICC bus usage. In a sys- 
tem with say four 82489DX, there are 
4 Local Units and 4 |/O Units. All the 8 
units should be assigned different IDs. 
For future compatibility use only IDs 
from 0 to 14. 


Bits [23..0] are Reserved. They 
should be written with 0. 


Bits [23..0] 


6.2 Destination Format Register 


Interrupt Destination can be either addressed physi- 
cally or logically. When the interrupt message ad- 
dresses the destination physically, each 82489DxX in 
the ICC bus compares the address with its own unit 
ID. If the message is a broadcast type then every 
Local Unit accepts the interrupt. 


When the rnessage addresses the destination using 
logical addressing scheme each Local Unit in the 
ICC bus compares the logical address in the inter- 
rupt message with its own Logical Destination Reg- 
ister. If there is a bit match (i.e., if at least one of the 
corresponding pair of bits match) this local unit is 
selected for delivery. 
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All the 32 bits of Destination Format Register of all 
82489DX connected in the ICC bus should be writ- 
ten with “1” to enable the addressing scheme. 


Destination Format Register 
Logical Destination Register 


Bits [31:0] 


For future compatibility, use only bits 31-24 of Logi- 
cal Destination Register. For binary compatibility, it 
is strongly recommended that 82489DX software 
use only 8 MSB of Logical Destination Register. 


6.3 Local Interrupt Vector Table 
Registers 


The Redirection Table serves to steer interrupts that 
originate in the I/O subsystems to the processors. 
The Local Vector Table is its equivalent for inter- 
rupts that are restricted to only the local processor. 
The Local Vector Table contains three 32-bit regis- 
ters. Register 0 corresponds to the timer, registers 1 
and 2 correspond to local interrupt input pins, 
LINTINO and LINTIN1. 


The format of both the Local 0 and Local 1 interrupt 
vector tables are identical. The following register de- 
scription talks about both Local Interrupts 0,1 vec- 
tors. 


Local Interrupts 0, 1 Interrupt Vectors 


Vector: [Bits 7-0] 


This is the vector to use when gener- 
ating an interrupt for this entry. 


Delivery Mode: [Bits 10-8] 
000: Fixed 
001: <reserved> 
010: <reserved> 
011: <reserved> 
100: NMI 
101: <reserved> 


Local INTO Vector Table 


Bits [94171 Bis {10:8) | Bits [7-0] 


Local INT1 Vector Table 
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110: <reserved> 
111: ExtINT 


000: (fixed) means deliver the signal 
on the INT pin of the local proc- 
essor. Trigger mode for “fixed” 
Delivery Mode can be edge or 
level. 


100: (NMI) means deliver the signal 
on the NMI pin of the local proc- 
essor. Vector information is ig- 
nored. A Delivery Mode equal to 
“NMI” requires a “‘level’’ trig- 
gered mode. 


111: (ExtiINTA) means deliver the sig- 
nal to the INT pin of the local 
processor as an interrupt that 
originated in an externally con- 
nected (8259A compatible) in- 
terrupt controller. ExtINTA pin is 
as- serted also. The INTA cycle 
that corresponds to this ExtIN- 
TA delivery, should be routed to 
the external controller that is ex- 
pected to supply the vector. A 
delivery mode of ExtINT re- 
quires an edge trigger mode. 
(See the section on compatiblity 
for more details.) 


Bit 11 is Reserved. It should be writ- 
ten 0. 


Bit 11: 


Delivery Status: [Bit 12] 


This field is software-read only. Soft- 
ware writes to this field (as part of a 

32-bit word) have no effect on this 
bit. Delivery status is a 1-bit field that 
contains the current status of the de- 
livery of this interrupt. Two states are 
defined. 


0: (idle) means that there is currently 
no activity for this interrupt. 


1: (send pending) indicates that the 
interrupt has been injected, but its 
delivery is temporarily held up by 
the recently injected interrupts that 
are in the process of being deliv- 
ered. 


| sisisti7] | atte | sis | atts | era | etre | ett | sistios) | Bits t70) 


Figure 5. Local Vector Table 
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Bit 13 is Reserved. It should be written 
0. 


Bit 13: 


Remote IRR: [Bit 14] 


This bit is used for level triggered local 
interrupts; its meaning is undefined for 
edge triggered interrupts. Remote IRR 
mirrors the interrupt’s IRR bit of this 
local unit. Remote IRR is software 
read-only; software writes to this bit do 
not affect it. 


Trigger Mode: [Bit 15] 
The Trigger Mode field indicates the 
type cf signal on the local interrupt pin 
that triggers an interrupt. 


0: indicates edge senisitive, 
1: indicates level sersitive. 


Only the local interrupt pins can be 
programmed as edge or level trig- 
gered. Timer interrupts are always 
treated as edges. 


MASK: [Bit 16] 
0: enables injection of the interrupt, 
1: masks injection of the interrupt. 


Bits [31:17] Bits [31:17] are Reserved. Should be 
written 0. . 


6.4 Inter-Processor Interrupt 
Registers 


A processor generates inter-processor interrupts by 
writing to the Interrupt Command Register in its 
82489DX Local Unit. Conceptually, this can be 
thought of as the processor proviciing the interrupt’s 
Redirection Table [Entry on the fly. Not surprisingly, 
the layout of the Interrupt Command Register re- 
sembles that of ari entry in the Redirection Table. 
Note that the format of this register allows a proces- 
sor to generate any interrupt. A processor may use 
this to forward device interrupts criginally accepted 
by it to other processors. 


All fields of the Interrupt Command Register are 
read-write by software with the exception of the De- 
livery Status field which is read-only. Writing to the 
32-bit word that contains the interrupt vector causes 
the interrupt messzige to be sent. 


Interrupt Command Register [31:0] 


| Bits [31:20] | Bits [19:18] | Bits [17:16] Bit 12 Bits [10:8] | Bits [7:0] 
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Vector: [Bits 7—0] - 


The vector identifies the interrupt being 
sent. If the Delivery Mode is “Remote 
Read’, then the Vector field contains 
the address of the register to be read in 
the remote 82489DX’s Local Unit. The 
addresses are listed in the section dis- 
cussing 82489DX Local Unit register 
summary. For example, for ID register, 
remote read address of 02 should be 
specified in vector field. 


Delivery Mode: [Bits 10-8] 


The Delivery Mode is a 3-bit field that 
specifies how the 82489DxX listed in the 
destination field (bits 63:32) should act 
upon reception of this signal. Note that 
certain Delivery Modes will only operate 
as intended when used in conjunction 
with a specific Trigger Mode. These re- 
strictions are indicated for each Delivery 
Mode. 


000: (Fixed) means deliver the signal on 
the INT pin of all processors listed 
in the destination. Trigger Mode 
for ‘fixed’ Delivery Mode can be 
edge or level. 


001: (Lowest Priority) means deliver the 
signal on the INT pin of the proc- 
essor that is executing at the lower 
priority among all the processors 
listed in the specified destination; 
Trigger Mode for “lowest priority” 
Delivery Mode can be edge or lev- 
el. 


010: Intel Reserved. Should not be 
used. 


011: (Remote Read) is a request to a 
remote 82489DX Local Unit to 
send the value of one of its regis- 
ters over the ICC bus. The register 
is selected by providing its address 
in the Vector field. The register val- 
ue is latched by the requesting 
82489DX and stored in the Re- 
mote Register where it can be 
read by the local processor. A De- 

- livery Mode of “Remote Read”’ re- 
quires an “edge” Trigger Mode. 
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100: (NMI) means deliver the signal on 
the NMI pin of all processors list- 


ed in the destination, vector infor- 
mation is ignored. A. Delivery 


Mode equal to “NMI” requires a 
“level” Trigger Mode. 


101: (Reset) means deliver the signal 
to all local units listed in the desti- 
nation. The destination local unit 
will assert/deassert its PRST out- 
put pin. All addressed 82489DX 
Local Units will assume their re- 
set state but preserve their ID. 
One side effect of an ICC bus 
message with Delivery Mode 
equal to “Reset” that results in a 
deassert of reset is that all Local 
Units (whether listed in the desti- 
nation or not) will reset their low- 
est-priority tie breaker arbitration 
ID to their Local Unit ID (see the 
section on the ICC bus for de- 
tails). A Delivery Mode of “Re- 
set” requires a ‘level’ Trigger 
Mode. “RESET” should not be 
used with “self” or “all incl self’’ 
Shorthand mode since it will 
leave the system in non-recover- 
able reset state. If “RESET” is 
used with ‘‘all excl self’? mode 
software should make sure that 
only one CPU executes this in- 
struction in a MP system. 


110: Intel Reserved. Should not be 
used. 


111: Intel Reserved. Should not be 
used 


Destination Mode: [Bit 11] 


This field determines the interpretation 
of the Destination field. 


0: (Physical Mode): in Physical Mode, 
a destination 82489DX is identified 
by its Local Unit ID. Bits 56 through 
63 (8 MSB of the destination field) 
specify the 8-bit 82489DX Local 
Unit ID. 


1: (Logical Mode): in Logical Mode, 
destinations are identified by match- 
ing on Logical Destination under the 
control of the Destination Format 
Register in each Local 82489DX. 
The 32-bit Destination field is the 
logical destination. 
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Delivery Status: [Bit 12] 


Bit 13: 


Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 


0: (Idle) means that there is currently 
no activity for this interrupt; 


1: (Send Pending) indicates that the 
interrupt has been injected, but its 
delivery is temporarily held up by 
other recently injected interrupts 
that are in the process of being de- 
livered; 


Delivery Status is software read-only; 
software writes to this field (as part of 
a 32-bit word) do not affect this bit. 
Software can read to find out if the 
current interrupt has been sent, and 
the Interrupt Command Register is 
available to send the next interrupt. If 
the Interrupt Command Register is ov- 
erwritten before the Delivery Status is 
“Idle”, then the destiny of that inter- 
rupt is undefined; i.e., the interrupt may 
have been lost. 


Bit 13 is Reserved . Should be written 
0. 


Level: [Bit 14] 


Software can use this bit in conjunc- 
tion with the Trigger Mode bit when is- 
suing an inter-processor interrupt to 
simulate assertion/deassertion of lev- 
el sensitive interrupts. 


To assert: Trigger mode = 1 and Lev- 
el = 1. 


To deassert: Trigger mode = 1 and 
Level = 0. 


For example, a message with Delivery 
Mode of “Reset’’, a Trigger Mode of 
“Level’’, and Level bit of 0 deasserts 
Reset to the processor of. the ad- 
dressed 82489 DX Local Unit(s). As a 
side effect, this will also cause all 
82489DxX to reset their Arbitration ID 
to their unit ID. (The Arb ID is used for 
tie breaking in lowest priority arbitra- 
tion.) | 


Trigger Mode: [Bit 15] 


Software can use this in conjunction 
with Level Assert/Deassert to gener- 
ate interrupts that behave as edges or 
levels. 


0: Edge 
1: Level 
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Remote Read Status: [Bits 17,16] 


This field indicates the status of the 
data contained in the Remote Read 
register. This field is read-only to soft- 
ware. Whenever software writes to the 
Interrupt Command Register using De- 
livery Mode “Remote Read” the Re- 
mote Read status becomes “in-prog- 
ress” (waiting for the remote data to ar- 
rive). The remote 82489DxX Local Unit is 
expected to respond in a fixed amount 


of ICC bus cycles. If the remote 


824891)X Local Unit is unable to do so, 
then the Remote Read status becomes 
“invalid”. lf successful, the Remote 
Read status resolves to “Valid’’. Soft- 
ware should poll this field to determine 
completion and success of the Remote 
Read command. 


00: (invalid): the content of the Remote 
Read Register is invalid. This is the 
case after a Remote Read com- 
mand issued and the remote 
82489DX Local Unit was unable to 
deliver the Register content in time. 


01: (in progress): a Remote Read com- 
mand has been issued and this 
82489DX is waiting for the data to 
arrive from the rernote 82489DX Lo- 
cal Unit. 


10: (valid): the most recent Remote 
Read command has completed and 
the remote register content in the 
Remote Read Register is valid. 


11: reserved. 


Destination Shorthand: [Bits 19,18] 
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This field indicates whether a shorthand 


- notation is used to specify the destina- 


tion of the interrupt and if so, which 
shorthand is used. Destination Short- 
hands do no use the 32-bit Destination 
field, and can be ser by software with 
a single 32-bit write to the 82489DX’s 
Interrupt Command Register. Short- 
hands are defined for the following 
common cases: software self interrupt, 
interrupt to all processors in the system 
including the sender, interrupts to all 
processors in the system excluding the 
sender. 


00: (dest field): means that no short- 
hand is used. The destination is 
specified in the 32-bit Destination 
field in the secorid word (bits 32 to 
63) of the Interrupt Command Reg- 
ister. 


a 
intel. 
01: (self); means that the current 
82489DX Local Unit is the single 
destination of the interrupt. This is 
useful for software interrupts. The 
Destination field in the Interrupt 
Command Register is ignored. RE- 
SET Delivery mode should not be 
used with self destination: Only 
FIXED delivery mode should be 
used with SELF. 


10: (all incl self): means that the inter- 
rupt is to be sent to “all” proces- 
sors in the system including the 
processor sending the _ interrupt. 
The 82489DxX will broadcast a mes- 
sage with destination unit ID field 
set to all ones. RESET assert Deliv- 
ery mode should not be used with 
“all incl self” destination. 


11: (all excl self): means that the inter- 
rupt is to be sent to “all” proces- 
sors in the system with the excep- 
tion of the processor sending the 
interrupt. The 82489DX will broad- 
cast a message with destination 
unit ID field set to all ones. All-excl- 
self is useful during selection of a 
boot processor (init) and also for 
TLB flush where ‘self’ is flushed 
using the processor flush instruc- 
tion. Only one CPU in a MP system 
should execute “all exci self’ desti- 
nation if used with RESET Delivery 
mode. 


Bits [31:20] Bits [31:20] are Reserved. They should 
be written 0. 


Interrupt Command Register [63:32] 
| Bits [63:32] | 


Destination: [Bits 63-32] 


This field is only used when the Desti- 
nation Shorthand is set to ‘‘Dest Field’. 
lf Destination Mode is Physical Mode, 
then the 8 MSB contain a Destination 
unit ID. If Logical Mode, the full 32-bit 
Destination field contains the logical ad- 
dress. The enabling is done by Destina- 
tion Format Register. 


6.5 IRR, ISR, TMR Registers 


INTERRUPT ACCEPTANCE 


All 82489DX Local Units listen to all messages sent 
over the ICC bus. For each message, the local unit © 
first checks if it belongs to the destination in the 
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message. It does this by matching the 32-bit Desti- 
nation field in the message against its logical Desti- 
nation Register, if the message addresses in logical 
mode, and against its physical ID, if the message 
addresses in physical mode. All 82489DX Local 
Units that match are said to “belong to the group”’. 


Each 82489DX Local Unit contains three 256-bit 
registers that play a role in the acceptance of inter- 
rupts and in dispensing accepted interrupts to the 
local processor. Each of these registers is a bit array 
where bit position | tracks information about the in- 
terrupt with vector i. These bits track information 
about the (PINT) maskable interrupts only. They are 
not relevant for NMI, RESET or ExtiNT type of inter- 
rupts. The Interrupt Request Register (IRR) contains 
the interrupts accepted by this 82489DX Local Unit 
but not yet dispensed to the processor. The In Serv- 
ice Register (ISR) contains the interrupts that are 
currently in service by the processor, i.e., the inter- 
rupts that have been dispensed to the processor but 
for which the processor has not yet signaled the 
End-Of-interrupt. 


Note that the 82489DX’s IRR and ISR registers have 
the same meaning and operation as in the 8259A in 
fully nested/non-specific EOI mode. Note also that 
these registers play no role in providing 8259A com- 
patibility. Compatibility is handled by making an ex- 


Device B 


INTIN 


end "level assert” 


82489DX 


ternal 8259A-style controller directly visible to the 
processor and having the 82489DX become trans- 
parent. 


Each interrupt has a vector associated with it, which 
determines the bit position, and hence the priority for 
the interrupt. When an interrupt is being serviced, all 
equal or lower priority interrupts are automatically 
masked by the 82489DX Local Unit. 


The Trigger Mode Register (TMR) indicates for each 
interrupt whether the interrupt is edge or level. This 
information is transmitted with each 82489DxX inter- 
rupt request message and reflects the Trigger Mode 
bit in the interrupt’s Redirection Table entry. If an 
interrupt goes in service and the TMR bit is 0 (edge), 
then the interrupt’s IRR bit is cleared at the same 
time the ISR bit is set. If the TMR bit is 1 (level), then 
the IRR bit is not cleared when the interrupt goes in 
service. In the latter case, the IRR bit mirrors the 
state of the interrupt’s input pin. 


The following diagram shows 82489DX operation 
with devices A and B sharing a level triggered inter- 
rupt input. The diagram illustrates how Remote IRR, 
and the IRR bit at the destination 82489DX track the 
state of INTIN. It also illustrates how an EOI is fol- 
lowed immediately be re-raising the interrupt as long 
as the INTIN is still asserted by some device. 


/ \ XOR of INTIN and Remote IRR / \ 
. io "level deassert” =~ 


message 


Remote IRR 


IRR at dest Unit 


INT/EOI 


message 
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Figure 6. Interrupt Sharing 
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ISR, IRR, and TMR are read-only by software. Each 
of these 256-bit registers is accessed as four sepa- 
rate 32-bit registers. Note that there is no general 
Interrupt Mask Register (IMR) as in the 8259A. The 
processor masks interrupts temporarily by writing to 


the Icoal unit’s Task Priority Register (described 


shortly). 


ISR [Interrupt Status Register] 

| Bits [255:0] | i 

IRR [Interrupt Request Register] 
| Bits [255:0] | 
TMR [Trigger Mode Register] 


| Bits [255:0] | 


Figure 7. ISR, IRR, and TMR 


TMR (Trigger Mode Register): 


lf O [edge triggered] the corresponding 
IRR bit is automatically cleared when 
interrupt service starts. If 1[level trig- 
gered] this is not the case; instead, the 
source 82489DX must explicitly request 
the IRF? bit be cleared (upon deassert 
of the interrupt input pin or upon send- 
ing an appropriate interprocessor inter- 
rupt). Upon acceptance of an interrupt, 
the TMR bit is cleared for edge trig- 
gered interrupts and set for level trig- 
gered interrupts. This information is car- 
ried in the accepted iriterrupt message. 
The source 82489DX 1!/O unit also 
tracks the state of the destination unit’s 
IRR bit (Remote IRR bit in the Redirec- 
tion Table). When a level triggered in- 


_ terrupt input is deasserted, the source . 


824891)X I/O unit detects the discrep- 
ancy between the input pin state and 
the Remote IRR, and automatically 
sends a message telling the destination 
824891)X to clear IRR for the interrupt. 
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IRR (Interrupt Request Register): 


It contains the active interrupt requests 
that have been accepted, but not yet 
dispensed by this 82489DX Local Unit. 
A bit in IRR is set when the 82489DX 
Local Unit accepts the interrupt. When 
TMR is 0, it is cleared when the inter- 
rupt is serviced; when TMR is 1, it is 
cleared when the 82489DX Local Unit 
receives a message to clear it. 


ISR (In Service Register): 


lt marks the interrupts that have been 
delivered to the processor, but that 
have not been fully serviced in that an 
End-Of-Interrupt has not yet been re- 
ceived. The ISR register reflects the 
current state of the processor’s inter- 
rupt stack. 


ACCEPTANCE MECHANISM 


Interrupt acceptance proceeds as follows. If the de- 
livery mode is Fixed, then each unit in the destina- 
tion group uriconditionally accepts the interrupt mes- 
sage and sets the interrupt’s IRR bit. If the delivery 
mode is Lowest Priority, then each processor in the 


group first checks if it is currently the focus of the 


interrupt by checking its ISR and IRR. If an 82489DX 
finds one of these bits set for the incoming interrupt, 
then that 82489DX Local Unit accepts the interrupt 
independent of priority, and “signals” the other 
82489DX Local Units to abort the priority arbitration. 
This avoids multiple delivery of a same interrupt oc- 


_ currence to different processors, consistent with in- 


terrupt delivery semantics in uniprocessor systems 
as described above. If a message is to be delivered 
for NMI or Reset, then all 82489DX Local Units list- 
ed in the destination unconditionally assert/deassert 
the corresponding output pin. ISR, IRR, etc. are by- 
passed for NMI or reset and vector information is 
undefined. 
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The acceptance decision process is illustrated in the flow chart below.. 


Wait to receive 
ICC message 
; o” Belong to * 
Discard Message e-! Hest nation: Js 
Delivery Mode Set Interrupt's IRR 


Lowest Pty 


_ Arbitrate 


Am | 
the Winner 


Figure 8. Interrupt Acceptance Flow Chart 
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6.6 Tracking Processor Priority 


Each 82489DX Local Unit should be programmed 
with task priority so that it can mask interrupts 
that are less priority than that of the processor tem- 
porarily. 


Task switching and task priority changes are the re- 
sult of explicit software action. The operating system 
may define a number of task scheduling classes. Ex- 
amples are an idle class, a background class, a fore- 
ground class, and a time critical class. Alternatively, 
different classes can be assigned to user code 
versus system code. If tasks in different classes are 
executing when an iriterrupt comes in, then it may be 
advantageous to interrupt the processor currently 
running the task in the least important class. Clearly, 
if one processor is idle while others; are doing work, 
the idle processor would be the obvious target for 
servicing the interrupt. This implies that there is use 
in defining priority levels below all interrupt levels 


that can participate in lowest priority delivery selec- 


tion. 


At times, the operating system may need to block 
out interrupts from being serviced. For example, to 
synchronize access to a shared data structure be- 
tween a device driver and its interrupt handler the 
driver raises it priority to equal or higher than the 
interrupt’s priority. — 


The local 82489DX supports this via its Task Priority 
Register (the 8259A supports this via the interrupt 
mask register (IMR)}.) Software thet wants to make 
use of this is required to inform its 82489DX Local 
- Unit of the prioity change by updating the Task Prior- 
ity Register. The Task Priority field is 8 bits providing 
up to 256 distinct priorities. The 4 MSB of this regis- 
ter correspond to the 16 interrupt priorities while the 


4 LSB provide more precision. Priorities are best 


noted as x:y, where x is the value of the 4 MSB and y 
is the value of the 4 LSB. For example, Task Priority 
Register values 0:y with O < y < ‘5 (and 0 in the 4 
MSB) can be used to represent the priorities of the 
task scheduling classes described above (y = 0 for 
idle; y = 1 for background; etc.). Except for inter- 
rupts with vectors 0 through 15 (which are often pre- 
defined by the processor) which ali have priority 0:0, 
the priorities of all other interrupts and their handlers 
is x:0 with 1 < x < 15 and is above the base task 
priorities Ory. 


For example, interrupt vector 123 has priority 7:0 


(123/16 = 7) and can be masked by any task that 
raises its priority to a value equal or higher than 7:0. 
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82489DX uses Task priority register for the purpose 
of masking the interrupts. The task priority register 
should be programmed with a priority value to speci- 
fiy the priority of task the processor is executing. 
82489DX masks any interrupts of lower or equal pri- 
ority when compared with task priority. 


When task priority register is programmed with the 
priority 15, all the interrupts are masked. When task 
priority register is programmed with priority level X, 
by definition, all the interrupts of priority X and below 
X will be masked. When task priority register is pro- 
grammed with the priority 0 then all the interrupts 
above priority 0 are allowed to interrupt the proces- 
sor. This means that when task priority register is 
programmed even with the lowest value, i.e., 0, inter- 
rupts of priority 0 will be masked. So only 240 inter- 
rupt vectors should be used in 82489Dx. Interrupt 
vecotrs from 0 to 15 should not be used. 


The first priority value computed is the maximum of: 
e Task Priority (4msb : 4lsb) and 


e the priority of the highest order ISR bit set ((vec- 
tor/16) :0). 


The value is used to determine whether or not a 
pending interrupt can be dispensed to the proces- 
sor. 


The second priority value computed is the maximum 
of: 
e Task Priority (4msb : 4isb), and 


e the priority of the highest order ISR bit set ((vec- 
tor/16) :0), and | 


e the priority of the highest order IRR bit set ((vec- 
tor/16) :0). 


This value is: used during arbitration as part of low- 
est-priority delivery. 


Task Priority Register 
| Bits [91:8 


From the information in the Task Priority Register 
and the priority information derived from the ISR and 
IRR register, the 82489DX Local Unit computes two 
additional priority values: 


Bits [31:8] Bits [31:8] are Reserved. They should be 
written 0. 


Bits [7:0] Task Priority 


Bits [7:0] are used to specify the task pri- 
ority. 
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6.7 Dispensing Interrupts 


DISPENSING INTERRUPTS TO THE LOCAL 
PROCESSOR 


Once a 82489DX Local Unit accepts an interrupt, it 
guarantees delivery of the interrupt to its local proc- 
essor. (This part of the 82489DX functions similarly 
to an 8259A.) Dispensing a maskable interrupt to the 
local processor begins when the Local Unit asserts 
the INT pin of its processor. If the processor has 
interrupts enabled, it will respond by issuing an INTA 
cycle. This causes the Local Unit to freeze its inter- 
nal priority state and release the 8-bit vector of the 
highest priority interrupt on the data bus where it is 
read by the processor and used to find the handler’s 
entry point. The INT/INTA protocol also causes the 
interrupt’s ISR bit to be set. The corresponding bit in 
the IRR register is only cleared if the TMR register 
indicates it should do so (edge triggered interrupts), 
otherwise (level triggered interrupts), IRR is only 
cleared when the Interrupt Input Pin is deasserted. 


6.8 Spurious Interrupt Vector Register 


SPURIOUS INTERRUPT | 


Note that it can happen that a level-triggered inter- 
rupt is deasserted right before its INTA cycle. In that 
case, all IRR bits may be clear and the prioritizer 
may not find a vector to give to the processor. To 
satisfy the processor's demand for a vector, instead, 
the 82489DX will return a spurious interrupt vector 
instead. | 


A similar situation may occur when the processor 
raises its Task Priority at or above the level of the 
interrupt for which the Processor INT pin is currently 
being asserted. When the INTA cycle is issued, the 
interrupt that was to be dispensed has become 
masked (masked but remembered). 


Dispensing the spurious interrupt vector does not af- 
fect the ISR register, so the handler for this vector 
should just return without EOI. If the vector is shared 
with a valid interrupt, then the handler can read the 
vector’s bit in the ISR register to check if it is in- 
voked for the valid interrupt (ISR bit set) or not (ISR 
bit clear). Given the range of 240 vectors, overload- 
ing the spurious interrupt with a valid interrupt is not 
expected to be common practice. The spurious in- 
terrupt vector to be used by a Local Unit is program- 
mable via the Spurious Interrupt Vector Register. 


UNIT ENABLE 
It is possible that Local Units exist in the system that 


do not have a processor to which to dispense inter- 
rupts. The only danger this represents in the system 
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is that if any interrupt is broadcast to all processors 
using lowest priority delivery mode when all proces- 
sors are at the lowest priority, there is a chance that 
a Local Unit without the processor may accept the 
interrupt if this Local Unit happens to have the low- 
est Arb ID at the time. To prevent this from happen- 
ing, all Local Units initialize in the disabled state and 
must be explicitly enabled before they can either 
start accepting or transmitting messages from the 
ICC bus. A disabled 82489DX Local Unit only re- 
sponds to messages with Delivery Modes set to 
“Reset”. Reset deassert messages should be sent 
in Physical Destination mode using the target’s Lo- 
cal Unit ID since the logical destination information 
in the Icoal units is undefined (all zeroes) when the 
82489DX comes out of Reset. | 


Bits [31:9] Bits [7:0] 


Figure 9. Spurious Interrupt Vector Register 
Bits [31 .. 9] Reserved Bits. Should be written 0. 


Unit Enable: [Bit 8] 


0: When a 0 is written to this bit, this 
Local Unit gets disabled with regard 
to responding to messages sent as 
well as transmitting on the ICC bus. It 
only responds to messages with De- 

_livery Mode set to ‘Reset’. Reading 
a 0 at this bit indicates that the unit is 
disabled. 


1: When a 1 is written to this bit, the 
current Local Unit is enabled for both 
transmitting and receiving unit mes- 
sages. Reading a 1 at this bit indi- 
cates that the unit is enabled. 


Spurious Vector: [Bits 7-0] 


For future compatibility, bits [38-0] 
should be 1111. 


6.9 End-of-interrupt (EOI) Register 


Before returning from the interrupt handler, software 
must issue an End-Of-Interrupt (EOI) command to 
the 82489DX Local Unit. The data written to EOI 
register is don’t care. This tells the 82489DxX to clear 
the highest priority bit in the ISR register since the 
interrupt is no longer in service. Upon EOI, 82489DX 
goes through prioritization returning to the next high- 
est priority activity. This can be a previously inter- 
rupted handler (from ISR), a pending interrupt re- 
quest (from IRR), or an interrupted task (from Task 
Priority). 


Bits [31:0] 


Figure 10. EOI Register 
Bits [81:0]: are don’t care. 
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6.10 Remote Fiead Register 


Since all 82489DxX l_ocal Units would typically occu- 
py the same address range, an 824.89DxX local unit’s 
registers can only be accessed by the local proces- 
sor. From a system debugging point of view, this 
would mean that a large amount of state would be- 
come inaccessible if its corresponding processor 
hangs for whatever reason. To assist in the debug- 
ging of MP systems, the 82489DX support a mecha- 
nism that provides read-only access to any register 
in any other 82489DxX Icoal unit in the system. 


To read any register in a “remote’’ 82489DX Local 
Unit, the processor writes to the Interrupt Command 


Register specifying a Delivery Mode equal to ‘‘Re-- 


mote Read’’. The remote 82489D)X is specified in 
the Destination fielc! of the Interrupt Command Reg- 
ister in the usual fashion. Debug software would 
make sure that this selects a single 82489DX only— 
for example by using the target’s 82489DX Local 
Unit ID in physical destination mode. Since no vector 
is associated with remote register access, the Vec- 
tor field in the Interrupt Command Register is used 
to select the indivicual remote 32-bit register to be 
read. The selector value corresponds to the address 
(offset) of the register in the local 82489DX’s ad- 
dress space. Sending a “Remote Read” command 
results in sending € message on the ICC bus. The 
destination 82489DX responds by placing the 32-bit 
content of the selected register on the ICC bus. This 
value is read by sending the 82489DxX and place it in 
the Remote Register where software can get at it 
using regualr register access to its 82489DX Local 
Unit. The Remote Register is software read-only. 
The contents of the Remote Register is valid when 
the Delivery Status in the Interrupt Command Regis- 
ter has become “Idle” again. 


Remote Read Register 


| Bits [31:0] | 


Figure 11. Remote Register 


Bits [31:0] Bits [31:0] contain the contents of Re- 
mote Read Register. 


6.11 82489DX Local Configuration 


LOCAL VERSION REGISTER | 


Each 82489DX Local Unit contains a hardware Ver- 
sion Register that identifies this 82489DX Local Unit 
version. This register is read only. 
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Local Version Register | 


| Bits (91:8) 


Figure 12. Local Version Register 


Version: [Bits 7-0] 


This is a version number that identifies 
this version. This field is hardwired and 
is read-only. Will be read as ‘1’ for 
82489DxX. 


Bits [31:8] Bits 31:8 are reserved. 


6.12 82489DX Timer Registers 


Overview 


82489DX Local Unit contains one 32-bit wide pro- 
grammable binary timer for use by the local proces- 
sor. The timer can select its clock base from one of 
three possible clock inputs. A timer mode can be 
programmed to operate in either one-shot mode or 
periodic mocle. The timer can be configured to inter- 
rupt the local processor with a vector. 


Time Base 


The 82489DX has two independent clock input pins: 


1. The CLK pin provides the clock signal that drives 
the 82489)DX’s internal operation. 


2. The TMBASE pin allows an independent clock 
signal to be connected to the 82489DxX for use by 
the timer functions. 


Signals from both CLK and TMBASE can be used as 
clock inputs that feed the timer. In addition, the 
82489DX contains a divider that can be configured 
to divide either input clock signal. The divider can be 
programmec! to divide the selected input clock by 2, 
4, 8, or 16. CLK, TMBASE, and the output of the 
divider together provide three time bases: Base 0, 
Base 1, and Base 2. Base 0 is always equal to CLK: 
Base 1 is always equal to TMBASE; and base 2 is 
one of; CLK/2, CLK/4, CLK/8, CLK/16, TMBASE/2, 
TMBASE/4, TMBASE/8, or TMBASE/16. The timer 


can indeperidently select one of these three time 


bases as its clock input as depicted in the following 
diagram. 


CLK = Base 0 


Base 2 


DIVIDE BY 
2, 4, 8, 16 
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Figure 13. Time Bases 
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Figure 14. Divider Configuration Register 


Bits [31:3] Bits 31 to 3 are reserved. They should 
be written 0. 


Divider Input: 


[Bit 2] Selects whether divider’s input 
connects to the 82489DX Local Unit’s 
CLK pin or TMBASE pin. 


0: means the divider takes its input sig- 
nal from CLK, 


1: means use TMBASE. 


Divide By: [Bits 1,0] 


This field selects by how much the divid- 
er divides. 


00: divide by 2 
01: divide by 4 
10: divide by 8 
11: divide by 16 


Timer 


Software starts a timer going by programming its Ini- 
tial Count Register. The timer copies this value into 
the Current Count Register and starts counting down 
at the rate of one count for each time base pulse. 
The time is one of Base 0, Base 1, or Base 2. 


The timer has a programmable mode which can be 
One-Shot or Periodic. After the timer reaches zero in 
One-Shot mode, the timer simply stays at zero until it 
is reprogrammed. In Periodic mode, the timer auto- 
matically reloads its Current Count from the Initial 
Count and starts counting down again. 


For the timer, interrupt generation can be disabled or 
enabled, and an arbitrary interrupt vector can be 
specified. When enabled and the timer reaches 
zero, an interrupt is generated at the 82489DX Local 
Unit. Timer generated interrupts are always treated 
as edges. They can only generate maskable inter- 
rupts to the local processor. 


TIMER VECTOR TABLE 


Bits [31:20] | Bits [19:18] Bits [15:13] Bits [11:8] | Bits 7:0] 


82489DX 


A timer set up with its interrupt masked is useful as a 
time base that can be sampled by the local proces- 


_ sor by reading the Current Count Register, for the 


purpose of measuring the intervals. By mapping the 
82489DX’s register space into a read-only user 
page, safe and efficient performance monitoring of 
user programs can be supported. 


If necessary, software may want to ensure that peri- 
odic timer interrupts on the different 82489DX Local 
Units are staggered such that the 82489DXs don’t 
all deliver their interrupt (e.g., a timer slice interrupt) 
to their local processor at the same time. This stag- 
gering avoids bursts of contention for shared re- 
sources (bus, cache lines, dispatch queue, locks). 
Randomness occurring “naturally’’ may be sufficient 
to ensure staggering. 


Initial Count Register 


Bits [31:0] Initial Count 


Current Count Register 


Bits [31:0] Current Count 


Figure 15. Initial Count and 
Current Count Registers 


Software writes to this register to set 
the initial count for timer. This regis- 
ter can be written at any time. When 
written, its value is copied to the Cur- 
rent Count Register and countdown 
starts or continues from there. The 
Initial Count Register is read-write by 
software. | 


Current Count: This is the current count of timer. It is 
read-only by software and can be 
read at any time. 


Initial Count: 


The timer is configured via its Local Vector Table 
entry shown below (see also Interrupt Control in this 
section). 


Vector: [Bits 7-0] 


This is the 8-bit interrupt vector to be 
used when timer generates an inter- 
rupt. 


Figure 16. Local Vector Table: Timer Entry 
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Bits 11-8 Reserved. Should be written 0. 


paves Status: [Bit 12] 


Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 


0: (Idle). means that there is currently 
no activity for this interrupt; 


1: (Send Pending) indicates that the in- 
terrupt has been injected, but its de- 
livery is temporarily held up by other 
recently injected interrupts that are 
in the process of being delivered; 
Delivery Status is software read- 
only; software writes to this field (as 
part of a 32-bit word) do not affect 
this bit. 


Bits 15-13: Reserved. Should be written 0. 


MASK: [Bit 16] 


This bit serves to nee timer interrupt 
generation. 


0: means not masked, when _ timer 
reaches 0, it generates an interrupt 
with vector at the 82489DX Local 
Unit 

1: means masked, and no interrupt is 
generated. 

Timer Mode: [Bits 17] 

This field indicates the operation mode 

of timer. 

0: (One-Shot): the Current Count Reg- 
ister remains at zero after the timer 
reaches zero, and software needs to 
reassign the timer’s Initial Count 
Register to rearm the timer. 

1: (Periodic): when the timer reaches 
zero, the Current Count Register is 
automatically reloaded with the val- 
ue in the Initial Count Register, and 
the timer counts down again. 

Timer Base: [Bits 19,18] 

This field selects the time base input to 

be used by timer. 

00: (Base 0) uses “CLKIN”’ as input; 

01: (Base 1) uses “TMBASE”; 

10: (Base 2) uses the output of the di- 

vider (Base 2). 


Bits [31:20] Bits [31:20] are Reserved. Should be 
written 0. 
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7.0 82489DX I/O UNIT REGISTERS 


REGISTERS ADDRESSING SCHEME 


The 1/O Unit indirect addressing scheme uses two 


“registers directly mapped into the processor’s ad- 


dress space: the I/O Register Select register and 
the I/O Winclow register. The |/O register select reg- 
ister selects which I/O unit Register appears in the 
1/O Window register where it can be manipulated by 
software. 


I/O Register Select Register 


[Bis 94:8 


Figure 17. 1/O Register Select Register 


Bits [31:8]: Reserved. Should be written 0. 


Bits [7:0]: 1/O REGISTER SELECT: This register 
selects an 82489DX |/O unit register. 
The contents of the selected 32-bit reg- 
ister can be manipulated via the I/O 
Window Register. The |/O Register Se- 
lect register is read-write by software. 


1/O Window Register 


= Bits [31:0] | 


Figure 18. 1/O Window Register 


Bits [31:0] 1/O WINDOW REGISTER: This register 
is mapped onto the I/O Unit’s register 
selected by the I/O Register Select reg- 
ister. Readability/writability by software 
is determined by the I/O unit register 
that is currently selected. 


The addresses (offsets to a platform-de- 

fined base address) of all registers are 
listed in the register summary section. 
Note that register offsets are aligned on 
‘128-bit boundaries; in other words, regis- 
ters are located only at every fourth 
32-bit address. This eliminates the need 
for lane-steering glue logic when con- 
necting the 82489DX’s 32-bit data bus to 
a wider (64-bit and 128-bit) bus. 
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82489DX I/O UNIT CONFIGURATION 


I/O Unit ID Register 


Each 82489DxX I/O Unit has a register that contains 
the I/O Unit's 8-bit ID. The I/O unit ID serves as a 
physical name of the 82489DX I/O Unit. It is used in 
arbitrating for ICC bus ownership when the I/O unit 
wants to access the ICC bus for sending any inter- 
rupt message. Unlike the local unit ID, the I/O unit ID 
is not latched-in from the address bus during hard- 
ware reset. The I/O unit ID is set to 0 during reset. 
The software has to write different ID into the I/O 
Units before starting interrupt messages on the ICC 
bus. 


I/O Unit ID 


Bits [31:24] Bits [23:0] 


Bits [31:24] 1/O Unit ID: 


The I/O unit ID serves as the physical _ 


‘“‘name” of the 82489DX unit used for 


arbitration purposes for the ICC bus us- | 


age. In a system with, say, four 
82489Dx, there are 4 Local Units and 4 
1/O Units. All the 8 units should be as- 
signed different ID. The IDs should start 
with 0 and each unit should have differ- 
ent ID. 


Bits [23:0] Bits 23..0 are reserved. Should be writ- 
ten 0. 


1/O Unit Version Register 


Each 82489DxX |/O Unit contains a hardware Ver- 
sion Register that identifies this 82489DX I/O unit 
version. This register is read only. 


I/O Unit Version Register 


Version: [Bits 7-0] 


This is a version number that identifies 
this version. This field is hardwired and 
is read-only. Will be read as “1” for 
82489DxX. 


Bits [15:8] Bits [15:8] are reserved. 


82489DX 


Max Redir Entry: [Bits 23-16] 


This is the entry number (0 being the 
lowest entry) of the highest entry in the 
Redirection Table. It is equal to the 
number of Interrupt Input Pins minus 
one of this |/O Unit. This field is hard- 
wired and is read-only. 


In the 82489DxX I/O unit this is read as 
15. | 


Bits [31:24] Bits [31:24] are reserved. | 


I/O UNIT INTERRUPT SOURCE REGISTERS 


Redirection Tables 


The Redirection Table has a dedicated entry for 
each interrupt input pin. Unlike IRQ pins of the 
8259A, the notion of interrupt priority is completely 
unrelated to the position of the physical interrupt in- 
put pin on the 82489DxX. Instead, software can de- 
cide for each pin individually what it wants the vector 
(and therefore the priority) of the corresponding in- 
terrupt to be. For each individual pin, the operating 
system can also specify whether the interrupt is sig- 
naled as edges or levels, as well as the destination 
and delivery mode of the interrupt. The information 
in the Redirection Table is used to translate the in- 
terrupt manifestation on the corresponding interrupt 
pin into an inter-82489DX message. | 


In order for a signal on an edge-sensitive Interrupt 
Input pin to be recognized as a valid edge ( and not 
a glitch) the input level on the pin must remain as- 
serted until the time 82489DX I/O Unit sends the 
corresponding message over the ICC bus. Only then 
will the source 82489DX be able to recognize a new 
edge on that Interrupt Input pin. That new edge will 
only result in a new invocation of the handler if its 
acceptance by the destination 82489DX causes the 
Interrupt Request Register bit to go from 0 to 1. (In 
other words, if the interrupt wasn’t already pending 
at the destination.) 


82489DX I/O unit has 16 Redirection Table entries. 
The layout of an entry in the Redirection Table is as 
follows: : 


Redirection Table Entry 
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Vector (Bits [7:0] 
Interrupt vector for this interrupt 


Delivery Mode (Bits [10:8]) 

| | 000: Fixed 
001: Lowest Priority 
010: <reserved> 
011: <reserved> 
100: NMI 
101: Reset 
110: <reserved> 
111: ExtINT 


Destination Mode (Bit 11) 
0: Phiysical . 
1: Logical 
Delivery Status (Bit 12) 
| 0: Idle 
1: Send Pending 
Bit 13 _ Bit 13: Reserved. Should be written 0. 
- Remote IRR (Bit 14) | 
Reflects the Remote IRR bit 
0: Remote IRR bit is clear. 
, 1: Remote IRR bit is set. 
Trigger Mode (Bit 15) 
_ 0: Edge 
| 1: Level — 
Mask (Bit 16) | 
| 0: Not Masked 
1: Masked 
Bits [31:17] Reserved. Should be written 0. 


DESCRIPTIONS 


Vector: [Bits 7-0] 


The vector field is an 8-bit field con- 
taining the interrupt for this interrupt. 


Delivery Mode: [Bits 10-8] 


The Delivery Mode is a 3-bit field that 
specifies how the 32489DxXs listed in 
the destination field should act upon 
reception of this signal. Note that 
remote read is not supported for I/O 
device interrupts. Note that certain 
Delivery Modes will only operate as in- 
tended when used in conjuction witha 
specific Trigger Mode. These restric- 
tions are indicatec| for each Delivery 
Mode. 
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000: (Fixed) means deliver the signal 
on the INT pin of all processors 
listed in the destination. Trigger 


Mode for ‘‘fixed’” Delivery Mode 
can be edge or level. 


001: (Lowest Priority) means deliver 
the signal on the INT pin of the 
processor that is executing at the 
lower priority among all the proc- 
essors listed in the specified des- 
tination; Trigger Mode for ‘‘lowest 
priority” Delivery Mode can be 
edge or level. | 


100: (NMI) means deliver the signal on 
the NMI pin of all processors list- 
ed in the destination, vector infor- 
mation is ignored. A Delivery 
Mode equal to “NMI” requires a 
“level” Trigger Mode. 


101: (Reset) means deliver the signal 
to all processors listed in the des- 
tination by asserting/deasserting 
the 82489DX’s Reset output pin. 
All addressed 82489DXs’ Local 
Units will assume their reset state 
but preserve their unit ID. One 
side effect of a unit message with 
Delivery Mode equal to “Reset” 
that results in a deassert of reset 
is that all 82489DXs’ Local Units 
(whether listed in the destination 
or not) will reset their lowest-prior- 
ity tie breaker arbitration ID to 
their unit ID (see the section on 
the ICC bus for details). A Deliv- 
ery Mode of ‘Reset’ requires a 
“level” Trigger Mode. 


111: (ExtINT) means deliver the signal 
to the INT pin of all processors 
listed in the destination as an in- 
terrupt that originated in an exter- 
nally connected (8259A-compati- 
ble) interrupt controller. The Local 
Unit receiving this interrupt will ac- 
tivate ExtINTA in response to this 
interrupt message. A Delivery 
Mode of “ExtINT’ requires an 
“edge” Trigger Mode. (See the 
section on Compatibility for de- 
tails.) ! 


ADVANCE INFORMATION : , 
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Destination Mode [Bit 11] 


This field determines the interpretation 
of the Destination field. 


0: (Physical Mode): in Physical Mode, 
a destination 82489DX Local Unit is 
identified by its unit ID. Bits 56 
through 63 (8 MSB of the destina- 
tion field) specify the 8-bit unit ID. 


1: (Logical Mode): in Logical Mode, 
destinations are _ identified by 
matching on Logical Destination un- 
der the control of the Destination 
Format Register in each 82489DX 
Local Unit. The 32-bit Destination 
field is the logical destination. 


Delivery Status: [Bit 12] 


Delivery Status is a 1-bit field that con- 

tains the current status of the delivery 

of this interrupt. Two states are de- 

fined: 

0: (Idle) means that there is currently 
no activity for this interrupt; 


1: (Send Pending) indicates that the 

"interrupt has been injected, but its 
delivery is temporarily held up by 
other recently injected interrupts 
that are in the process of being de- 
livered; Delivery Status is software 
read-only; software writes to this 
field (as part of a 32-bit word) do not 
affect this bit. 


Bit 13: Bit 13 is Reserved. Should be written 0. 


Remote IRR: [Bit 14] 


This bit is used for level triggered inter- 
rupts; its meaning is undefined for 
edge-triggered interrupts. Remote IRR 
mirrors the interrupt’s IRR bit of the 
destination 82489DX Local Unit. When 
the value of the bit disagrees with the 
state of the Interrupt Input line, a unit 
message is automatically sent to make 
the destination’s IRR both reflect the 
new state of the Interrupt Input line, 
and then the Remote IRR bit is updat- 
ed to track its associated IRR bit. Re- 
mote IRR is software read-only; soft- 
ware writes to this bit do not affect it. 
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Trigger Mode: [Bit 15] 


The Trigger Mode field indicates the 
type of signal on the interrupt pin that 
triggers an interrupt. 


0: indicates edge sensitive, 
1: indicates level sensitive. 


Mask: [Bit 16] 


Use this bit to mask injection of this in- 
terrupt. 


0: indicates that injection of this inter- 
rupt is not masked. An edge or level 
on an interrupt pin that is not 
masked results in the delivery of the 
interrupt to the destination. 


1: indicates that injection of this inter- 
rupt is masked. Edge-sensitive in- 
terrupts signaled on a masked inter- 
rupt Input pin are simply ignored 
(i.e., It is not- delivered and is not 
held pending). Level-asserts or 
deasserts occurring on a masked 
level-sensitive pin are also ignored 
and have no side effects. As ex- 
pected, changing the mask bit from 
unmasked to masked while the lev- 
el remains asserted has the side ef- 
fect of deasserting the level. It is 
software’s responsibility to deal with 
the case where the Mask bit is set 
after the interrupt message has 
been sent but before the interrupt is 
dispensed to the processor. 


Bits [31:17] Bits [31:17] are reserved. Should be 


written 0. 


Destination 


Bits [63:32] 


Destination: If the Destination Mode of this entry is 
“Physical Mode”, then the 8 MSB [bits 
56 through 63] contain an 82489DX 
Local Unit ID. If Logical Mode, then the 
Destination field potentially defines a 
set of processors. The interpretation of 
the 32-bit destination field is further en- 
abled by the Destination Format Regis- 
ter in the 82489DX Local Units. 
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8.0 ICC BUS DEFINITION 


Physical Characteristics 


The ICC bus is a 5-wire synchronous bus connecting 
all 82489DxXs (all |/O Units and all Local Units). Four 
of these five wires are used for ciata transmissions 
and arbitration, ancl one wire is a clock. The descrip- 
tion refers to the logical state of the ICC bus. Electri- 
‘cal levels are just inverse of the logical state de- 
scribed. For example, the section describes that the 
ICC bus is 0000 when not transmitting any message. 
This refers to logical state. Electrically, the ICC bus 
is 1111 when not transmitting any message. 


The bus ts electrically an open-drain connection pro- 
viding for both bus use arbitration and arbitration for 
lowest priority. Being open-drain, the bus is run ata 


“comfortable” speed such that cesign-specific ter- 


mination tuning is not required. Furthermore, each 
82489DX receiving a message or participating in an 
arbitration must be given enough time in a single bus 
cycle to latch the bus and perform some simple logic 
operations on the latched information in order to de- 
termine whether the next drive cycle must be inhibit- 
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Note that it is likely in MP systems that additional 
processors be located on plug-in boards. Since the 
ICC bus would be part of the connector, the 
82489DxX to ICC bus connection is defined so that it 
can be electrically isolated using external drivers. 
The 82489DX has separate ICC bus input and out- 
put pins that can be connected externally to the 
82489Dx to either provide or not provide isolation. 


The isolation can also be used to provide a hierar- 
chical connection of ICC buses electrically support- 
ing large numbers of processors. The number of 
82489DXs supported using the hierarchical connec- 
tion is limited only by ICC bus bandwidth. It should 
be noted that ICC bus output low current is just 
4 mA. 


Bus Arbitration 


Arbitration (both for use of the bus and for determin- 
ing the lowest priority 82489DX) depends on all 
82489DX message units operating synchronously. 
To deal with the event where multiple agents start 
transmitting simultaneously, a distributed arbitration 
approach is used. Bus arbitration uses a small num- 
ber of arbitration cycles in the ICC bus. During 


82489DX 
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Figure 21. ICC Bits: Hierachical Connection 


4-250 


ADVANCE INFORMATION | 


intel. 


these cycles, arbitration losers progressively drop 
off the bus until only the winner remains transmitting. 
The winner then transmits its actual inter-unit mes- 
sage. Once the sending of a message (including bus 
arbitration) has started, any possible contender must 
suppress transmission until enough cycles have 
elapsed for the message to be fully sent. The num- 
ber of message cycles depends on the type of mes- 
sage being sent. 


A bus arbitration cycle starts by the agent driving its 
unit ID on the ICC bus. High-order ID bits are driven 
first, successive cycles proceeding to the low bits of 
the ID. All losers in a given cycle drop off the bus, 
using every subsequent cycle as a tie breaker for the 
previous cycle. By the time all arbitration cycles are 
completed, there will be only a single agent left driv- 
ing the bus. 


The 8-bit unit ID (17 16 15 14 13 12 11 10) is chopped up 
in successive groups of 2 bits (I7 16)(I5 14)(I3 12) 
(11 10). Each of these tuples is first decoded before 
driving them on the bus. The Os and 1 indicate logi- 
cal levels and not signal levels. The ICC bus is 0000 
when not transmitting any message. The decoding 
used is: 


ID Tuple 


a ICC Bus 


Note that the pattern generated on the ICC bus by 
tuple (I3 12) will be represented as 132 i32 i32 i32. 
The lower case signifies this encoding. 


Each tuple of the ID only contributes to a single wire, 
making it possible for an agent to determine with 
certainty whether to “drop off’ or to continue arbi- 
trating in the next cycle for the following two bits of 
the unit ID simply by checking whether the bus line 
the agent is driving is also the highest order 1 on the 
bus. Each ICC bus cycle therefore arbitrates 2 bits. 


i76 176 176 i76 ICC bus arbitration 
54 i154 i54 j54 
i832 i82 i382 i382 
i10 i10 i10 i10 
<message body > 


niece ears 
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Lowest-Priority Arbitration 


Arbitration is also used to find the 82489DX Local 
Unit with the lowest processor priority. Lowest-priori- 
ty arbitration uses the value of the 82489DX’s Proc- 
essor Priority value appended with an 8-bit Arbitra- 
tion ID (Arb ID) to break ties in case there are multi- 
ple units executing at the lowest priority. 


Using the constant 8-bit unit ID as the Arb ID has a 
tendency to skew symmetry since it would favor 
82489DxXs with low ID values. An 82489DX Local 
Unit’s Arb ID is therefore not the unit ID itself but is 
derived from it. At reset, an 82489DX Local Unit’s 
Arb ID is equal to its unit ID. Each time a message is 
broadcast over the ICC bus in lowest priority mode, 
all 82489DX Local Units increment their Arb ID by 
one, which gives them a different Arb ID value for 
the next arbitration. The Arb ID is then endian-re- 
versed (LSB becomes MSB, etc.) to ensure better 
rotation of which 82489DX gets to have the lowest 
Arb ID next time around. The reversed Arb ID is then 
decoded to generate arbitration signals on the ICC 
bus as described above. 


To support hot insertion of processor boards in a 
running MP system, a mechanism is provided to al- 
low the 82489DX of the added processor to syn- 
chronize its Arb ID with the existing 82489DxXs. This 
is accomplished by broadcasting a message with 
Delivery Mode equal to ‘Reset’, Trigger Mode equal 
to “Level’, and Level equal to 0. This message 
must be broadcast before the newly added 
82489DxX is allowed to participate in a lowest-priority 
arbitration. Depending on the exact sequence under 
which the newly inserted board is powered-up and 
initialized, this Arb ID synchronization may occur nat- 
urally if a Reset-deassert to the new 82489DxX is part 
of that sequence. If not, the local processor can al- 
ways send this as an inter processor interrupt (with a 
null destination), causing only Ihe side effect of re- 
setting all 82489DX Arb IDs. 


ICC BUS MESSAGE FORMATS 


The short message format is described first. Note 
that the first 19 cycles of both short and long mes- 
sage formats have the same interpretation. 


1: i76 i76 i76 i76 \CC bus arbitration 
2: 154 (54 i54 i54 
3: i382 i382 i382 i832 
4. 110 i10 i10 i10 
5: DM M2 M1 MO = destination mode and 
delivery mode 

control bits 


vector 


6: 9” =O L TM 
7: V7 V6 V5 V4 
8: V3 V2 V1 VO 
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destination 


checksum for 
cycle 5 through 
16 | 
post amble 
accept (1000 if 
OK, 1110 if 
preempt, else 
error) 

20: “Oo” “0” 0” “0” idle 1 

21: “0” “0” “OQ” “0” idle 2 


Cycles 1 through 4 are bus arbitration as described 
earlier. Cycle 5 (DM M2 M1 MO) is the Destination 
Mode which is 0 for Physical mode: and 1 for Logical 
Mode, and the Delivery Mode of the message. The 
encoding used for the Delivery Mode in the message 
is identical to the encoding usec for the Delivery 
Mode in the Redirection Table, Local Vector Table, 
and Interrupt Comrnand Register. 


Intel. 


Delivery Mode 


Lowest Priority 


| MO 

= 

FO <reserved> 
po [| MM 


— EXtINT 


Cycle 6 contains the Control Bits of the message. 
The control bits are: 


e TM (Trigger Mode): indicates whether this mes- 
sage corresponds to an edge or level; 


© L (Level): indicates whether this is an Assert or a 
Deassert of a “level” signal. L is undefined when » 
TM is edge. 


6: “0” “O” L TM ~~ Control Bits 
TM = Trigger Mode (0 = edge, 1 = level) 
= Level (0 = deassert, 1 = assert) 


The length of the message is derived from the Deliv- 
ery Mode, the Control Bits, and the Accept cycle of 
the message. 


TM/L (AAAA) 


| Edge 

Fixed : Short 

Lowest Priority Short (1110) 
Long (1000) 


Remote Read Long 


Reset 
ExtINTA 
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Level = Assert 


Short (1110) 
Long (1000) 
Shor 
Short “Short 


Level = Deassert 
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Cycles 7 and 8 are the 8-bit interrupt vector. The 
vector is only defined for Delivery Modes Fixed, and 
Lowest-priority. For Delivery Mode of ‘Remote 
Read’, the vector field contains the address of the 
register to be read remotely. 


If DM is 0 (physical mode), then cycles 9 and 10 are 
the unit ID and cycles 11 through 16 are zero. If DM 
is 1 (logical mode), then cycles 9 through 16 are the 
32-bit Destination field. The interpretation of the logi- 
cal mode 32-bit Destination field is performed by the 
Local Units using the Destination Format Register. 
The sending 82489DX knows whether it should (incl) 
or should not (excl) respond to its own message. 


Cycle 17 is a checksum over the data in cycles 5 
through 16. The checksum is computed by adding all 
4-bit quantities of cycles 5 through 16, feeding carry 
out of the MSB back into the LSB. This protects the 
data in these cycles against transmission errors. The 
(single) 82489DxX driving the message provides this 
checksum in cycle 17. 


Cycle 18 is a post amble cycle driven as 1111 by the 
sending 82489DxX allowing all 82489DXs to perform 
various internal computations based on the informa- 
tion contained in the received message. One of the 
computations takes the computed checksum of the 
data received in cycles 5 through 16 and compares 
it against the value in cycle 17. If any 82489DX com- 
putes different checksum than the one passed in 
cycle 17, then that 82489DxX will signal an error on 
the ICC bus in cycle 19 by driving it as 1111. If this 
happens, all 82489DXs will assume the message 
was never sent and the sender must try sending the 
message again, which includes re-arbitrating for the 
ICC bus. In lowest priority delivery when the interrupt 
has a focus processor, the focus 82489DX will sig- 
nal this by driving 1110 during cycie 19. This tells all 
the other 82489DxXs that the interrupt has been ac- 
cepted, the 82489DxXs is preempted, and short mes- 
sage format is used. All (non-focus) 82489DxXs will 
drive 1000 in cycle 19. Under lowest priority mode, 
1000 implies that the interrupt currently has no focus 
processor and that priority arbitration is required to 
complete the delivery. In that case, long message 
format is used. If cycle 19 is 1000 for non Lowest 
Priority mode, then the message has been accepted 
and is considered sent. 


19:EEEE 
1000 OK 
1110 preempt 


<others> error (drive error as 1111) 
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When an 82489DxX detects and reports an error dur- 
ing the error cycle, that 82489DxX will simply listen to 
the bus until it encounters two consecutive idle 
(0000) cycles. These two idle cycles indicate that 
the message has passed and a new message may 
be started by anyone. This allows an 82489DxX that 
got itself out of cycle on the ICC bus to get back in 
sync with the other 82489DxXs. 


Long Message Format 


Cycles 1 through 19 of the long message format are 
identical to cycles 1 through 19 of the short mes- 
sage format. As mentioned, long message format is 
used in two cases: . 


(1) Lowest Priority delivery when the interrupt does 
not have a focus. Cycles 20 through 27 are eight 
arbitration cycles where the destination 
82489DXs determine the one 82489DX with 
lowest processor priority/ARB ID value. . 


(2) Remote Read messages. Cycles 20 through 27 
are the 32-bit content of the remotely read regis- 
ter. This information is driven on the bus by the 
remote 82489DX. 


Cycle 28 is an Accept cycle. In lowest priority deliv- 
ery, all 82489DxXs that did not win the arbitration (in- 
cluding those that did not participate in the arbitra- 
tion) drive cycle 28 with 1000 (co accept), while the 
winner 82489DxX drives 1111. If cycle 28 reads 1111, 
then all 82489DXs know that the interrupt has been 
accepted and the message is considered delivered. 
If cycle 28 reads 1100 (or anything but 1111 for that 
matter), then all 82489DXs assume the message 
was unaccepted or an error occurred during arbitra- 
tion. The message is considered undelivered, and 
the sending 82489DxX will try delivering the message 
again. 


For Remote Read messages, cycle 28 is driven as 
1100 by all 8 2489DXs except the responding re- 
mote 82489DxX, who drives the bus as 1111 in case 
it was able to successfully supply the requested data 
in cycles 20 through 27. If cycle 28 reads 1111 the 
data in cycles 20 through 27 is considered valid; oth- 
erwise, the data is considered invalid. The source 
82489DxX that issued the Remote Read uses cycle 


- 28 to determine the state of the Remote Read 


Status field in the Interrupt Command Register (valid 
or invalid). In any case, a Remote Read request is 
always successful (although the data may be valid or 
invalid) in that a Remote Read is never retried. The 
reason for this is that Remote Read is a debug fea- 
ture, and a “hung” remote 82489DxX that is unable 
to respond should not cause the debugger to hang. 
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Cycles 29 and 30 are two idle cycles. The ICC bus is 
available for sending the next message at cycle 31. 
The two idle cycles at the end of both short and long 
messages, together with non zero (i.e., non idle) en- 
coding for certain other bus cycles allow an ICC bus 
agent that happens to be out of phase by one cycle 
to sync back up in one message simply by waiting 
for two consecutive idle cycles after reporting its 
checksum error. This makes use of the fact that val- 
id arbitration cycles are never 000). 


i76 i76 i76 i76 ICC bus arbitration 
{54 i54 i654 i54 . 

i832 i832 i382 i382 

i10 i10 i10 i10 

DM M2 Mi MoO delivery mode 
“oO” “Oo” L TM control bits 

V7 V6 V5 V4 vector 

V3 V2 Wi VO 

D31 D380 D29 D28 destination 

10: D27 D26 025 D24 

11: D23 D22 D21 D20 

12: D19 D18 Di7 Di6 . 

13: D15 D1i4 013 D112 
14: D1i1 D10 DOS DOs 
15: DO7 DO6 DOS5 D4 
16: DO3 DO2 DO1 DOO 
17, GCG CG G C 


CONOTWN> 


checksum for cycles 5 
through 16 
18: “1” “1” “4” “4 postamble . 
19: A A A A accept (1000 if OK, 
| 1110 if preempt, 
; else error) 
20: p76 p76 p76 p76 lowest priority 
; arbitration or 32 bits 
of remote register 
21: p54 p54 p54 p54 processor priority 
22: p32 p32 p32 p32 
23: pi0 pi0 p10 p10 
24: a76 a/76 a/6 a76 
25: a54 a54 a54 a54 arbitration ID 
26: a32 a32 a32 a32 
27: ai0 aiO ai10 ai0 
28: A A A A accept 
29: “00” “O” “0” “0” idlet 
30: ‘“O” “Od” “0” “0” idle2 


9.0 HARDWARE TIMINGS 


This section covers the following: 
— Timing Diagram Notation 
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— 82489DX Register Access Timing Diagrams with 
Descriptions 


A block diagram of the configuration of the CPU 
module of a MP system is shown. This in no way is 
intended to be a complete representation of 
486/Intel Cache/Intel Cache Controller connec- 
tions. It is intended to show all the 82489DX connec- 
tions, and how they connect to other components 
on and off-the module. This module has arbitrarily 
been drawn with a 64-bit data bus to show how the 
expanded address space architecture fits. The unit 
can be similarly attached to either a 32-bit or 128-bit 
data bus, with total transparency to shrink-wrap soft- 
ware. 


In this configuration, the 82489DX uses the same 
clock source as the processor and cache. However, 


it is quite possible to consider 82489DX as a memo- 


ry bus device and hence supply 82489DX with the 
memory bus clock, which can be slower than the 
CPU module clock frequency. : 


In the configuration shown, the processor’s INT and 
NMI pins could be supplied by other source to allow 
for the possibility that 82489DX can be totally by- 
passed if desired, by allowing those signals to be 
driven from off the module while the 82489Dx is dis- 
abled. The reset signal generated by the 82489DX 
goes to the MBC (memory bus controller) which is 
required to drive configuration lines at reset time. 
This would probably be configured as a “warm” re- 
set by the MBC. 


A future version of cache controller may generate 
the chip select for 82489DX at a fixed memory loca- 
tion of hexF-EEQ0000. By having the cache controller 
to provide the chip select signal, it would encourage 
a standard mapping for 82489DX address space. In 
some MBC designs, this signal should be connected 
to the MBC since 82489DX cycles limit bus pipelin- 
ing by constraining how soon the next bus cycle can 
come. The 82489DX chip select can be generated | 
by the MBC completely. 


The address, data and most of the bus control sig- 
nals share the respective bus with cache and cache 
controller. The block diagram shows attachment for 
only 6 address lines: A4—A9. A10 should be 0. This 
is all the 82489DX needs for operation, however, if 
the address lines are used to initialize 82489DX lo- 
cal ID at reset time, 8 address lines are required, 
A3-A10. 
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INTERFACING TO THE ICC BUS obtain first order behaviors. The parameters for 


these models are as follows: 
The 82489DX has separate ICC bus input and out- 


put pins to facilitate using external drivers. The ICC Cp (package capacitance) = 3 pF 
bus input pins (MBIO-3) are TTL-level compatible == Lp (package inductance) = 15 nH 
CMOS inputs. The output pins (MBOO-3) are open- Rb (bond wire resistance) = 0.080 
drain pins which required external pull-ups. The Ci (input buffer capacitance) = 3 pF 
open-drain output buffers are small buffers with: Co (output buffer capacitance) = 6 pF 


Ro (output buffer impedance) = 300-800 
Sink current of < 4 mA. Special consideration | | 
must be exercised when driving large capaci- , ; 
tive loads or long transmission lines. The pull- MBO Pull-up Resistor 
up resistor and the capacitive load constitute 
RC time constant that will affect the output 
transition times. This in turn will limit the oper- 
ating frequency of the ICC bus. 


To minimize the RC time constant, one would like to 
use the smallest pull-up resistor value possible. The 
MBO pins has a worst case lol-spec of 4 mA and 
Voc = 4.75V. This translates to a minimum pull-up 
of about 1 KM. Where stronger drive is needed 


When designing in the ICC bus, one needs to (smaller pull-up resistors), external drivers must be 


consider the loads that each 82489DxX will be 


driving and whether external drivers should Hoey 

be used. In most situations, the ICC bus driv- 

en high (MBO pins pulled high by the external Driving Lumped Capacitance 

pull-up resistors) poses the most challenge. 

Simulating the target design on an electrical In systems where external drivers are not used, the 

simulator (such as SPICE) will help greatly. MBI pins will be tied to the MBO pins. Figure 21d is a 

as shown in the following examples. SPICE simulation of the MBO output with a 1 KO 
pull-up driving lumped capacitive loads from 10 pF 
to 150 pF. 

First Order Buffer Models 


At a load of 50 pF, it takes about 30 ns to charge up 
to 2V. At 100 pF, it takes an additional 25 ns. Figure 
21d can be used to estimate the loading delay at 
different lumped capacitive loads. 


Figure 21a and 21b are first order input buffer and 
output buffer models of the MBI and MBO pins. The 
open-drain of the MBO is modeled as a switch as 
the primary interest here is the MBO pins going high. 
These models can be used on SPICE simulations to 
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Figure 21b. First Order Open-Drain Output Buffer for MBO Pins 


ADVANCE INFORMATION 4-255 


82489DX 


In real systems, the loads are made up of lumped 
capacitance and transmission lines. More accurate 
results can be obtained using transmission line mod- 
els. | 


Driving Transmission Lines 


Two device model 


In this example the ICC bus is a signal line on an 
FR-4 printed circuit board. The line width is 6 mils. 
Line length of 12 inches and 18 inches are modeled. 
The FR-4 PC board has the following characteris- 
tics:. » 3 


0.6 mQ/sq. (0.10/inch for 6 mil 


resistivity = 
width) 
inductance = 60 pH/sq. (10 nH/inch for 6 mil 
width) 
capacitance = 0.55 nF/saq. in. (3.3 pF/inch for 6 
_ rnil width) 
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The ICC bus is shared by two 82489DxXs, one at 


-each end. The ICC bus is modeled as a transmission 


line. For the simulation, only one of the 82489DxX is 
driving. A pull-up resistor of 2 KN. is used at each 
end (1 K© equivalent value) as shown in Figure 21e. 


- Figure 21f shows the signals at each end of the 12 


inch transmission line. Trace 1 is the wave form at 
the driven end and trace 2 is the signal at the receiv- 
ing end of the line. The 2 ns delay between the two 
signals is the propagation delay (or flight time) 
through the 12 inch transmission line. It takes about 
35 ns for the voltage to charge up to 2V. 


Figure 21g shows the received signal with different 
line length and with additional lumped capacitance. 
Trace 1 is for 12 inch only. Trace 2 is for 12 inch with 
additional 20 pF lumped capacitance to represent 
interconnect socket capacitance. Trace 3 is for 18 
inch plus 20 pF. The presence of the 20 pF at each 
end of the 12-inch transmission line increases the 


- delay time by 20 ns at 2V. 
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Figure 21d. 1 KQ. Pull-Up Driving Lumped Capacitance 
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Four Device Model 


In this example (Figure 21h), the ICC bus is a 
12-inch transmission line with four 82489DXs con- 
nected at 4 inch intervals. The loading at each junc- 
tion consisted of the MBI and MBO buffers and a 
20 pF lumped capacitance. 2 KQ1) pull-ups are at 
each end of the transmission line. 


As shown in Figure 21i, it takes more than 90 ns for 
the signal level at both ends to reach 2V. 


82489DX 
MBO 


82489DX 


One way to improve the low to high transition time is 
to use a stronger pull-up (smaller resistor value) 
which is possible using external line drivers wn their 
larger current drive capabilities. 


Figure 21j shows the difference in output when the 


model is used with 3002 pull-ups at each end of the 
transmission line. 


82489DX 
MBO 
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Figure 21f. Waveform at Both Ends of 12” T-line 
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External Drivers/ Buffered ICC bus 


The 82489DX has separate ICC Bus input (MBI) and 
output (MBO) pins that can be connected to external 


line drivers in systems that has appreciable loading | 


on the ICC Bus or where modularity of the bus is 
needed. | 


Figure 21k is a typical implementation using external 
drivers with tri-state outputs. Drivers such as 74F125 
or its equivalent can be used. The drivers should be 
placed as close to the MBO pins as possible. The 
input buffer on MBI is optional depending on the us- 


12" with 2k 


| 

intel. 
ers ICC Bus scheme. The total delay through the © 
drivers, buffers, transmission line, clock skews etc. 


must be calculated to ensure that all the ICC bus 
timing requirements are met. 


A hierarchical bus connection can also be used in 
applications that cannot afford driver/buffer per unit 
and where bus loading are localized in cluster 
groups. Figure 211 shows such a connection where 
each cluster group is connected directly and drivers 
are used to connect to other clusters. Each cluster 
group is assumed to be close together physically — 
with small loading on the local ICC bus. 


100.0ns 150.0ns 
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Figure 21h. Four 82489DX Configuration 
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Figure 21j. Waveform with Different Pull-Ups 
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Transmission Line Termination 


As with all high speed designs, one has to consider 
transmission line effects on signals, especially clock 
signals. Even though the ICC bus clock, ICLK is usu- 
ally operated in the 10 MHz range, one has to con- 
sider proper transmission line termination also for 
short rise times. Figure 21m shows the ICLK wave 
form at the end of a 12 inch T-line when driven by a 
clock generator with and without series matched ter- 
mination. 


- Series termination should not be used for the ICC 
bus data lines (MBO). The combination of the pull-up 
resistor and series resistor would clegrade the output 
low voltage, Vol. For example, with a pull-up of 30029. 
and a series termination of 5029 at each end, the Vol 


voltage at the receiving end woulc' be at 1.55V if the 


driving end is at 0.4V (see Figure 21n). 


ICC BUS Operating Frequency 


The 82489DX ICC BUS has a design target of oper- 
ating up to 16 MHz (62 ns period). As shown in the 
examples above, the MBO low-to-high transition 
times are strongly dictated by the loads and the pull- 
ups used. This will in turn affect the maximum oper- 
ating frequency of ICLK. 


In general, the minimum period is the larger of 62 ns 
or MBO-to-MBI low data time or MBO-to-MBI high 
data time. 


MBO-to-MBI low time = 


(ICLK skew + MBO valid low delay + T-line 
prop.delay + ext. buffer delay + MBI setup 


time) 
1K 
#5 
a 


Note: 


| Input buffer is optional 


a | ‘ 
intel. 
MBO-to-MBi high time = 


(ICLK skew + MBO Hi-Z delay + pull-high 
time + T-line prop.delay + ext. buffer delay — 
+ MBI setup time) | 


Maximum MBO valid low delay = 50 ns 
Maximum MBO H-Z delay = 15 ns 
MBI minimum setup time = 8 ns 


In the example shown earlier where two 82489DXs 
are at each end of a 12-inch T-line with no other 
loads, the pull-high time to 2V is 35 ns (trace 1 in 
Figure 21g). If the ICLK skew is 2 ns, then this con- 
figuration can operate to 62 ns period or 16 MHz. 


‘If the same configuration has additional 20 pF loads 


at each end, then the pull-high time is 55 ns (trace 2 
in Figure 21g). The maximum frequency decreases 
to 12 MHz (82 ns period). 


In the four device model discussed earlier, where 
the ICC Bus is unbuffered, the pull-high time is 90 ns 
(Figure 21j). The operating frequency will be less 
than 8 MHz (117 ns period). If external buffers are 
used (whereby allowing use of 300 pull-up) and 
assuming the external buffers have delays of 10 ns, 
the operating frequency is limited by the MBO-to- 
MBI low time of 72 ns or 14 MHz. 


: NOTE: 
Each application is unique in its configuration and 
loading on the ICC Bus. The above examples high- 
lighted some of the factors that need to be consid- 
ered. It is important to do electrical simulation to 
ascertain if the propose implementation is viable 
before committing to the printed circuit board. 
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Figure 21k. External Driver/Buffer Implementation 
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Figure 21n. Effect of Series Termination on MBO VOL 
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Figure 22. Possible Configuration of the CPU Module 


9.1 82489DX Register Access Timing 
This section provides descriptions of the four basic 
cycle timings for the 82489DX, which are: 

— Register Write 

— Register Read 

— Interrupt Acknowledge 

— Reset 


In addition, Inter-Unit (nibble) bus timings are pre- 
sented. 


Register access occurs in three distinct phases: 
1. Control Phase, 

2. Address Phase and 

3. Data Phase. 


They always occur in this order, although in some 


cases address and data phases can occur in the 
same clock cycle, as will be seen in the diagrams. 
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NOTE: 
As mentioned previously, the clock signal in all 
these timing diagrams is assumed to be the proces- 
sor clock. 


TIMING DIAGRAM NOTATION 


The 82489DxX bus (register access) interface is syn- 


chronous. Unless otherwise noted, setup and hold 
times for inputs, and delay times for outputs are 
measured with respect to a rising clock edge. There- 
fore the timing diagrams contain very few construc- 
tion lines to identify timing parameters and ones that 
exist are explicitly discussed. High and low level for 
signals are obvious; dashed lines at the middle of 
the signal range indicate a tristate output buffer in 
high impedance mode; hashed or cross-hatched 
lines indicate a “don’t care” state for inputs. 


Cycle expansion marks—short curved lines breaking 
the waveform—appear throughout. These appear in 
sets or groups which are identified by construction 
arrows and/or vertical column alignment. Conditions 
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resulting in cycle expansion are listed at the bottom 
of each diagram, and the associated set of expan- 
sion marks indicates which signals must be 
stretched for that condition. Signals not so indicated 
are not affected by that condition, and continue with- 
out any cycle stretching. For example, the data 
phase of a transaction may be delayed, stretching all 
data related signals, while address related signals 
can continue to the next cycle. 


Sample points for input signals are marked with 
bold, downward pointing arrows. Since sample tim- 
ing for address, data and cycle definition signals is 
dependent on the timing of related control signals, a 
bar is used on top of the arrow to indicate the signal 
with the independent timing. Each signal group has 
exactly one independent signal. In general, indepen- 
dent signals are sampled on every clock, and there- 
fore must meet setup and hold times on every clock 
edge. Signals having dependent timing, (indicated 
by the arrow with no bar), are only sampled when 
the associated independent signal is active, and 
therefore setup and hold times for dependent sig- 
nals need only be met at the peieaice sample 
points. 


REGISTER WRITE TIMING 


For discussion of this bus cycle, refer to Figure 23, 
82489DX Timing Diagram 1. This shows the relation- 
ship between the three phases of the bus cycle. 


The control phase is independently timed_by the 
ADS signal. The cycle definition signals [M/IO, D/C, 
are dependently sampled with ADS as indicated by 
the bold sample point arrows labeled “C’’. The cycle 
definition signals will be sampled in the first clock 
when ADS is active (low). The control signal should 
remain stable until ADS goes inactive. For any valid 
82489DX cycle, the memory bus controller should 
ensure that the ADS pulse for a subsequent bus cy- 
cle is NOT presented until after the 82489DX as- 
serts its RDY pin (low) as shown. 


The address phase is independently timed by the 
(BGT) signal, as indicated by the bold sample point 
arrows labeled ‘‘A’’. This signal is actually used an 
address latch enable, however, its name is intended 
to imply that in most cases it can be directly driven 
by the Intel cache controller signal of the same 
name. The BGT pulse may be delayed until the ad- 
dress bus is available, in which case all address and 
data phase signals will be stretched. Note that DLE 
must not occur before BGT. 82489DX does not start 
the internal cycle until BGT is recognized with the 
appropriate chip select signal, If multiple ADS has 
been issued without BGT and a valid chip se- 
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lect, the internal cycle starts with the most recent 


ADS cycle definition preceding the BGT with valid 


chip select. 


NOTE C: 
Address information, including chip select (CS), is 
sampled in the first clock when BGT is active (low), 
and they must_remain stable until BGT goes inac- 
tive, OR until RDY is asserted (low), whichever oc- 
curs first. CS should be stable when ADS is active. 


In some configurations, BGT may not be provided, 
and can be permanently tied low. In this.case, the 
independent address timing will occur exactly one 
Clock after the ADS signal is first sampled low, and 
the dependent address information (address and 
chip select) will be assumed stable at_this time. 
82489DX recognize that independent BGT timing is 
not provided by sampling a low state of BGT at the 
time ADS is first sampled low. 


The data phase is independently timed by the DLE 
signal, as indicated by the bold sample point arrows 
labeled “D’’. In the case of register writes, this signal 
work logically like a synchronous data latch enable. 
The DLE pulse may be delayed until the data bus is 
available, in which case data and RDY will be 
stretched. | 


NOTE D: = 
Write data are sampled the first clock when DLE is 
active (low), and should remain stable until DLE 
goes inactive, OR until RDY is asserted (low), 
whichever comes first. 


In some configurations, DLE may not be provided, 
and can be permanently tied low. In this case, the 
independent data timing will occur exactly one clock 
after the ADS signal is first sampled low, OR on the 
same clock as BGT is first sampled low, whichever 
occurs later. The data bus will be assumed stable at 
this time. 82489DX recognize that independent DLE 
timing is not provided by sampling a low state of DLE 
at the time ADS is first sampled low. 


Cycle completion is signaled by the RDY signal. Its 
relative positioning on any of the timing diagrams 
does NOT imply the number of clock cycles required 
for an access. RDY is delayed as needed in order for 
the 82489DX to complete the cycle. It is then assert- 
ed (low) for one clock cycle and then deasserted. 
Again, the next ADS cannot start until after RDY has 
been driven low. ADS must return to an inactive high 
state before the next cycle can be issued. It is highly 
recommended not to have, ADS more than one 
clock wide. 
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REGISTER READ TIMING 


For discussion of this bus cycle, refer to Figure 24, 
82489DxX timing Diagram 2. It shows the relationship 
of the three phases of the bus cycle, however, the 
dependent control and address signals are not 
shown here, since they behave exactly asin the 
case of a register write. See the previous section for 
the description of control and address phases of the 
bus cycle. 


In the case of a read when DLE is used (Figure 24A), 
it works logically like an asynchronous, output data 
enable. The 82489DX drives the data bus within 
time delay “A’’ after DLE is asserted, which must not 
occur before BGT. Note that even though the bus is 
_ being driven, the data only becomes valid during the 
clock cycle in which RDY is asserted, after that 
point, valid data is maintained on the bus as long as 
DLE remains asserted, after which the data bus re- 
turns to high impedance state within time delay “B” 
of DLE deassertion. If DLE is asserted late, 
82489DX could complete its internal read cycle and 
return RDY early. In that case, RDY active low state 
will be maintained until. DLE is asserted. 


_ In the case of a read when DLE is NOT used (Figure 


- 24B) the data is driven for exactly one clock cycle, 


coincident with RDY being asserted. 


DLE is sampled with the control signals to determine 
whether it is being used. If sampled in the asserted 
state when ADS is active, the DLE will be consid- 
ered not used, and its state during the remainder of 
the cycle doesn’t matter. This is consistent with the 
notion of permanently tying this signal low when not 
used, as described in the previous section. 


Indication of the end of the bus cycle is dependent 
on the use of DLE. When it is not used, (i.e., perma- 
nently tied to ground) RDY indicates the end of the 
bus cycle, as it does in the case of write access. 
When it is used, DLE deassertion indicates the end 
of the cycle, since the 82489DX could be driving the 
bus well after RDY is deasserted. In either case, the 
82489DX can accept the next ADS pulse anytime 
after RDY has been asserted. However, note that if 
DLE is being usec, the next ADS should be delayed 
until DLE can be safely sampled inactive. Note these 
two options for earliest next cycle in the timing dia- 
gram. 


INTERRUPT ACKNOWLEDGE TIMING 


For discussion of this bus cycle, refer to Figure 25 
82489DxX timing diagram 3, This cycle is the result of 
the 82489DX posting an interrupt to the processor 
by asserting PINT. After PINT is asserted, other bus 
cycles may occur before the interrupt acknowledge 
cycle. 
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PINT can be asserted for any external (8259) inter- 
rupt as shown in Figure 25A, or for an 82489DX gen- 
erated interrupt as shown in Figure 25B. ExtINTA 
indicates whether PINT was asserted in response to 


an 8259 request or an 82489DX request. This signal 
is used by external control logic to either allow or 


preclude the 8259 from responding to the subse- 


quent interrupt acknowledge cycle. It should be not- 
ed that ExtINTA pin gets deactivated at third clock 
after the ADS of the second INTA cycle. 


When ExtINTA is high, the 82489Dx will not respond 


. to the acknowledge cycle other than to deassert 


PINT signal, and clear the pending external interrupt 
two full clock cycles after the ADS of the second 
INTA cycle, as shown in Figure 25A. Once PINT is 
asserted in response to an external interrupt, it can 
only be deasserted by an INTA cycle. An INTA cycle 
is recognized by 82489DX as soon as the bus cycle 
definition is sampled with ADS in a low state. BGT is 
not needed in this case. 


When ExtiNTA is low, the 82489DX will respond to 
the acknowledge cycle, as shown in Figure 25B. In 
this case, external logic (e.g., the memory bus con- 
troller) is expected to prevent any attached 8259 
from seeing the acknowledge cycle. When PINT is 
asserted in response to an 82489DxX internal inter- 
rupt, it can be deasserted by an INTA cycle. The 
PINT signal will be deasserted 5 full clocks after 
BGT of the second INTA cycle. 


Note that ExtINTA is stable at all times while PINT is 
asserted. That means that even if new interrupts ar- 
rive between the time an interrupt is posted to the 
processor, and the acknowledge occurs, the 
82489DX will not change its commitment for an ex- 
ternal (8253) or internal (82489DX) acknowledge cy- 
cle, regardless of priority. This also means that PINT 
may be raised for a high priority internal interrupt 
right after responding to the external Interrupt. In any 
event, PINT will be kept low for a minimum of two 
clocks before reasserting itself. 


The interrupt acknowledge cycle is indicated by the 
bus cycle definition signals all being low, and looks 
like two consecutive read cycles, except that there is 
no explicit address information. The actual content 
of the adcress pins during this cycle is processor 
dependent, and therefore there is no chip select ei- 
ther. Chip select is implied by a combination of the 
bus cycle definition signals (all low) and BGT. 


Note that there is a “dummy” data phase in the first 
interrupt acknowledge cycle. This allows parity to be 
generated on the bus for processors like i860XP. 
During this cycle, 82489DX drives random data on 
the bus with the appropriate parity. The interrupt Re- 
quest Register is “frozen” and the highest priority 
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pending interrupt vector is returned to the processor | 


in the second acknowledge cycle. 


The second acknowledge cycle has a complete data 
phase with timings identical to those of an ordinary 
register read. The data returned is the vector of the 
highest priority internally pending 82489Dx interrupt, 
or the spurious interrupt vector, if there is no inter- 
rupt pending higher than the current processor prior- 


ity. 


Note that the timing diagram shows DLE being used 
(sampled high during ADS). However, just as in a 
normal read cycle, the option exists not to use DLE 
(i.e., permanently tied to ground). 


RESET AND MISCELLANEOUS TIMING 


For discussion of this bus cycle, refer to Figure 26 
82489DX Timing Diagram 4. It shows the 82489DX 
reset cycle, the timing of some related signals, and 
the ICC bus. The RESET input has a setup and hold 
time to the system clock edge, CLKIN, as do other 
independently timed signals, The RESET signal will 
reset the two asynchronous system on the chip, 
namely the ICC bus unit running synchronously to 
the ICLK and all the other unit running synchronous 
to the system clock, CLKIN. RESET must meet the 
minimum reset time with respect to both clocks and 
there should be at least one ICLK rising edge during 
reset. The TAP controller should also be initialized. 


During reset, an eight-bit 82489DX Local Unit ID can 
be optionally initialized. Eight address lines, A10-—-A3 
are sampled on every clock edge while RESET is 
asserted. The last sample remains in the 82489DX 
Local Unit ID register after reset. Alternatively, the 
82489DX Local Unit ID can be loaded with a register 
write as part of software initialization, before 
82489DxX operation is started. In any event, the reg- 
ister must be initialized before the 82489DX can 
communicate on the ICC bus, including sending/re- 
ceiving RESET messages. All valid signal to 
82489DX should wait at least two full clocks after 
RESET is deasserted. 


The PRST signal (reset output) is asserted both with 
RESET input, or under software control. Its on-off 
delay times are relative to the rising clock edge. The 
duration of PRST under software control is defined 
by the software itself. Also note that the PNMI pin 
has the same timing as does PRST when the latter 
is software controlled. 


The ICC bus signals are both input and output on 
each cycle. Setup, hold and delay times are all mea- 
sured with respect to the ICC bus Clock ICLK which 
has no relationship to the Processor clock on which 
the remainder of the 82489DX runs. This means 
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that the ICC bus is independently sampled on each 
ICLK edge, as shown. It also implies that largest 
possible hold time will not exceed the minimum de- 
lay time. 


After reset, all 82489DX registers are reset to “0” 
state. The mask bits in the local vector table and the 
redirection table are reset to ‘‘1”’ state to mask out 
all interrupts. All reserved bits are all wired to ‘‘O” 
state permanently on chip. 


10.0 BOUNDARY SCAN 
DESCRIPTION 


The 82489DxX is equipped with the JTAG boundary 
scan standard. This feature allows the user to test 
the interconnections between 82489DxX and the ex- 
ternal hardware once they have been assembled 
onto a printed circuit board or other substrate. In 
addition to the JTAG mandatory instruction set, 
82489DX also provides the INTEST instruction 
which allows static testing of the on-chip logic. 


The detailed information related to the IEEE Std 
1149.1-1990 (the JTAG standard) can be obtained 
from the reference document /EEE Standard Test 
Access Port and Boundary Scan Architecture (IEEE 
Std 1149.1-1990). 


10.1 Boundary Scan Architecture 


The boundary scan logic contains the following ele- 
ments: 


— Five Test Access Ports (TAP): They are la- 
beled as trst, tck, tdi, tdo and tms. All ports are 
input pins except tdo, which is a tri-state output 
pin. 

— A TAP Controller: The logic is used to control 
the boundary scan activity. 


— 82489DX Device ID Register: This is a 32-bit 
read-only register. The DID can be shifted out in 
ascending order to the tdo pin. 


— JTAG Instruction Register (IR): This is a 4-bit 
register which accepts instruction code shifted in 
from the tdi pin. The opcode stored in the IR reg- 
ister is used to control operation. 


-— Boundary Scan Register: This is a 137 stages 
scan path which connects almost all 82489DX 
signal pins for boundary scan purposes. 


— Bypass Register: This register simply allows the 
data which goes into tdi pin to be shifted out di- 
rectly from tdo. 


The following block diagram illustrates the imple- 
mentation of the JTAG architecture in the 82489DX 


design. 
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7 Boundary. scan reg. ‘ 
ICC JTAG ID reg. es 
By-pass reg. | 


instruction reg. 
TAP controller F 


Figure 27. Block Diagram of the JTAG Architecture 


tms.trst 


290446-17 


Test Access Forts 


trst TAP controller master reset pin. When trst is 
low, the TAP controller's state machine will be 
reset to “test-logic-reset” state asynchronous- 
ly. This pin is tied to a weak internal pull-up for 
keeping to be a logical 1 when not driven. 


tck This is the test logic clock. The test logic will 
change state on the rising edge of the tck. 


tdi Test data input..Data is shifted into the tdi pin 
on the rising edge of tck. This pin is tied to a 
weak interna! pull-up for keeping it to be a logi- 
cal 1 when not driven. 


tms Test mode select. This pin is used to select the 
state of the TAP controller. This pin is synchro- 
nous to the rising edge of the tck. This pin is 
tied to a weak internal pull-up for keeping it to 
be a logical 1 when not driven. 
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tdo Test data output. This is a tri-state pin which 
allows the data to be shifted out. 


TAP CONTROLLER 


The TAP controller in 82489DX is implemented to 
conform the IEEE1149.1 standard. The TAP control- 
ler is a single phase clock, synchronous finite state 
machine. It controls the sequence of the operation 
of the test logic. 


The value of the test mode state (tms) pin at a rising 
edge of tck controls the sequence of the state 
changes. The state diagram for the TAP controller is 
shown in Figure 28. Test designers must consider 
the operation of the state machine in order to have 
the correct sequence of value to drive on tms. 


The behavior of the TAP controller and other test 


logic in each of the controller states is briefly de- 
scribed as follows. 
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Figure 28. TAP Controller State Diagram 


TEST-LOGIC-RESET 


The test logic is disabled so that normal operation of 
the on-chip system logic (i.e., in response to stimuli 
received through the system pins only) can continue 
unhindered. This is achieved by initializing the in- 
struction register to contain the IDCODE instruction. 
No matter what the original state of the controller, it 
will enter Test-Logic-Reset when tms is held high for 
at least five rising edges of tck. The controller re- 
mains in this state while tms is high. 


lf the controller should leave the Test-Logic-Reset 
controller state as a result of an erroneous low sig- 
nal on the tms line at the time of the rising edge on 
tck (for example, a glitch due to external interfer- 
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ence), it will return to the Test-Logic-Reset state fol- 
lowing three rising edges of tck with the tms line at 
the intended high logic level. The operation of the 
test logic is such that no disturbance is caused to 
on-chip system logic operation as the results of such 
an error. On leaving the Test-Logic-Reset controller 
state, the controller moves into the Run-Test/Idle 
controller state where no action will occur because 
the current instruction has been set to select opera- 
tion of the device identification register. The test log- - 
ic is also inactive in the Select-DR-Scan and Select- 
IR-Scan controller states. 


Note that the TAP controller will also be forced to 


the Test-Logic-Reset controller state asynchronous- 
ly by applying a low logic level at trst. 
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RUN-TEST/IDLE 


A controller state between scan operations. Once 
entered, the controller will remain in the Run-Test/ 
Idle state as long as tms is held low. When tms is 


high and a rising edge is applied at tck, the controller _ 


moves to the Select-DR-Scan state. 


In the Run-Test/Idle controller state, activity in se- 

lected test logic occurs only when certain instruc- 

tions are present such as RUNBIST. Since 82489DX 

does not have RUNBIST instructior,, this state is act- 
ing like an idle state. 


The instruction does not change while the TAP con- 
troller is in this state. 


SELECT-DR-SCAN 


This is a temporary controller state in which all test 
data register (82489DX has one test data register 
which is the boundary scan shift registers path) se- 


lected by the current instruction retain their previous — 


state. 


If tms is held low and a rising edge is applied to tck 
when the controller is in this state, then the control- 
ler moves into the Capture-DR state and a scan se- 
quence for the selected test data register is initiated. 
If tms is held high and a rising edge is applied to tck, 
the controller moves on to the Select-IR-Scan state. 


The instruction does not change while the TAP con- 
troller is in this state. . 


SELECT-IR-SCAN 


This is a temporary controller state in which all test 
data registers selected by the current instructing re- 
tain their previous state. 


If tms is held low and a rising edge is applied to tck 
when the controller is in this state, then the control- 
ler moves into the Capture-IR state and a scan se- 
quence for the instruction register is initiated. If tms 
is held high and a rising edge is applied to tck, the 
controller returns to the Test-Logic-Reset state. The 
instruction does not change while the TAP con- 
troller Is In this state 


CAPTURE-DR 


In this controller state data may be parallel-loaded 


into test data regis sters selected by the current in- 
struction on the rising edge of tck. If a test data reg- 
ister selected by the current instruction does not 
have a parallel input, or if capturing is not required 
for the selected test, then the register retains its pre- 
vious state unchanged. The instruction does not 
change while the TAP controller is in this state. 
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When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 


‘Exit 1-DR state if tms is held at 1 or the Shift-DR 
state if tms is held at 0. 


SHIFT-DR 


In this controller state, the test data register con- 
nected between tdi and tdo as a result of the current 
instruction shifts data one stage towards its serial 
output on each rising edge of tck. Test data registers 
that are selected by the current instruction, but are 
not placed in the serial path, retain their previous 
state unchanged. The instruction does not 
change while the TAP controller is In this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit 1-DR state if tms is held at 1 or remains in the 
Shift-DR state if tms is held at 0. 


EXIT 1-DR 


This is a temporary controller state, if tms is held 
high, a rising edge applied to tck while in this state 
causes the controller to enter the Update-DR state, 
which terminates the scanning process. If tms is 
held low and a rising edge is applied to tck, the con- 
troller enters the Pause-DR state. All test date reg- 


isters selected by the current instructions retain 


their previous state unchanged. 


The instruction does not change while the TAP con- 
troller is in this state. 
PAUSE-DR 


This controller state allows shifting of the test data 
register in the serial path between tdi and tdo to be 


temporarily halted. All test data registers selected by 


the current instruction retain their previous state un- 
changed. 


The controller remains in this state while tms is low. 
When tms goes high and a rising edge is applied to 
tck, the controller moves on to the Exit 2-DR state. 
The instruction does not change while the TAP con- 
troller is in this state. 


EXIT 2-DR 


This is a temporary controller state. If tms is held 
high and a rising edge is applied to tck while in this 
state, the scanning process terminates and the TAP 
controller enters the Update-DR controller state. If 
tms is held low and a rising edge is applied to tck, 
the controller enters the Shift-DR state. 
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All test data registers selected by the current in- 
struction retain their previous state unchanged. The 


instruction does not change while the TAP controller 
is in this state. 


UPDATE-DR 


Some test date registers may be provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in the associated 
shift-register path in response to certain instructions 
(e.g., EXTENT, INTEST, and RUNBIST). Data is 
latched onto the parallel output of these test data 
registers from the shift-register path on the falling 
edge of tck in the Update-DR controller state. The 
data held at the latched parallel output should not 
change other than in this controller state unless op- 
eration during the execution of a self test is required 
(e.g., during the Run-Test/Idle controller state in re- 
sponse to a design-specific public instruction). 


All shift-register stages in test data registers select- 
ed by the current instruction retain their previous 
state unchanged. The instruction does not 
change while the TAP controller is in this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Select-DR-Scan state if tms is held at 1 or the Run- 
Test/Idle state if tms is held at 0. 


CAPTURE-IR 


In this controller state the shift-register contained in 
the instruction register loads a pattern of fixed logic 
values on the rising edge of tck. In addition, design- 
specific data may be loaded into shift-register stages 
that are not required to be set to fixed values. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state. 


When the TAP controller is in this state and rising 
edge is applied to tck, the controller enters either the 
Exit 1-IR state tms is held at 1 or the Shift-IR state if 
tms is held at 0. 


SHIFT-IR 


In this controller state the shift-register contained in 
the instruction register is connected between tdi and 
tdo and shifts data one stage towards its serial out- 
put on each rising edge of tck. 
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Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state. 
When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit1-IR state if tms is held at 1 or rémains in Shift-IR 
state if tms is held at 0. 


EXIT 1-IR 


This is a temporary controller state. If tms is held 
high, a rising edge applied to tck while in this state 
causes the controller to enter the Update-IR state, 
which terminates the scanning process. If tms is 
held low and a rising edge is applied to tck, the con- 
troller enters the Pause-IR state. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not changes while the TAP controller is in this state 
and the instruction register retains its state. 


PAUSE-IR 


This controller state allows shifting of the instruction 
register to be halted temporarily. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state 
and the instruction register retains its state. 


The controller remains in this state while tms is low. 
When tms goes high and a rising edge is applied to 
tck, the controller moves on to the Exit 2-IR state. 


EXIT 2-IR 


This is a temporary controller state. If tms is held 
high and a rising edge is applied to tck while in this 
state, termination of the scanning process results, 
and the TAP controller enters the Update-IR control- 
ler state. If tms is held low and a rising edge is ap- 
plied to tck, the controller enters the Shift-IR state. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state 
and the instruction register retains its state. 


UPDATE-IR 
The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 


ter path on the falling edge of tck in this controller 
state. Once the new instruction has been latched, it 
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becomes the current instruction. Test data regis- 
ters selected by the current instruction retain 
their previous state. 


When the TAP controller is in this state and a rising 
edge is applied to ick, the controller enters the Se- 
lect-DR-Scan states if tms is held at 1 or the Run- 
Test/Idle state if tms is held at 0. 

INSTRUCTION REGISTER 


The function of the instruction register is to select 


the operating mode of the test logic. For instance, | 


read the ID register, or capture the 82489DX output 
signals. 82489DX has implemented 4 instructions. 


| Instruction | Mandatory/Optional 
om 
[Fewest 


Bypass Instruction 


o0/3/3/3 


The bypass instruction selects the bypass register to 
be connected to tdi and tdo, effectively bypassing 
the test logic on the 82489DX boundary scan path 
and reducing the shift length to be on one bit. Note 
that an open circuit fault in the board level test data 
path will cause the bypass register to be selected 
following an instruction scan cycle due to the inter- 
nal pull-up on the tdi pin. This has been done to 
prevent any unwanted interference with the proper 
operation of the system logic. 


Extest Instruction 


The extest instruction allows testing of circuitry ex- 
ternal to the component package, typically board in- 
terconnects. It does so by driving the values loaded 
into the 82489DX’s boundary scan register out on 
the output pins corresponding to each boundary 
scan cell and capturing the values on 82489DX’s 
input pins to be loaded into their corresponding 
boundary scan register locations. |/O pins are se- 
lected as input or output depending on the value 
located into the output control cell. Values shifted 
into input latch in the boundary scan register are 
never used by the internal logic of the 82489DX. 
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82489DX must be reset after extest instruction has — 
been executed. 


Sample/Preload Instruction 


The sample/preload instruction has two functions 
that it can perform. When the TAP controller is in the 
CAPTURE-DR state, the sample/preload instruction 
allows a snap-shot of the normal operation of the 
82489DxX without interfering with that normal opera- 
tion. The instruction causes boundary scan register 
cells associated with outputs to sample the value 
being driven into the 82489DX. On both outputs and 
inputs the sampling occurs on the rising edge of tck. 
When preloads data into the 82489DX pins to be 
driven to the board by executing the extest instruc- 
tion. Data is preloaded to the pins from the boundary 
scan register on the falling edge of tck. 


dcode instruction 


The idcode instruction selects the device identifica- 
tion register to be connected to tdi and tdo, allowing 
the device ID code to be shifted out of the device on 
tdo. Note that the bit stream shifted into tdi will ap- 
pear on tdo after all 32 bits of the DID has been 
shifted out. 


DEVICE IDENTIFICATION REGISTER (DID) 


The device identification is a 32 bits number which 
can be read by the external hardware by using the 
idcode instruction. The 82489DX device ID is as- 
signed to 1489A013 (hex). This is subject to change. 


The upper 4 bits of DID may be changed for different 


version. The 16-bit number (bit 27-bit 12) 489A 
(hex) is the part ID.- The lower 12 bits are the manu- 
facturer ID for Intel which must be 013 (hex). 


BOUNDARY SCAN REGISTER 


82489DX has only one test data register, i.e., the 
boundary scan register. The boundary scan register 
is a single shift register path containing the boundary 
scan Cells that are connected to all signal input and 


‘output pins of the 82489DX. There are three generic 


type of boundary scan cells—input, output, and bi-di- 
rectional. For each input only cell, one stage of shift 
register is added to the boundary scan path. 


ADVANCE INFORMATION 


intel i. |  g2489DX 


All output pins will become tri-stateable when put driver with a specific tri-state control cell in the 
boundary scan is activated, regardless whether they scan path. The user must shift in a proper control 
are tri-stateable or not in the normal operation. To signal for these tri-state control cells in the scan 
explain further, the user will enable/disable an out- path. | 
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Figure 29. Logical Structure of Boundary Scan Register 
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BOUNDARY SCAN CELL NAMES IN ORDER 
FROM tdi TO tdo . 


The following table is a list of the boundary scan cell 
names in the order from tdi to tdo. The type informa- 
tion indicates the purpose of the cells. 


| = input only cell 

B = bi-directional cell 

T = tri-state output cell 

C = tri-state control. Note that the signal name 
enclosed within the parenthesis is con- 
trolled by this cell. 

Name Pin # 
CLKIN 
TMBASE 
ICLK 
DC 6 


OT 
N 


— 


= 
= 
Ol 
(o>) 


3 


> 
O 
” 
® 
B 


Oloalolo 
Tiln |n Ilion 
mi} @® |o | @ 
<feqi<. 
Oo;o0/]oO 
alaia 
NENT NDTN 
wolrnm]—|o 


| 
or 


reserved 
MBIS 
MBI2 
MBI1 
MBIO 
LINTIN1 
LINTINO 


reserved 


NOT ™NOTN TN 
oOo;dOIN!] Oo 


fe) 


(@) 
—*/ 
SI i~ 


1 


(reserved) 
INTINAS 8 
INTING4 83 
reserved 


N 


(reserved) 
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CellNumber | Type | Name | Pin # 


28 
29 
30 


31 


32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 


62 


63 
64 


z 

[1 Prive | 05 
TP reeeved | 
To tesorved | 
Pints | 08 
Pantin” | 67 
[7 | rwsowed | 
To [tesored | 
: 

ed 
a 
1 fivtiny | 00 
ee 
a a 
Te [tresonee) | 
1 Pivtins | oa 
ee 
(7 [resoned | 
Te [tesonee | 
Pirin | 98 
1 Fawtino [97 
[1 [resoned | 
e_[ors | 101 
[~c | wreay | 

[owe 


:0)) 
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[eetnunber | Type [Name [pine 
res | 8 | bee 108 
ee 
Ter | tI reconed | 
Tes |_| wosorve | 
ee 
ee 
ee 


a 
76 | 1 | weed [| 
ee 
70 | 8 [oe | 10 


| 90 | TT | reserved | 
pt | | reserved) | 
ee ee 
| 93 | 8 | oo 
|| TC reserved | 
p85 | | reserved) | 

ee 
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[eeinumber | Type | Name [in 
rte | 8 feed 
Pts fe fori ts 
[toe | 8 [ood ts 
[105 [1 Presonea te 
[106 [8 [resened | 20 
P07 [| cttos, area | 
108 [Te fa 
[108 [8 


24 


G | © 
— 


reserved 


reserved 


Ooi; h al asi Ni @ | oO 
NO 
<<) 


’ 
oo 
= 
—” 
Ss 

ih ND ur o NI a rs S| ® IN) 


U UV 
a a 
H|E/= 


=> {mM 
rat 
Zz 
= 
> 
pas 


reserved) 


reserved 


as 
= 
| 3 
n 
© 
< 
@ 
Q. 
— 
BAN 


me) 
O 
< 
BAN 
Go 


BO3 


4 


fee) 


aN 
i<e) 
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BYPASS REGISTER 


The bypass register is simply a 1-bit shift. register 
which connects between the tdi and tdo. When se- 
lected by using the bypass instruction, the data shift- 
ed into tdi will be shifted out from tdo one tck clock 
later. 


JTAG TAP Controller Initialization 


The TAP controller must be reset to test-logic-reset 
state when 82489Dx is first powered up. There are 
two ways to reset the TAP controller: 


1. Assert trst to be 0, it will reset the TAP controller 
asynchronously. 
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2. Assert tms to be 1, and clock the TAP controller 


at least five times, the TAP controller will be reset 
after the fifth rising edge of the tck. 


After reset, the idcode instruction is loaded into the 
IR automatically. 


Note that the tms and trst pins both have an internal 
weak pull-up device to keep them to be logic 1 level. 
Therefore the user can simply apply 5 clocks at the 
tck input to reset the TAP controller. If the TAP con- 
troller is not reset properly, 82489DX may not func- 


_ tion because the boundary scan logic might be ac- 


tive which will impact the signals flow in and out to 
the chip. 
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11.0 ELECTRICAL NOTICE: This data sheet contains information on 


CHARACTERISTICS products in the sampling and initial production phases 
of development. The specifications are subject to 


| change without notice. Verify with your local Intel 
11.1 D.C. Specifications Sales office that you have the latest data sheet be- 
fore finalizing a design. 


ABSOLUTE MAXIMUM RATINGS | * WARNING: Stressing the device beyond the “Absolute 
Maximum Ratings” may cause permanent damage. 
Case Temperature Under Bias... —65°C to + 110°C These are stress ratings only. Operation beyond the 


“Operating Conditions” is not recommended and ex- 


Storage Temperature .......... —65°C to + 150°C tended exposure beyond the “Operating Conditions” 
Voltage on Any Pin may affect device reliability. 
with Respect to Ground...... —0.5 to Vcc + 0.5 


Voc = 5V +5%; To = O°C to + 85°C 


[symbol [Parameter | Minne) | Wax | Unite [Notes 
i 
putrichvotagermy [20 | voovos [vf 

Pig onpativatgers [| a 
Vow | Output GH Votage my |e Wate 2) 
Tice | s8irePover supp Career [| 
Fin | tnputlealagecurent | | | pa | 
ree 
ae ae a ae OO 
Sa a OL AE OS 
co | Wooroutntcapactance [|e | | 
cauan | Grock Capactance «| Sid Se 
Timo | 100 8us OutpstLowGurent [| —+|«—*dt mate) 
Tuo | 160 Bus Total Capacitance | «dtr 
Twi | tec usinputrigh crm | 20 | Vegros |v | 
TV. | tec Bus nutuowcr) | -08 | 40a |v 


NOTES: 

1. This parameter is measured with current load of 4 mA. 

2. This parameter is measured with current load of 1.0 mA. 

3. This parameter is for output without pulldown. 

4. This parameter is for tri-state output with pulldown and Voy = 3.0V. 
5. This parameter is for input with pullup at Vi, = OV. 

6. ICC bus output low current is measured at 0.6V. 


ttt 
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A.C. Parameters Referencing 33 MHz System Clock 
Vcc = 5V +5%; Tc = 0°C to + 85°C 


D31--D0, DP3-DP0, A9-A3 Setup Time | 31 a ae 


ADS, BGT, DLE, M/IO, D/C, W/R, CS 31 
Hold Time 


D31--D0, DP3—DP0, A9-A3 Hold Time 3 
D31--D0, DP3-DPO0, Valid Delay 


Symbol | - Parameter Ge jin 7 Max | Notes 
CLKIN Low Time 30 a ee ie 
| +3 | CLKIN Rise Time 30 | {8 | (Note 2) | 
/ t4 | CLKIN Fall Time 0 Ff 8 | Note 2) | 

5 __ ADS, BGT, DLE, M/I0, D/C, W/R, CS 31 | 


+ 
hb 
© 


oh 
° 


ie) 
© 


2 a ae 


18 
: 
of hal 
2 
so | 39 | wi 
|e | | totes) | 


ans 
ee 

Civ [ [note] 
Tao |_| too) | 


tn al 
—s 
_s 


t12 D31--D0, DP3-—DP0, Low-Z Delay 
When DLE is NotUsed 
t13 D31--D0, DP3-—DP0, High-Z Delay 32 


When DLE is Not Used 


t14 D31--D0, DP3-DPO Enable Delay 
When DLE is Used 


D31--D0, DP3-DPO Disable Delay 
When DLE is Used 


RDY Valid Delay 

PRST, PNMI, PINT Valid Delay 
RESIET Setup Time 

RESET Hold Time 

RESET Cycle Time 
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t24 INTIN[15:0], LINTIN[1:0] Low Time 


All parameters are given in nanoseconds. 
TTL Level timing is measured at 1.5V for both “O” and “1” levels. 


NOTES: 

1. ICC bus clock ICLK period must be at least 5 ns longer than system clock CLKIN for proper synchronization of the 
internal asynchronous signals. 

2. System clock CLKIN measured from 0.8V—2.0V. 

3. Minimum Reset cycle is the greater of the two cycle times. 

4. Minimum pulse width must be met for valid level to be attained on the DATA or ADDRESS output. 

5. Set up and hold time is required for RESET to start at the next rising edge of the clock. 

6. INTIN and LINTIN low time is measured from 1.5V of the falling edge to 1.5V of rising edge. 

. 7. Not 100% tested. Guaranteed by design characterization. 
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Time Base A.C. Parameters 
Voc = 5V +5%; To = 0°C to + 85°C 


[set [Pernt 


TAP Controller A.C. Parameters Voc = 5V +5%; Tc = 0°C to + 85°C 


[symbor | Parameter ——_——_—(|Ret Fig. | Minne) | Maxine) | Note 
ic | ToKewod——SSSC~=~sSCiSCSdC att 
[iso | ToKtightine ———SSS—~=~s tw] 
ist | TeKtowtime ————SSsS—~—~—s ws to 
rise | toxrisotine ——SSCSC~“~*~‘“dtS Ss 

iss | ToKFaltime ——SOS~=~“~*“~*~*~stCiSSCSSSSC*dC 
risa | TOL TMS, TRST Soup tine ———i«dts=Ce | | 
iss | To, TMS, TRSTHodtime —~SCi«dts=Cie Ss | 
[ise | TRSTwinimumtowTine ‘| | 10 | | wrote) | 


Ail parameters are given in nanoseconds. 
TTL level timing is measured at 1.5V for both “0” and “1” levels. 


NOTES: 
1. These parameters are specified for 50 pF load. 
2. This parameter is measured at 1.5V between the rising and falling edges. 


: ADVANCE INFORMATION 4-281 


82489DX 


A.C. Parameters for ICC Bus 
Voc = 5V +5%; To = O°C to + 85°C 


| Parameter 

ICLK Period 

ICLK High Time 

ICLK Low Time 

ICLK Rise Time 

ICLK Fall Time 

MBI3—MB10 Setup Time 
MBI3-—MB10 Hold Time 
MB03-MBO0 VALID Low Delay 
MB03-MBOO0 VALID High-Z Delay 


MB03-MBOO0 VALID Low-Z Delay 
All parameters are given in nanoseconds. 


Ref. Fig. 
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TTL level timing is measured at 1.5V for both “0” and ‘1”’ levels. 


NOTES: 


1. MBI3-O0 and MBO3-0 timing is tested at 150 ns cycle time. 


2. This parameter is specified for 50 pF load. 
3. Not 100% tested. Guaranteed by design characterization. 


12.0 REGISTER SUMMARY 


82489DX registers can be located at any 1 Kbyte 
boundary in either memory or |/O space for as far as 
the 82489DxX architecture itself is concerned. From 
a platform standard point of view, it is recommended 
to locate all 824891IDX Local Units in memory space 
at address OxFEEO—O0O00. It is further recommend- 
ed that all 82489DX I/O Units also be located in 
memory space; I/O Unit 1 at address OxFECO— 
0000, I/O Unit 2 (if present) at address 0xFECO— 
1000, and so on. Chip select for the 82489DX 
should be based on a full decode of address pins 
A31-A10. | 


All directly accessible 82489DxX registers are 32 bits 
wide and are aligned at 128-bit boundaries. The reg- 


ister being accessed is determined by bits 4 through | 


9 of the address. This is listed in the tables below. 
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Addresses not listed are reserved by the architec- 
ture. The tables also show whether the register is 
readable and/or writable by software, and what the 
side effects are of software accessing the register. 


After reset, all registers are initialized to all zeroes 
with the following exceptions, The Local Unit ID field 
is initialized with data present on the 8 LSB address 
pins. The Mask bit is initialized to 1 (“masked”’ state) 
in all entries in both the local vector table and the 


~ redirection table. 


For the I/O Unit, only the I/O register select and |/O 
window registers are directly accessible in the ad- 
dress space. The other I/O unit registers are ac- 
cessed indirectly through the select and window reg- 
ister, 
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I/O Unit Registers | 


Register Address(9:4) | SW | __ Side Effects 


70 Reg Select 70) | SW | Side Efecte 
I/O Unit ID Register ~ 9000 0000 
Version Register r 
Redirection Table [0] (31:0) 
Redirection Table [0] (63:32) 
Redirection Table [1] [31:0] 
Redirection Table [1] [63:32] 


Redirection Table [2] (31:0) 0001 0100 


Redirection Table [2] (63:32) 0001 0101 


Redirection Table [3] [31:0] 
Redirection Table [3] [63:32] 
Redirection Table [4] (31:0) 
Redirection Table [4] (63:32) 
Redirection Table [5] [31:0] 
Redirection Table [5] [63:32] 
Redirection Table [6] (31:0) 
Redirection Table [6] (63:32) 
Redirection Table [7] [31:0] 
Redirection Table [7] [63:32] 
Redirection Table [8] (31:0) 
Redirection Table [8] (63:32) 
Redirection Table [9] [31:00] 
Redirection Table [9] [63:32] 
Redirection Table [10] (31:00) 
Redirection Table [10] (63:32) 
Redirection Table [11] [31:0] 
Redirection Table [11] [63:32] 
Redirection Table [12] (31:0) 
Redirection Table [12] (63:32) 
Redirection Table [13] [31:0] 
Redirection Table [13] [63:32] 
Redirection Table [14] (31:0) 
Redirection Table [14] (63:32) 
Redirection Table [15] [31:0] 
Redirection Table [15] [63:32] 
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LOCAL UNIT REGISTERS 


Registers SW Side Effects 
Local Unit ID Register rw 
Version Register ne | 
Reserved 
Task Priority Register Iw | mask intr dispense 
EOI Register rw prioritization cycle 
Remote Register . oof 
Logical Destination Reg. rw 
Destination Format Reg. rw 


IRR (127:96) 10 0011 r 
IRR (159:128) 10 0100 r 
IRR (191:160) 10 0101 r 


Spurious Vector Register rw 
sR Ot 3 
ISR (63:32) 
ISR (95:64) 
ISR (127:96) r 
ISR (159128) 
ISR (191:160) 
ISR (223:192) _ 
ISR (255:224) 
TH : 
TMR (63:32) of 
TMR (95:64) | 
TMR (127:96) P  ototn ert 
TMR (159:128) 
TMR (191:160) 
TMR (223:192) r 
TMR (255:224) 
AO 3 
IRR (63:32) 
IRR (95:64) | or 

100011 

| 190100 

| 100101 
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LOCAL UNIT REGISTERS (Continued) 


[Resistors ——=S=*d:C*«C restos) | sw ~d|~SCSide fects 
[naeaetse)—SSCSC~srSC(“i‘it SP 
[inn essazySSSC~SC tort dP 
[rt comm Reg. 682) | tt0001 dw 
[Local vectorTable timer) [nooo Sw 
[Resend Toons PP 
OD 
[Local veciorTable focally [noon i) ow | 
[Local VestorTablelocalitt) [1100 rw 
Theor ton dP 
[ia CountRegster = 1000 «dw 
[curent CountRegiter it tttoon id 
al RINNE 
——— 
i 
ee 
ef 
ene Meenas 


Divider Configuration Reg. 11.1110 
ie ae 


NOTE: 
Address space 101000 to 101111 and 111111 are reserved 


13.0 TIMING DIAGRAMS 


tris too toy 


Output Signal 
290446-20 


Figure 30. Output Waveform 
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13.0 TIMING DIAGRAMS (Continued) 


CLKIN 
Input Signal 
290446-21 
Figure 31. Input Waveforms 
CLK 
ADS 
DLE 


bD[3 1:0], DP[3:0] 


290446-22 


Figure 32. Data Bus Tri-State Delays when DLE Sampled Low with ADS 
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13.0 TIMING DIAGRAMS (Continued) 


D[3 1:0], DP[3:0] 
290446-23 


Figure 33. Data Enable/Disable Delay when DLE is Sampled High with ADS 


Output Signal, TDO 


290446-24 


Figure 34. TAP Signal Timings 


: ADVANCE INFORMATION 4-287 


82489DX | 7 | | | intel ; 


13.0 TIMING DIAGRAMS (Continued) 


~ t3a,tga, | 133,143 
t52 t53 


SH SF FFF SF FSFEFHFVFSFFFSSSS BZD SFFVSVESTSESFVSESSTSSSVSVSSE FTHSSVAVTSEHRVSFESSEVSVNSVSVSVSSsVSUVSTASSVSSVSVSSSVSsWWSSSN|SVVVS Q#geseeseRRAeas 


8SHSHFFVSVFVSHFVVSTVFAHSSlS QSFSeeseeS*eeses*seF*sVssVe* Rt SSS*SseS*S*esSW*V|esS* sls SF Fs Sl e*aePeVeVWseel®S|SsVl* SSVWsSsS SVS SVeSSVsSesesVseVpsseseeva geFeeeeeseseeeeuvase 


(30, 40, t50 t31, tqy, ty 


SMWVWVVSSVSSE SSVBA FGAVSlVFVOFFSTSlVVSSVSTHS FHSS FHTTDHVHHHTHVNSTSE TZLAVSVSVVI*sSeVeeWlsessese Seeae LHeseeeeseesseesveuds: 


TMBASE 


time tics tic 
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Figure 35. TMBASE, ICLK, TCK Timing 
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Figure 36. ICC BUS Open-Drain Output Delay 
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14.0 PACKAGE PIN-OUT 


132-Lead PQFP 
Package Type KU (See Packaging Specification. Order Number 240800) 


113 =) Vssp 
108 = Vecp 
106 == Vssp 
100 = Vecp 


a. a 
Baa 8 
> > 
(=) ~n 
N - 
- Sd 


TOP VIEW 
132 —-LEAD PQFP 


126 E>) Vccp 


Vss 

Vec 
INTINO 
INTIN 1 
INTIN2 
INTINS 
INTIN4 
INTINS 
INTING 
INTIN7 
INTINS 
INTINS 
INTIN10 
INTIN 1 1 
INTIN12 
INTIN13 
INTIN14 
INTIN1S 
LINTINO 
LINTIN 1 
MB10 — 
MB11 
MB12 
MB13 
Reserved 
CS 

DLE 
Reserved 
Reserved 
Reserved 


OON ODN AWDND om 


Reserved 
Reserved 


Vsspo Co 36 


Reserved Comj 34 
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NOTE: 
See pin description section for appropriate pin-strapping of the reserved pins. 
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15.0 PACKAGE THERMAL 
~ SPECIFICATION 


The 82489DX is specified for operation when the 
case temperature is within the range of O°C to 
+ 85°C. The case temperature may be measured in 


! a) | : <i 
/: 4) “imi (Ws 
“Ah IY Th oe 


1 
132 


Plastic Quad Flat Pack (PQFP) 


Lj : 

intel ° 
any environment, to determine whether the device is 
within the specified operating range. 


The PQFP case temperature should be measured at 
the center of the top surface opposite the pins, as 
shown in Figure below. 


MEASURE PQFP CASE TEMPERATURE 
AT CENTER OF TOP SURFACE 
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PQFP Package Thermal Characteristics 


Thermal Resistance—°C/W 


Parameter 


6 Junction to Case 


6 Junction to Ambient 


Air Flow Rate (Ft./Min) 


NOTES: 


1. Table above applies to 82489DX PQFP sieaed into a socket or soldered directly into the board. 


2. O54 = 9jc + 9Ca~. 
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16.0 GUIDELINES FOR 82489DX 
USERS 


16.1 Initialization 


This section outlines one possible initialization sce- 
nario. Other scenarios are certainly possible, and 
one would be selected as part of a platform stan- 
dard initialization scheme. The intent of this section 
is to illustrate that the initialization support provided 
by the 82489DX is adequate to support MP (Multi- 
processor) system initialization. 


Each 82489DX has a RESET input pin connected to 
a common Reset line. Upon system reset, this com- 
mon reset line is activated, causing all the 
82489DXs to go through reset. All 82489DX local 
units (note: only local units and not |/O units) latch 
their ID from their address bus on reset. The ID can 
be provided by the bus control agent based on slot 
number. 


The local units next assert their processor’s Reset 
pin, holding the processor in reset, and next perform 
their internal reset, setting all registers to their initial 
state. The initial state of all 82489DX Units (both 
local and |/O units) is “all masks set’ and all Local 
Units disabled; registers are otherwise initialized to 
zero. Note that the PINT and PNMI output pins are in 
tri-state mode when the local unit is disabled. After 
this, each 82489DxX local unit will deassert its proc- 
essor’s Reset pin, allowing the processors to come 
out of reset and perform self test and start executing 
initialization code. 


Note that while connecting PRST pin it should be 
noted that whenever PRST pin is activated by 
82489DX either because of software reset message 
or hardware reset, the 82489DxX itself is reset. It 
should be taken care in the cases of Warm reset 
where only processors need to be reset and not the 
interrupt controller. In brief, the usage of PRST de- 
pends upon the system requirement on various re- 
set. 


Somewhere in this code sequence, the processors 
that are “alive” will enable their 82489DxX local units, 
and attempt to force all the other processors back 
into Reset. Forcing the other processors into reset is 
performed by sending them the inter-processor in- 
terrupt with Destination Mode = ‘Physical’, Deliv- 
ery Mode = “Reset”, Trigger Mode = “Level”, 
Level = ‘1’, and Destination Shorthand = ‘All Excl 
Self’’. Only the first processor to get the ICC bus will 
succeed in sending this signal and reset all other 
82489DxXs and their processors. The other proces- 
sors are kept in reset until such time that an MP 
operating system decides they can become active 
again. The only running processor next performs the 
rest of system initialization. 
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Eventually, an MP operating system will be booted 
at which time the operating system would send 
“deassert reset’’ interprocessor signals to activate 
the other processors in the system. A mechanism 
must be provided by the platform that allows the 
added processors to differentiate the very first reset 
from a subsequent one. 


16.2 Compatibility 
COMPATIBILITY LEVELS 


The 82489DX can be used in conjunction with stan- 
dard 8259A-style interrupt controllers to provide a 
range of compatibility levels. 


At the lowest level we have “PC shrink-wrap” com- 
patibility. This level effectively creates a uniproces- 
sor hardware environment within the MP platform 
capable of booting/running DOS shrinkwrap soft- 
ware. In this mode, only the 8259A generates inter- 
rupts and the 82489DX becomes a virtual. wire. The 
interrupt latency can be minimized by connecting the 
8259A interrupt to local unit directly. 


The next level preserves the software compatible 
view of an 8259A but it allows more than one proc- 
essor to be aciive in the system. This results in an 
asymmetrical arrangement, with one processor field- 
ing all 8259A interrupts but with added inter-proces- 
sor interrupt capability. In this mode, 82489DX 
“merges” 8259A interrupts with inter-processor in- 
terrupts. Existing |/O drivers would be bound to the 
compatible CPU and interface directly with the 
8259A. 


At the next compatibility level, 8259A compatible 
drivers can be mixed with native 82489DxX drivers. 
Devices can generate interrupts at either 8259A or 
an 82489DxX. This provides for partial symmetry as 
individual drivers migrate from the 8259A to native 
82489Dxs. 


Another 8259A compatible point can be defined for 
MP systems. Each processor could have its own 
compatible 8259A controllers, allowing multiple 
processors to run compatible !/O drivers, but stati- 
cally spreading the load across the available proces- 
sors. 


82489DX/8259A INTERACTION 


The principle of compatible operation is very 
straightforward; the 82489DX(s) become a virtual 
wire connecting the 8259A’s INT output through to 
the processor, while at the same time making 8259A 
visible to the processor. 
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The two connection schemes described only differ 
in the number of 82489DX(s) (one or two) that are 
located in the path from the 8259A to the processor. 
In the one 82489DX example illustrated in Figure 37, 
the INT output of the 8259A connects to one of the 
Interrupt Input pins of the 82489DX through an edge 
generation logic. This could be an interrupt pin on 
the 82489DX’s |/O unit or local unit; assume a local 
interrupt input is used. The Local Vector Table entry 
for the interrupt pin that connects to the 8259A is set 
up with a Delivery Mode of ‘‘ExtINT” and edge trig- 
ger mode. This indicates that the interrupt is gener- 
_ated by an external controller. The processor’s INT 
pin connects to the 82489DX PINT pin. 


This setup enables the 82489DxX local unit to detect 
assertions (up-edges) of the 8259A’s INT output pin 
and pass this on to the processor’s INT input. 
82489DX asserts ExtINTA pin along with (one clock 
prior to) PINT pin to indicate “8259” interrupt. When 
the processor performs its INTA cycle the 82489DX 
itself does not respond other than deasserting PINT 
to the processor. At the third clock after ADS in the 
second bus cycle of INTA cycle ExtINTA is deassert- 
ed. External logic should make use of the ExtINTA 
signal to make the INTA cycle visible to the 8259A 
and the 8259A should provide the vector. At the 
~ same time, the local unit considers the external re- 
quest as delivered, and need not wait for the exter- 
nal 8259A’s INT tc be deasserted. A new up-edge 
must be generated on the 8259A INT pin before the 
local unit will assert the processor’s INT pin on be- 
half of the 8259A. External edge generation logic 
should be used for this. Compatible software inter- 
acts directly with the 8259A. 


The mechanism is essentially the same in the two- 
82489DX scheme. The difference is that the 8259A 
connects to an interrupt input pin of the 82489DX 
I/O unit in the 1/O system. The Redirection Table 
entry for this pin is again programmed with an 
“ExtINT” Delivery Mode, and the (single) 82489DX 
destination local ID) corresponding to the compatible 
DOS processor. Capturing the up-edges of the 
8259A’s INT pin by the 82489DxX local unit now in- 
volves sending messages from the 82489DX I/O 
unit to.the 82489DX local unit via the ICC bus. The 
“virtual wire’? now includes messages over the ICC 
bus. | 


Adding inter-processor ICC interrupts (or any other 
82489DX generated interrupts) to the compatible 
operation is accomplished by having the 82489DX 
internally OR the 8259A’s INT request with any 
82489DxX interrupt request. 


Before the 82489DxX actually sends the interrupt sig- 
nal to the processor, the 82489DX decides whether 
it does this for arn 82489DxX interrupt or whether it 
does this on behalf of the external controller. When 
the processor performs the corresponding INTA cy- 
cle, only the 82489DX knows whether it should re- 
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spond with a vector, or whether the external 8259A 
should. 


If the 82489DX needs to respond, then it will enable 
an externally implemented trap that prevents the 
8259A from seeing the INTA cycle. If the 8259A 
needs to respond, then the 82489DxX will not enable 
the INTA trap, and the INTA will be allowed to reach 
the 8259A. 82489DX implements this by asserting 
its EXTINTA pin to indicate external 8259A should 
respond with the vector. The 82489DX local unit 
controls the INTA trap via its “ExtINTA” output pin; 
the 82489DX does not actually provide the trap it- 
self. 


— 


MP "SYSTEM" BUS 


PROCESSOR 
82489DX pal a Functional 


Block 


I EDGE 
LOGIC 


= 


Interrupt Requests 


8259A 


Equivalent 


INTA 
teks TRAP 
Bus Cti 


Figure 37. Edge Logic 


1/0 EXPANSION BUS ~ 
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82489DX/8259A DUAL MODE CONNECTION 


In systems that can be booted either as a configura- 
tion with compatible 8259A or without, device inter- 
rupt lines are connected to both the Interrupt Re- 
quest pins of the 8259A and Interrupt Input pins of 
the 82489DX with all interrupts either masked at the 
82489DX or at the 8259A. Some EISA and Micro- 
Channel chip sets that include on-chip 8259As also 
have internally connected interrupt requests. For ex- 
ample, the 82357 (the ISP of the EISA chipset) gen- 
erates timer and DMA chaining interrupts internally. 
These are not available as separate interrupts out- 
side the ISP. In non compatible mode the ISP timers 
are not used, since each local 82489DX unit pro- 
vides its own timer. Therefore, the ISP’s 8259A is 
configured to mask out all interrupts except the DMA 
chaining interrupt which is configured in level-sensi- 
tive, auto F:OI mode. This causes the 8259A’s INT 
output to track the state of the internal DMA inter- 
rupt request. The 8259A’s INT output is then con- 
nected to one of the 82489DxX interrupt input pins 
programmed to generate a regular (i.e., not 
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“ExtINT’) level-sensitive interrupt. The ISP 8259A 
then no longer functions as an external interrupt 
controller; it has been logically disabled, and it 
needs no interrupt acknowledge or EOI. The INTA 
and EOI cycles occur only at the 82489DxX. It should 
be noted that 82489DX accepts only active high lev- 
el/edge interrupt inputs. External programmable log- 
ic should take care of polarity reversal that may be 
needed in EISA system for sharing of interrupts. 


16.3 Hardware Guidelines 


82489DX HARDWARE STATE ON RESET 


The 82489DX goes to reset state either by Hard- 
ware Reset state or Software Reset message re- 
ceived on the ICC bus. On reset, 82489DxX is dis- 
abled. The following is the hardware state of 
82489DxX after reset. 


PRST } Active (HIGH) 
| PNMI | TRI-STATED (Internal Pull-Down Provided) 


PINT | TRI-STATED (Internal Pull-Down Provided) 


82489Dx is disabled on Reset and unless specifical- 
ly enabled, it does not start its interrupt mechanism. 


The difference between hardware reset and soft- 
ware reset message is that during hardware reset 
82489DX samples the address bus and stores the 
last sample in Local Unit ID whereas for software 
reset it does not sample and store the unit ID. In 
addition, during the hardware reset pulse should be 
wide enough to accommodate for at least one rising 
and falling edge of ICLK. On hardware reset ExtiN- 
TA is held high. 


PULL UP AND PULL DOWN RESISTORS 


PNMI, PINT are tri-stated at power on and they are 
maintained in tri-state condition till the unit is en- 
abled. Eventhough internal pull down resistor is pro- 
vided on PNMI and PINT external additional pull 
down resistor may be needed depending upon the 
loading on these pins by external logic. The DC 
characteristics gives the control specification from 
which the value of resistor, if needed, can be calcu- 
lated. 


It should be kept in mind that the ICC bus being 
electrically open drain bus requires pull up resistors 
at the MBO pins. ICC bus output low current is just 
4 mA. 


PINT and ExtINTA Timings 


It should be noted that for ExtINTA type of interrupts 
PINT gets activated one clock after ExtiINTA gets 
activated. When getting deactivated, both PINT and 
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ExtINTA gets deactivated at the same clock edge. 


ExtINTA Timings 


In the interrupt acknowledge cycle for External Inter- 
rupt control, 82489DX asserts ExtINTA. It decodes 
the type of cycle from CPU control signals like M/IO, 
D/C and W/R. CPU does two bus cycles back to 
back for interrupt acknowledge cycle. 82489DX 
maintains ExtINTA active throughout the first cycle. 
For next cycle (when the vector will be given by ex- 
ternal 8259) after 82489DX senses the start of the 
cycle (by ADS) 82489DX deactivates ExtINTA. Ex- 
ternal control logic may be inserting wait states to 
match the 8259 timings. Since 82489DX has no way 
of finding out the cycle completion, 82489DX deas- 
serts ExtINTA before the second bus cycle gets 
completed. This should be kept in mind while using 
ExtINTA for external interrupt control logic. 


82489DX AND MEMORY MAPPING 


The 82489Dx is a 32-bit high performance interrupt 
controller. It allows the CPU to do 32-bit read and 
write to it. By memory mapping 82489DxX the system 
performance can be enhanced. It should be noted 
that 82489DX does not support pipelining. Even- 
though 82489DX can be memory mapped, its func- 
tionality as an interrupt controller should be kept in 
mind while programming the virtual memory man- 
agement control data structure. The caching policy 
for the page where 82489DX is mapped should also 
be done with the functionality of 82489DX in mind. 
For example, the reads to 82489DX should not be 
cached and writes should be write-through. Since 
82489DX registers are aligned at 128-bit bounda- 
ries, memory mapping 82489DX with interleaved 
memory system should not be a problem. 


JTAG CIRCUIT CONSIDERATIONS 


The JTAG circuit is used for boundary scan test. The 
JTAG pins has a TCK, (JTAG clock), TRST, (JTAG 
Reset), TDI, (Test Data Input), TMS, (Test Mode Se- 
lect) and TDO, (Test Data Output). 


The JTAG circuitry, if not used, should be properly 
deactivated so that it will not interfere in the normal 
functional operations. The JTAG can be inactivated 
in any one of the following ways: 


1. JTAG inactivation through TRST: The TMS, Test 
mode Select should be either left open (internal 
pull up is provided) or tied to Voc. The TRST can 
be pulsed low (bring it low and after meeting the 
pulse width requirement bring it back high again) 
to keep the JTAG circuitry to idle state. The 

-TRST pulse brings the JTAG circuitry to idle state 
and TMS being kept high maintains the JTAG cir- 
cuitry in idle state. 
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2. JTAG inactivation through TCK: The TMS, Test 
mode select should be either left open (internal 
pull up is provided) or tied to Voc. The TCK within 
5 clocks brings the JTAG circuitry to idle state. 


-The TMS, being held at logic high level, maintains - 


the JTAG circuitry in idle state. 


16.4 Programming Guidelines 


- The 82489DxX register data structure contains differ- 
ent fields to specify the mode of operations and the 
options available within each mode. Since certain 
options are applicable to specific modes only (for 
example “Remote Read” mode applies only to inter- 
rupt command register, it does not have relevance 
to I/O unit’s redirection tables) the following pro- 
gramming guidelines are provided. 


UNIQUE ID REQUIREMENT 


All the local units and I/O units hooked in a ICC bus 
should have unique ID before they can use the bus. 
This should be ensured by the programmer since for 


ICC bus arbitration the units (whether it is local unit 


or !/O unit) arbitrate with their unit ID. 


For future compatibility, the Units should be as- 
signed IDs starting with 0, 1, 2 etc. with the highest 
ID in the system being number of units minus 1. So 
in a four 82489DX system there are four local units 
and four |/O units. The ID starts with 0 and the high- 
est ID in the system will be 7. Note that each unit 
should have different ID in the system. 


‘ATOMIC WRITE READ TO TASK PRIORITY 
REGISTER 7 


Normally, the task priority register is written with 


highest priority to mask certain low level interrupts: 


before entering into critical section code. In a sys- 
tem where 82489DX is memory mapped the CPU 
may buffer this task priority register write to its on 
chip write buffer. The following scenerio can happen 
in such situation: CPU posts task priority register 
write to its on chip write buffer and enters into the 
critical code. A lower priority interrupt (which should 
not enter the critical code) interrupts the CPU before 
the write buffer gets flushed into task priority regis- 
ter). The CPU accepts the lower priority interrupt. To 
avoid the situation atomic write read to task priority 
register should be done. The read following write en- 
sures that the write buffer is flushed to task priority 
register and the atomicity ensures that no interrupt 
will be accepted by the CPU during its write to task 


priority. 
It should be noted that if the CPU does interrupt 


acknowledge cycle only after flushing the write buff- 
ers then the above situation may not arise. 
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CRITICAL REGIONS AND MUTUAL EXCLUSION 


Each 82488DX has a single Interrupt Command 
Register that it uses to send interrupts to other proc- 
essors. The programmer should make sure to syn- 
chronize access to this register. Specifically, 1). writ- 
ing all fields of the register, 2). Sending the interrupt 
message (by writing the LSB register), and 3). wait- 
ing for Delivery State to become Idle again, should 
occur as a single atomic operation. For example, if 
interrupt handlers are allowed to send inter-proces- 
sor interrupts, then interrupt dispensing to the proc- 
essor must be disabled for the duration of these.ac- 
tivities. 


INTERRUPT COMMAND REGISTER 
PROGRAMMING SEQUENCE 


The interrupt command register (31:0) has a side 
effect of seriding interrupt once it is written. The des- 
tination is provided in the interrupt command register 
(63:32). So always interrupt command register 


(63:32) should be programmed before programming 


interrupt command register(31:0). 
Program Interrupt Command Register (63:32) 


L 


Program Interrupt Command Register (31:0) 


INTERRUPT VECTOR 


Two different interrupts should not be programmed 
with the same interrupt vector. 


LOCAL AND I/O UNIT 


Only Interrupt command register supports “Remote 
Read” Delivery mode. Local and |/O unit interrupts 
do not support ‘Remote Read”. 


ICR (INTERRUPT COMMAND REGISTER) | 
1. ExtINTA delivery mode is not supported for all 
destination shorthands. 


2. “Remote Read” should always be programmed | 
as “Edge” triggered interrupt. 


3. “Remote Read” should always be programmed 
with physical destination mode (and not with Log- 
ical Destination mode). Broadcast addressing 
should not be used for Remote Read. 


4. For “all incl Self’? and “‘All exc. self’ destination - 


shorthands, “remote read” delivery mode should 
not be used. 


5. For “all incl self’ and “self’’ destination short- 
hands “Reset” delivery mode should not be- 
used. | 
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ICR (INTERRUPT COMMAND REGISTER) 

(Continued) 

6. For “all exc self’? destination shorthand if “Re- 
set” delivery mode is used, it should be ensured 
at system level that only one processor executes 
this instruction at any time. 


7. Messages could be sent out in “Logical” or 
‘Physical’ mode with destination ID of all 1’s de- 
pending on the way Destination mode entry is 
programmed. In brief, “All incl self’? and ‘‘All exc. 
Self” support both “Logical” and ‘Physical’ ad- 
dressing mode. 


8. When destination shorthand (i.e., broadcast) is 
used with lowest priority destination mode, then 
even though all participates in arbitrating for des- 
tination, only the lowest priority gets the mes- 
sage. So even though the addressing is broad- 
cast since the destination mode is lowest priority 
only one gets the message. 


9. When destination shorthand (i.e., broadcast) is 
used with “Fixed” destination mode, then all the 
units get the message. 


ISR/IRR/TMR 


Bits O-15 of IRR/ISR/TMR do not track interrupt. 
No interrupt of vector numbers from 0-15 can be 
posted. The total interrupt supported are 240. When 
reading the lowest 32 bits of these registers, 0 will 
be returned for the lower 16 bits. 


FOCUS PROCESSOR 


Focus processor is applicable only within the ad- 
dressed units. 


ExtINT Interrupt Posting 


The external interrupt has no priority relationship 
with the 82489DxX priority. But when posting an inter- 
rupt to the processor, if both an external interrupt 
and a 82489DX interrupt are pending, 82489DX 
could post either one to the processor. In 82489DX 
implementation, it would post external interrupt 
whenever there is no other 82489DxX interrupt that 
can be posted to the processor. It should be also 
noted that External interrupts can not be masked by 
raising task priority. However, they can be masked 
by the mask bit in the table entry for the (“ExtINTA’’) 
external interrupt. 


The extINT interrupts are specific in their character- 
istics in that they do not have any priority relation- 
ship with the rest of the interrupt structure. ISR and 
IRR bits in 82489DX are used to do the housekeep- 
ing functions for interrupt priority. Since extINT inter- 
. rupts do not have any priority relationship, ISR and 
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IRR bits are not maintained for external interrupts. 
As far as interrupt acceptance is concerned, if more 
than one extINT interrupts are directed towards a 
local unit, that local unit treats all the extiNT inter- 
rupts directed to it as only one extINT interrupt. This 
leads to an important point that in a system not more 
than one interrupt should be programmed as extINT 
interrupt type with the same destination. It should be 
noted that there can be more than one extINT type 
of interrupt in a system with each having different 
local unit as destination. 


Synchronizing Arb IDs 


Initialization of an 82489DX’s local unit ID is imple- 
mentation dependent. In some platforms, power-on 
reset will latch the right values into the 82489DxXs; in 
other platforms, unique IDs may be assigned by ini- 
tialization firmware. In both cases the 82489DX I/O 
unit should be assigned unique !D by initialization 
firmware. The important point is that the 82489DXs 
are required to have unique IDs before they can use 
the bus, and in addition, all their Arb IDs must be “‘in 
sync’. Synchronizing Arb IDs is accomplished as a 
side effect of a ‘“deassert reset” interrupt command. 
This resets the (rotating) Arb ID to the (constant) unit 
ID; it assumes that all 82489DXs have their unique 
ID. 


LOWEST PRIORITY 


“Only once delivery” semantics for a group destina- 
tion is guaranteed only if multiple fixed delivery of 
the same interrupt vector are not mixed. For lowest 
priority arbitration to work, all the arbitration ID of 
local 82489DXs in the system should be in sync. 
This means after local unit IDs are written in all local 
units (each ID should be different from other IDs) a 
RESET DEASSERT message should be sent in ALL 
INCLUSIVE mode. The RESET DEASSERT mes- 
sage should be sent before system is used for low- 
est priority arbitration. This ensures that all ARB IDs 
are also different. (Arb IDs are copied from local unit 
IDs during RESET DEASSERT message.) 


The RESET DEASSERT message, if not sent, only 
one delivery semantics may not be guaranteed in 
the cases where lowest arbitration is used in the sys- 
tem. 


DISABLING LOCAL UNIT 


Once the 82489DxX is enabled by setting bit 8 of 
spurious vector register to 1, the user should not 
disable the local unit by resetting the bit to 0. The 
result will put the local unit in an inconsistant state. 
The local unit can be disabled by sending “reset” 
interrupt message to the local unit. 
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ISSUING EOI 


EOI, End of Interrupt issuing indicates end of service 
routine to 82489DX. The ISR bit which is set during 
INTA cycle gets cleared by EOI. This section dis- 
cusses the relevence of EOI to the specific types of 
interrupts and its timing related to interrupt deasser- 
tion. 


EXTERNAL INTERRUPTS AND EOI 


External Interrupts should be programmed as edge 
type. INTA cycles to external interrupts are taken 
automatically as EOI by 82489DxX. This is similar to 
AEOI, Automatic End of Interrupt of 8259A. So there 
is no need to issue EO! to 82489DX for external 
interrupt servicing. This is done to achieve software 
transparency in the compatible mode. 


SPURIOUS INTERRUPTS AND EOI 


Spurious interrupts do not have any priority relation- 
- ship to other interrupts in the system. So IRR is not 
set for spurious interrupts. EO! should not be issued 
for spurious interrupts. It is advisable not to share 
the spurious interrupt with any vector. If spurious in- 
terrupt vector is shared with some other interrupt 
then. while servicing issuing EO! depends on the 
source of interrupt. If the source is spurious interrupt 
(for which the corresponding IRR is not set) then 
EOI should not be used. If the source is a valid inter- 
rupt sharing the spurious interrupt vector (for which 
the IRR is set) ther. EO! should be issued. 


NM AND EOI 


For NM type of interrupt no IRR bit is set. So EOl 
should not be issued while servicing NMI type of in- 
terrupts. 


TASK PRIORITY REGISTER 


Task priority register is used to specify the priority of 
the task the processor is executing. In 8259 the 
priority is defined only among the interrupts that it 
handles. 82489DX goes farther ahead in handling 
priority. In multitasking system, in addition to device 
interrupts various tasks have different priority and 
82489DxX allows consideration of the priority at sys- 
tem level. The processor specifies the priority of the 
task it executes by writing to task priority register. 
Now any interrupts at and below the task priority will 
be masked temporarily till the task priority gets low- 
ered. The masking granularity is at priority level. Out 
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of 256 interrupt vectors 16 priority levels are speci- 
fied and 16 vectors share one priority level. Since 
the masking granularity by the task priority register is 
at priority level, group of 16 vectors get masked 
when task priority register is increased by one level. 


When task priority is at its minimum level of 0, inter- 
rupt vectors having level 1 to 16 are passed to CPU. 
Stated in other words, even when the task priority 
register is at its minimum (of level 0), interrupt vec- 
tors at level 0 will be masked. This means that the | 
interrupt should not be programmed with vectors 0 
to 15. So out of 256 interrupt vectors, only 240 inter- 
rupt vectors (vector 16 to 255) can be used in 
82489DxX. 


ExtINT INTERRUPT AND TASK PRIORITY 


ExtINT interrupt does not have any priority relation- 
ship with other interrupts or task priority register. So 
ExtINT interrupt can not be masked by raising task 
priority. They can be masked by writing to the vector 
table entry which corresponds to ExtIiNT interrupt. 


REMOVING MASKS 


When enabling units and removing Mask bits in situ- 
ations where a device may already be injecting inter- 
rupts into the 82489DX system, the Mask in the Re- 
direction Table should be removed last to ensure 
proper initial state (e.g., Remote IRR bit matching 
IRR in local unit). 


DELIVERY MODE AND TRIGGER MODE 


it is software’s responsibility to make sure that Deliv- 
ery Mode and Trigger Mode are set to meaningful 
combinations as listed below. 


Trigger Mode 


Delivery Mode 

Fixed edge/level 
Lowest Priority edge/level 
Remote Read edge 

NMI level 
Reset level 
ExtINT : edge 


Software is also responsible for not using meaning- 


_less Delivery Modes in Redirection Table entries and 
_ local Vector Table entries (e.g., use of Remote Read 


delivery mode). 
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ASSIGNING INTERRUPT VECTORS 


Software has total control over the assignment of 
interrupt vectors to interrupt sources. The operating 
system writer should be aware of a number of things 
when doing this assignment. 


Some processor architectures assign a predefined 
meaning to some of the vectors (i.e., entries in the 
interrupt table) as entry points to certain trap and 
exception handlers (e.g., divide error, invalid op- 
code, page fault, etc.). The programmer is strongly 
advised not to reuse these vectors. 


The programmer must also be careful when using 


the same vector number to represent different inter- 
rupt sources (sharing vectors). This is especially true 
for level triggered interrupts. When multiple sources 
with different Redirection table entries share an in- 
terrupt vector, any of the sources deactivating its 
level signal will remove the interrupt request for all 
sources. Giving each interrupt source its interrupt 
vector in any case is the preferred approach. 


SENDING INTER-PROCESSOR INTERRUPTS 


Each 82489DX has a single Interrupt Command 
Register that it uses to send interrupts to other proc- 
essors. It is software’s responsibility to synchronize 
access to this register. Specifically, 1) writing ail 
fields of the register, 2) sending the interrupt mes- 
sage (by writing the low register), and 3) waiting for 
Delivery State to become Idle again, should occur as 
a single atomic operation. For example, if interrupt 
handlers are allowed to send inter-processor inter- 
rupts, then interrupt dispensing to the processor 
must be disabled for the duration of these activities. 


DELAY WITH LEVEL TRIGGERED INTERRUPTS 


When a level triggered interrupt source deasserts its 
interrupt input, the destination will clear the inter- 
rupt’s IRR bit only after receiving the message from 
the ICC bus. This introduces a small delay between 
the removal of the interrupt at the source and the 
removal of the interrupt at the processor. To avoid 
generating unnecessary interrupts, the interrupt han- 
dler should remove the interrupt at the source (at the 
device) as early as possible in the handler. 


In any case, handlers should be able to deal with 
unnecessary interrupts. 
RESET DEASSERT 


A side effect of a reset deassertion message broad- 
cast in the ICC bus is that all 82489DX local units 
reset their Arb ID to their unit ID. Interrupt com- 
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mands that use the “‘self’’ Destination Shorthand do 
not generate a message on the ICC bus. If software 
only wanis to generate the side effect of resetting 
Arb IDs, it should use a command with Logical Desti- 
nation Mode and a Destination field containing all 
zeroes. : 


INTERRUPT MASKING 


There are a number of levels at which interrupts can 
be masked, each resulting in a different behavior on 
interrupt delivery. 


e First, interrupt injection (or deliver) can be 
masked by setting the Mask bit in the interrupt’s 
Redirection Table or local Vector Table entry. 
These interrupts are ignored, no message is sent 
for them. Granularity is an individual interrupt. 


e Second, each 82489DX can individually mask in- 
terrupt dispensing by raising its Task Priority to 
some level. This 82489DxX will not dispense inter- 
rupts to its processor of this and lower priority 
unless it is currently the focus of the interrupt. 
Note again that the 82489DxX is designed to oper- 
ate as fully nested with non-specific EOI! (to use 
existing 8259A terminology). There is no explicit 
interrupt mask (such as MR) and there is no no- 
tion of specific EOI. 


e Third, each processor may provide a mechanism 
that masks all interrupt dispensing to it using the 
processor supplied instructions or status bits to 
do so. This does not interfere with lowest priority 
arbitration of the processor’s 82489DxX local unit. 


CHANGING REDIRECTION TABLES 


Redirection Tables are typically set up at initializa- 
tion time. When modifying a Redirection Table entry 
‘‘on the fly” the programmer must be aware of state 
kept at other 82489DxXs relative to the interrupt be- 
ing modified. 


DEVICE DRIVERS WITH 82489DX 


It is strongly recommended to read the device status 
registers before servicing the device. This is be- 
cause if an edge triggered device deasserts its inter- 
rupt before interrupt acknowledge cycle (it should 
NOT) 82489DX will NOT give spurious vector. It will 
give genuine interrupt vector corresponding to the 
device. So, interrupt service routine should validate 
the interrupt request before servicing the device. 
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SYSTEM HARDWARE AND 
SOFTWARE DESIGN 
CONSIDERATIONS 


Design Consideration 1 


Description: The following design consideration has 
to be taken care of when using ISP (82357) as exter- 
nal interrupt controller. 82489DX allows connecting 
external 8259 type interrupt controller at one of its 
inputs. The mode associated with the interrupt input 
which has 8259 connected to it is called ExtINTA 
mode. 82489DX allows only EDGE TRIGGERED 
programming option for ExtINTA mode. But in the 
case of 82357, the INT output from ISP stays high in 


case more than one interrupt is pending at its inputs. © 


It does not always inactivate its INT output after 
INTA cycle. This will lead to a situation where ISP 
keeps the interrupt at high level continously and 
waits for INTA cycle. But since 82489DX expects an 
edge for interrupt sensing (for ExtINTA interrupts) it 
does not pass the interrupt to CPU and further inter- 
rupts are lost. So External circuitry should monitor 
the end of SECOND CYCLE of INTA cycle and force 
an inactive state at 82489DX’s input. To avoid glitch- 
es at 82489DxX input, this external logic should clear 
its output only at the end of second INTA cycle. It 
should be set by high going 82357 output. It should 
never be cleared by low going 82357 output. That is 
it should not follow 82357 output. 


Design Consideration 2 


Description: The following design consideration has 
to be taken care of when using 82489DX in EISA 
systems. EISA ISP(82357) chip integrates 8259A. It 
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additionally allows sharing of interrupts. To facilitate | 
this sharing it has a programmable register, ELCR 
(Edge / Level trigger control register) by which cer- 
tain interrupt inputs can be programmed as edge 
(low to high except for RTC) or level (the level is 
active low). The determination of edge or level is 
done during initial configuration of EISA system by. 
reading EISA add in boards from the interrupt de- 
scription data structures. The solution is to have pro- 
grammable logic at the interrupt inputs so that 
82489DxX is compatible with EISA ISP. This will intro- 
duce one more register and logic to support this. 
This should be an 11-bit programmable register and 
an array of ExOR logic (12 ExOR gates or equivalent 
PLD). The ISP allows programmability of the follow- 
ing interrupts. 


INT3 INT4 INT5 INT6 INT7 INT9S INT10 INT11 
INT12 INT14 INT15. In addition to the above 11 in- 
terrupts, it fixes INT8 to be active low edge triggered 
interrupt. INT8 is the only case where it is active low 
edge triggered type. So the following logic can be 
used to add programmability in 82489DX based 
EISA system. Before connecting these 11 interrupt 
lines directly (#INT8 which is from Real Time Clock 
is always active low edge triggered. #INT8 can be 
passed through an inverter since there is no need 
for programmability) to the 82489DX they should 


, pass through an array of 11 Ex__OR gates. One in- 


put of Ex__OR gate connects to the corresponding 
INT pin and other input connects to a bit of program- 
mable register. The output of Ex_OR gate is con-. 
nected to 82489DX. The idea of Ex_OR is to use as © 
a controlled inverter. 


Intin3 «INT INTIN4 

intiIns ~ NTS INTING 

intiIn7 _INTS INTING 

intinto (NTA 
{ 

INTING2 [NT14 INTIN1 4 


INTIN15 
#|NT8 INTS 


INTIN 1 1 
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INTIN are the interrupt inputs to the 82489DX and 
INT are the system interrupt. The Ex__OR gating 
register is programmed after EISA configuration is 
found from add in boards as how these interrupt 
lines are going to be used in that particular configu- 
ration. If a particular input is edge triggered, then the 
corresponding bit in the register is written with 0. If a 
particular input is level triggered, then the corre- 
sponding bit in the register is written with 1. 


8259 by itself does not have polarity control where- 
as 8259 when implemented in EISA chipsets have 
the polarity control. Similarly APIC does not have by 
itself polarity register. So polarity register should be 
programmed as a part of system BIOS and not APIC 
BIOS. 


Design Consideration 3 


loc bus drive is an open drain bus with drive capacity 
of 4 mA only. Since data is transmitted at each Icc 
clock, the “charging” of Icc bus should be fast 
enough to ensure proper logic level at each clock 
edge. The Icc bus needs pull up resistors since it is 
open drain bus. Since the drive is only 4 mA, the pull 
up resistor value can not be less than 5V/4 mA. This 
being the limit of the resistor value, the length and 
the characteristics of the Icc trace forces a capaci- 
tance value. Both the resistor and capacitance 
brings a RC time constant to the Icc bus waveform. 
So, Electrical consideration has to be given to and 
practice of controlled impedence should be exer- 
cised for layout of the Icc bus. The length of the 
trace should be kept as minimum as possible. If the 
length of the Icc bus can’t be kept less, than say 6 
inch, because of mechanical design of the system, 
the external line drivers should be added to Icc bus 


and Icc bus should be simulated with the added driv- — 


er characteristics. 


74(ABT/F)125 


Buffered bus 
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Design Consideration 4 


This is related to ADS#, BGT# and CS# timings. 
For bus cycles not intended for 82489DX, (CS# = 1 
where 82489DX is supposed to sample it), any 
change in CS# line while the ADS# is still active, 
may erroneously cause a RDY# returned from 
82489DX. Anomolous behavior may result if for 
BGT # ties low cases 


a) BGT# goes away just one clock after ADS# or 


b) ADS# is still active, and CS# changes during 
this period. 


For other cases anomolous behavior results if CS # 
changes when ADS# is still active. The following 
considerations are important from timing point of 
view. Always limit the pulse width of 82489DX 
ADS # to one CLKIN. Also avoid changing levels on 
BGT #/CS# line, when ADS# is active for cases 
being identified as BGT # tied low (BGT # sampled 
low when ADS # goes active). Also avoid changing 
levels on CS# line when BGT # is active. 


Design Consideration 5 


82489DX does not recognize the interrupt when an 
edge occurs at the interrupt input pin while interrupt 
is masked. When later it is unmasked there is no 
further edge and so 82489DxX never passes that for- 
gotten edge and that interrupt channel becomes un- 
usable after that. 


The recommendation is that first 82489DX should 
be unmasked and then the device interrupt should 
be enabled in the device register. By this, software 
can ensure that always an edge will occur after an 
interrupt is unmasked. 


Design Consideration 6 


Description: Edge triggered interrupts should not 
deassert their output till they are acknowledged by 
INTA cycle from CPU. 


Issue: 


82489DX employs glitch detection logic for edge 
triggered logic. To make sure the detected edge in- 
terrupt is not a glitch, 82489DX samples the input 
again before sending the interrupt message. The 
time difference between the first sampling of inter- 
rupt to be active and second sampling (just before 
sending the interrupt) is not a constant number. This 
is because the ICC bus might have been occupied 
by other messages. So, for example if during first 
sampling it was detected that INTINO and INTIN15 
are both active and after sending INTINO it samples 
INTIN15 again before sending message for 
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INTIN15. But between this time ICC bus might have 
been occupied by other messages. So even if an 
edge triggered interrupt is held active high for a real- 
ly long time and then brought low before INTA cycle, 
it is considered as a glitch. Because it may happen 
that the second sampling occurred just when the in- 
terrupt line got low. 


Once the glitch detection circuitry found this 
“glitch”, it goes back to the state where it will start 
sampling and waiting for an active edge to occur. 
This takes more than one clock cycle (CLK) and if 
the ‘‘glitch interrupt’ generates an edge before that 
time after the secorid sampling of low level is done, 
then the edge is lost forever. 


8259INT 


BCLK is the standard EISA BCLK. 
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_ Since the time when the second sampling is done is 


unknown, the best way is to make sure the edge 


triggered interrupts do not deassert their outputs till 


they are acknowledged by INTA cycle from CPU. It 
is found that in some cases 8259 can generate brief 
active low pulses on its output. So the glue logic 
between 8259 and 82489DxX input pin should make 
sure that 82489DxX input pin is clear only after get- 
ting second interrupt acknowledge cycle. The glue 
logic should not just follow the 8259 output. Put in 
other words, after interrupt acknowledge cycle to 
8259, if the 8259 input is seen active high, it should 
generate an edge at 82489DxX input. Moreover, even 


if 8259 output goes low the glue logic should not 


lower its output since the only time when the glue 
logic can deassert its output is when it finds an inter- 
rupt acknowledge cycle for 8259. The following PLD 
equations and schematics serves as an example for 
the glue logic between 8259 and 82489DX. 


INTA cet 
8259Cyc 

set 8259Cyc 
setTimer 

TO 

T1 

T2. 

T3 

T4 

T5 


APIC Input 
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APIC input = /TO */Reset* /APIC input" 8259INT * INTA gg; ; Sample 8259 interrupt 
+/TO* /Reset * APICinput * INTA cot ; Hold till it is cleared by delayed interrupt acknowledge 
+/INTAgoy * 8259INT 


Set 8259Cyc = /Reset* INTA* ExtINTA _ ; This INTA cycle is for 8259 


8259Cyc = set8259Cyc* /TO* /Reset ; Set 8259cyc will set 8259 cycle and TO will clear it 


+ /TO* /Set8259Cyc * 8259Cyc * /Reset ; Hold 8259 cycle till TO clears it 


INTAse¢ = /Reset * /INTA,,, * INTA ; wait for very first INTA cycle after reset 


+ /Reset * INTA 


set > once first INTA cycle after Reset is found, set the INTA,.. 


Set timer =8259cyc*/A2*/RDYlow _ ; Start the timer at end of second INTA cycle 


TO = Settimer*/T5 * /Reset ; Set timer will set TO and T5 will clear 


+ /Set timer” TO * /T5 * /Reset ; Till T5 clears it hold TO | 
T1 := TO* /Reset* /T5 ; Follow TO after one clock for setting but clear along with TO 


T2 :=1T1* /Reset*/TS_ ; Follow TO after two clock for setting but clear along with TO 
T3 := T2* /Reset*/TS ; Follow TO after three clock for setting but clear along with TO 
T4 := T3*/Reset”/TS_ ; Follow TO after four clock isi setting but clear along with TO 
TS := T4* /Reset ; Follow TO after 5 clock for setting 


290446-46 
NOTES: 
T1, T2, T3, T4 and T5 are clocked signals and others are combinatorials. 
This circuit and PLD equations are given for concept clarification purpose. They are not tested. 
INTAget is needed so that some 8259 logic at power on activates its INT output to 1 and it deactivates its output after 
only 8259 initialization (which should happen after APIC initialization) and since APIC needs to detect rising edge at 
8259, it is essential to follow the 8259 until first interrupt. This is the only occasion 8259 output will be just followed. 


DIRECTIONS FOR EASY MIGRATION Note that the names APIC BIOS and APIC DRIVER 


TO FUTURE INTEGRATED APIC are interchangeably used in the following discussion. 
Different Operating systems refer such functional 
The following are the software programming direc- | Module differently. 


tions Intel strongly recommends for easy migration 
from 82489DxX to integrated APIC. The audience to 
this portion of the document are both hardware de- 


signers and firmware developers for APIC based = Question: The logical destination register in future 
systems. In the following discussions, the APIC — implemented APIC may have only 8 MSBs defined 


BIOS is viewed functionally as two subsections 1) —_ and 82489DX has 32 bits specified. Will this hinder 
APIC BIOS which are all interrupt vector, priority, in- binary level compatibility? 


terrupt distribution related functions and the remain- 
ing portion of BIOS which is referred to part of sys- 
tem BIOS which is responsible for interrupt polarity 
programming, starting next processor, etc. _ 


Consideration 1 
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82489DX 


Response: In logical destination (flat addressing 
-mode) 82489DX can go up to 32 CPUs whereas fu- 
ture APIC can go up to 8 CPUs with flat logical ad- 
dressing mode. For binary compatibility, it is 
strongly recommended that 82489DX software 
use ONLY 8 MSB of logical destination register. 


Consideration 2 | 


Question: The present day MP systems with exter- 
nal control ports for starting next processor may pro- 
gram those external control ports for starting next 
processor. APIC DRIVER may use external control 
ports for starting next processor. In future implemen- 
tations of APIC, the starting of next processors may 
use more refined mechanisms which may not use 
external control ports. Will this introduce compatibil- 
ity problem? 


Response: Again, the starting of next processor is 
really part of MP system DRIVER and depending of 
the mechanism used to start next processor it will 
vary. In future implementation if starting next proces- 
sor is done using new mechanisms, the starting next 
processor portion of MP DRIVER will be changed 
accordingly. Even though this will not result in any 
change in the APIC DRIVER which deals with inter- 
rupt priority, distribution, etc., the corresponding 
change will be needed in the starting ee 
processors peer of DRIVER. 


One possible method of implementing software is 
using version register. Version register is different in 
489DX and future implementations of APIC. Taking 
care of these differences, such as mechanism for 
starting next processor, should be possible using 
version register. 


_ Consideration 3 


Question: APIC architecture, by its nature, seems to 
misinterpret spurious interrupts as genuine inter- 
rupts. That is, if an edge triggered interrupt goes in- 
active before interrupt acknowledge cycle, APIC, in- 


stead of giving spurious interrupt vector, gives genu-. 


ine interrupt vector. Is it true that this is not the case 
with 8259? If that is the case, drivers which do not 
check device status registers for servicing the de- 
vice may work with 8259 but may not work with 
APIC. Is this a compatibility problem? 


Response: No, this is not true. Even with 8259 there 
is a time window in which a similar thing can happen. 
For example if interrupt goes inactive just after first 
INTA cycle but before second INTA cycle 8259 will 
also signal this spurious interrupt as genuine inter- 
rupt. So drivers which do not check device status 
registers may also fail with 8259. 
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Our strong recommendation to device drivers is to 
read device status register before servicing the de- 
vice. If the device status register indicates that there | 
is no valid source of the interrupt, the service routine 
should just issue EOI and return. It should not serv- 
ice the device. This should take care of the new driv- 
ers that will be written for APIC. To coexist with 
8259, the APIC interrupt input connected to 8259 will 
be programmed for virtual wire mode. In virtual wire 
mode, the time window of 8259 will apply. So the 
driver will behave same way as it was behaving with 


— 8259. 


Consideration 4 


Question: EISA system has active low level polarity. 
82489DxX itself does not have polarity control regis- 
ter to support this EISA feature. Implementations us- . 
ing external polarity register may implement the po- 
larity register at different address. Will this introduce 
a problem for achieving the goal of single binary? 


Response: 8259 by itself does not have polarity 
control whereas 8259 when implemented in EISA 
chipset has the polarity control. Similarly APIC does 
not have by itself polarity register. When implement- 
ed in ESC chipset, it will have polarity control regis- 
ter. So polarity register should be programmed as a 
part of EISA BIOS and not APIC BIOS. Since system 
BIOS or EISA BIOS should be able to take charge of 
changes, if any, to polarity control register. APIC 
BIOS should not be affected by differences in the 
address for polarity register. 


Consideration 5 


Question: 8259 recognizes the interrupt when an 
edge occurs at the interrupt input pin even. if the 
interrupt was masked. So when the interrupt input is 
later unmasked, the interrupt is posted to the CPU. 
82489DX does not register this edge and if interrupt 
happens when the interrupt is masked 82489DxX just 
ignores the interrupt. When later it is unmasked 
there is no further edge and so 82489DX never 
passes that forgotten edge and that interrupt chan- 
nel becomes unusable after that. 


Response: When the interrupt is masked, logically 
interrupt controller should ignore whatever happens 
there. It is strongly recommended that first 82489DX 
should be unmasked and then the device interrupt 
should be enabled. By this sequence, software can 
ensure that always an edge will occur at the APIC 
input only after the interrupt is unmasked. . 


Please contact Intel for platform level specification 
in Multiprocessor system design using APIC. 
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UPI-41AH/42AH 
UNIVERSAL PERIPHERAL INTERFACE 


8-BIT SLAVE MICROCONTROLLER 


m UPI-41: 6 MHz; UPI-42: 12.5 MHz m@ Fully Compatible with all Intel and Most 
m Pin, Software and Architecturally Other Microprocessor Families 
Compatible with all UPI-41 and UPI-42 m Interchangeable ROM and OTP EPROM 
Products Versions 
m@ 8-Bit CPU plus ROM/OTP EPROM, RAM, m Expandable |/O 
1/0, Timer/Counter and Clock in a m Sync Mode Available 
Single Package Adcians aseisindieie 
m 2048 x 8 ROM/OTP, 256 x 8 RAM on Be ee er ee ee 
UPI-42, 1024 x 8 ROM/OTP, 128 x 8 m Available in EXPRESS 
RAM on UPI-41, 8-Bit Timer/Counter, 18 — Standard Temperature Range 
Programmable I/O Pins mw inteligent Programming Algorithm 
m One 8-Bit Status and Two Data _-— Fast OTP Programming | 
Registers for Asynchronous Slave-to- m Available in 40-Lead Plastic and 44- 


Master Interface 


m= DMA, Interrupt, or Polled Operation 
Supported 


Lead Plastic Leaded Chip Carrier 
Packages 
(See Packaging Spec., Order #240800-001) 


Package Type P and N 


The Intel UPI-41AH and UPI-42AH are general-purpose Universal Peripheral Interfaces that allow the designer 
to develop customized solutions for peripheral device control. 


They are essentially ‘‘slave’”’ microcontrollers, or microcontrollers with a slave interface included on the chip. 
Interface registers are included to enable the UPI device to function as a slave peripheral controller in the MCS 
Modules and iAPX family, as well as other 8-, 16-, and 32-bit systems. 


To allow full user flexibility, the program memory is available in ROM and One-Time Programmable EPROM 
(OTP). All UPI-41AH and UPI-42AH devices are fully pin compatible for easy transition from prototype to 
production level designs. 
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Figure 1. DIP Pin Configuration 
210393-3 


Figure 2. PLCC Pin Configuration 


The complete document for this product is available on Intel’s “Data-on-Demand”’ CD-ROM product. Contact 
your local Intel field sales office, Intel technical distributor, or call 1-800-548-4725. 
November 1994 
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— UPI-C42/UPI-L42 
_ UNIVERSAL PERIPHERAL INTERFACE 
CHMOS 8-BIT SLAVE MICROCONTROLLER 


| Pin, Software and Architecturally m One 8-Bit Status and Two Data 
Compatible with all UPI-41 and UPI-42 Registers for Asynchronous Slave-to- 
Products Master Interface 

m Low Voltage Operation with the UPI- m@ Fully Compatible with all Intel and Most 
L42 Other Microprocessor Families 


— Full 3.3V Support 


m Interchangeable ROM and OTP EPROM 

m Integrated Auto A20 Gate Support. Versions 

m Suspend Power Down Mode | m Expandable 1/O 

m Security Bit Code Protection Support m Sync Mode Available 

m 8-Bit CPU plus ROM/OTP EPROM, RAM, m Over 90 Instructions: 70% Single Byte 
1/0, Timer/Counter and Clock in a m Quick Pulse Programming Algorithm 
Single Package — Fast OTP Programming 

m 4096 x 8 ROM/OTP, 256 x 8 RAM 8-Bit m Available in 40-Lead Plastic, 44-Lead 
Timer/Counter, 18 Programmable 1/O Plastic Leaded Chip Carrier, and © 
Pins 44-Lead Quad Flat Pack Packages 

w DMA, Interrupt, or Polied Operation (See Packaging Spec., Order #240800, Pane Type P, N, 
Supported ane) 


The UPI-C42 is an enhanced CHMOS version of the industry standard Intel UPI-42 family. It is fabricated on 
Intel’s CHMOS III-E: process. The UPI-C42 is pin, software, and architecturally compatible with the NMOS UPI 
family. The UPI-C42 has all of the same features of the NMOS family plus a larger user programmable memory 
array (4K), integrated auto A20 gate support, and lower power consumption inherent to a CHMOS product. 


The UPI-L42 offers the same functionality and socket compatibility as the UPI-C42 as well as providing low 
voltage 3.3V operation. . 


The UPI-C42 is essentially a ‘‘slave”’ microcontroller, or a microcontroller with a slave interface included on the 
chip. Interface registers are included to enable the UPI device to function as a slave peripheral controller in the 
MCS Modules and iAPX family, as well as other 8-, 16-, and 32-bit systems. | 


To allow full user flexibility, the program memory is available in ROM and One-Time Programmable EPROM 
(OTP). | 
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Figure 1. DIP Pin 290414-2 (290414-3 
Configuration Figure 2. PLCC Pin Configuration Figure 3. QFP Pin Configuration 


_ The sora document for this product is available on Intel’s “Data-on-Demand” CD-ROM product. Contact 
_ your local Intel field sales office, Intel technical distributor, or call 1-800-548-4725. 


October 1994 
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8XC51SL/LOW VOLTAGE 8XC51SL 
KEYBOARD CONTROLLER 


80C51SL — CPU with RAM and 1/0; Vcc = 5V +10% 


81C51SL — 16K ROM Preprogrammed with SystemSoft Keyboard Controller and Scanner 
Firmware. Vcc = 5V +10%. 


83C51SL — 16K Factory Programmed ROM. Vcc = 5V +10%. 
87C51SL — 16K OTP ROM. Vcc = 5V + 10%. 
Low Voltage 80C51SL—CPU with RAM and I/0; Vcc = 3.3V +0.3V 


Low Voltage 81C51SL— 16K ROM Preprogrammed with SystemSoft Keyboard Controller 
and Scanner Firmware. Vcc = 3.3V +0.3V. 


Low Voltage 83C51SL— 16K Factory Programmed ROM. Vcc = 3.3V +0.3V. 
Low Voltage 87C51SL— 16K OTP ROM. Vcc = 3.3V +0.3V. 


m Proliferation of 8051 Architecture m 4-Channel, 8-Bit A/D | 
m Complete 8042 Keyboard Control m Interface for up to 32 Kbytes of 
Functionality External Memory 
m 8042 Style Host interface m Slew Rate Controlled !/O Buffers Used 
m Optional Hardware Speedup of to Minimize Noise 
GATEA20 and RCL m 256 Bytes Data RAM 
m Local 16 x 8 Keyboard Switch Matrix m Three Multifunction |/O Ports 
Support - @ 10 Interrupt Sources with 6 User- 4 7 
m Two Industry Standard Serial Keyboard Definable External Interrupts 
Interfaces; Supported via Four High 2 MHz-16 MHz Clock Frequency 
Drive Outputs 
5 LED Dri m@ 100-Pin PQFP (8XC51SL) 
7 Evers 100-Pin SQFP (Low Voltage 8XC51SL) 


m Low Power CHMOS Technology 


The 8XC51SL, based on Intel’s industry-standard MCS® 51 microcontroller family, is designed for keyboard 
control in laptop and notebook PCs. The highly integrated keyboard controller incorporates an 8042-style UPI 
host interface with expanded memory, keyboard scan, and power management. The 8XC51SL supports both 
serial and scanned keyboard interfaces and is available in pre-programmed versions to reduce time to market. 
The Low Voltage 8XC51SL is the 3.3V version optimized for even further power savings. Throughout the 
remainder of this document, both devices will generally be referred to as 51SL. 


The 8XC51SL is a pin-for-pin compatible replacement for the 8XC51SL-BG. It does, however have some 
additional functionality. Those additional functions are as follows: 


1. 16K OTP ROM: The 8XC51SL-BG had only 8K of ROM. 


2. New Register Set: The 8XC51SL adds a second set of host interface registers available for use in support- 
ing power management. This required an additional address line (A1) for decoding. To accommodate this, 
one Vcc pin was removed. However, in order to maintain compatibility with the -BG version, an enable bit 
for this new register set was added in configuration register 1. This allows the 8XC51SL to be drop in 
compatible to existing 8XC51SL-BG designs; no software modifications required. 


NOTE: 
The changes made to the Vcc pins require that all three Vcc pins be properly connected. Failing to do so 
could result in high leakage current and possible damage to the device. 


The complete document for this product is available on Intel’s “Data-on-Demand” CD-ROM product. Contact 
your local Intel field sales office, Intel technical distributor, or call 1-800-548-4725. 

November 1994 
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INTRODUCTION 


The 89C124FX Data/Fax Modem Chip Set Applica- 
tion Note provides the end user with applications and 
layout guidelines to reduce power consumption to a 
minimum when in the Power Down Mode. 


GENERAL DESCRIPTION 


When the 89C124FX is in the power down mode, the 
microcontroller (89C126FX) consumes very little pow- 
er (less than 0.5 mW). However, the external memory, 
voltage regulators and peripherals draw excess current 
that makes the overall system power consumption more 
than 80 mW. This application note describes a method 
to reduce overall system power consumption to less 
than 1 mW when the 89C124FX is in the power down 
mode. Adding a power down feature in the microcon- 
troller and reducing power sink to a minimum accom- 
plishes this goal. 


Three steps are required to reduce the overall system 
power consumption in the power down mode: 


1. Detect power down in the microcontroller and 
isolate the power source from potential current 
sink from other components. 


2. Inhibit the DC path from the power supply, 
through other components, to ground when the 
microcontroller is powered down. 


3. Solve current drift problems due to floating inputs 
when power is removed from peripherals. 


POWER DOWN DETECTION 


Monitoring the clock output (CLKOUT, pin 65) from 
the microcontroller detects the power down condition. 
The CLKOUT pin is held high when the controller is 
in the power down mode. When power down is detect- 
ed, the detector shuts off the +5V supply to all compo- 
nents except the microcontroller, RAM, and logic 
gates. The detector also shuts off power to the voltage 
regulators. 
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BLOCKING DC PATH 


When the power down detector shuts off the bipolar 
switches, some of the device input pins become current 
sinks and drain current from the controller output pins 
when the output pins are high. These controller output 
pins are the CLKOUT and SCLK outputs. To elimi- 


_ nate these DC paths in the power down mode, add an 


inverter from the controller clock output (CLKOUT) 
to the AFE clock input (CLKIN) and use an AND 
gate to change the SCLK output to the AFE to a low. 


CURRENT DRIFT 


_ When the controller is in the power down mode, the 


SDATA pin becomes a floating input that can draw 
current in excess of 300 wA. Placing a 510 KO resistor 
between SDATA pin and ground solves this problem. 
Using a 100 KO resistor or lower may impede circuit 
functionality. 


/ 890124FX —— 


89C 126FX 


SDATA 


292101-1 


Figure 1. Placement of Current Drain 
Blocking Resistor 


DESIGN TRADE OFF 


The reduced power drain feature design trade-off, be- 
sides adding circuit complexity, is an additional 20 mW 
power consumption when the 89C124FX is active. 
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Figure 2. 89C124FX Power Consumption Circuit Modifications 
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INTRODUCTION 


The evolution of the floppy has been marked in little 
over a decade by a significant increase in capacity ac- 
companied by a noticeable decrease in the form factor 
from the early 8 inch floppy disks to the present day 3.5 
inch floppy disks. This decade will also be remarkable 
as OEMs adopt “Super” dense floppies. 


The most commonly seen floppies today are invariably 
one of the form factors — the 5.25” or the 3.5". Each 
form factor has several associated capacity ranges. The 
5.25” floppies available are: 180 KB (single density), 
360 KB (double density) and 1.2 MB (high density). 
The 3.5” floppies available are: 720 KB (double densi- 
ty) and 1.44 MB (high density). The emerging super 
dense floppies will evolve on the installed base of 3.5” 
floppies. The latest member of this set is the 2.88 MB 
(extra density) floppy, pioneered by Toshiba. The cor- 
nerstone of market acceptance of newer drives is com- 
patibility to the older family. The 2.88 MB (formatted) 
floppy drive allows the user to format, read from and 
write to the lower density diskettes. 


As programs and data files get bigger, the demand for 
higher capacity floppies becomes obvious. There are 
several 3.5" higher density drives available from vari- 
ous vendors with capacities well into the 20 MB range. 
NEC has introduced a 13 MB drive and companies 
such as Insite have introduced 20 MB drives. Both 
drives require servo-mechanisms to accurately position 
the head over the right track. NEC’s drive has the stan- 
dard floppy drive interface whereas Insite’s interface is 
SCSI based. The market for these floppy drives will 
remain a niche unless they receive more OEM support. 


Initiated by Toshiba’s research and innovation of the 


higher density 4 MB floppy disk media, the market is 
headed towards the super dense floppy drive. After 
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IBM’s endorsement of the 4 MB (unformatted) floppy 
disk drives on their PS/2 model 57 and PS/2 model 90, 
several OEMs have shown a growing interest in “‘su- 
per” dense floppy disk drives. The latest DOS 5.0 sup- 
ports the new 4 MB floppy media and BIOS vendors 


like Pheonix, AMI, Award, Quadtel, System Soft, and 
Microid all support the newer 4 MB floppy media. 


PURPOSE 


An important consideration to implement the 4 MB 


floppy drive is the floppy disk controller. Intel’s highly _ 


integrated floppy disk controller, 82077AA/SL, has led 
the market in supporting the 4 MB floppy drive. Two — 
ingredients are necessary to fully support these drives: 
1 Mbps transfer rate and the perpendicular recording 
mode. This paper deals with a discussion of what the 
perpendicular mode is and how can a 4 MB floppy disk 
drive be implemented in a system using the 
82077AA/SL. 


PERPENDICULAR RECORDING MODE 


Toshiba has taken the 2 MB floppy and doubled the 
storage capacity by doubling the number of bits per 
track. Toshiba achieved this by an innovative magnetic 
recording mode, called the vertical or the perpendicular 
recording mode. This mode utilizes magnetization per- 
pendicular to the recording medium plane. This is in 
contrast to the current mode of longitudinal recording 
which uses the magnetization parallel to the recording 
plane. By making the bits stand vertical as opposed to 
on their side, recording density is effectively doubled, 
Figure 1. The new perpendicular mode of recording not 
only produces sharp magnetization transitions neces- 
sary at higher recording densities, but is also more sta- 
ble. 


in ® 


The 4 MB disks utilize barium ferrite coated substrates 
to achieve perpendicular mode of magnetization. Cur- 
rent disks use cobalt iron oxide (Co-g-Fe203) coating 
for longitudinal recording. The barium ferrite ensures 
good head to medium contact, stable output and dura- 
bility in terms of long use. High coercivity is required 
to attain high recording density for a longitudinal re- 
cording medium (coercivity specification of a disk re- 
fers to the magnetic field strength required to make an 
accurate record on the disk). A conventional head 
could not be used in this case; however, the barium 
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ferrite disk has low coercivity and the conventional fer- 
rite head can be used. The new combination heads in- 
clude a pre-erase mechanism, i.e., the ferrite ring heads 
containing erase elements followed by the read/write 
head. These erase elements have deep overwrite pene- 
tration and ensure complete erasure for writing new 
data. The distance between the erase elements and the 
read/write head is about 200mm. This distance is im- 
portant from the floppy disk controller point of view 
and will be discussed in later sections. 


PERPENDICULAR RECORDING 


MAGNETIZATION 


MAGNETIC LAYER 


LONGITUDINAL RECORDING 


se SUBSTRATE 
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Figure 1. Perpendicular vs Longitudinal Recording 
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PERPENDICULAR DRIVE FORMAT 
AND SPECIFICATION 


Figures 2a and 2b show the IBM drive format for both 
double density and perpendicular modes of recording. 
The main difference in recording format is the length of 
Gap2 between the ID field and the Data field. The 
main reason for the increased Gap2 length is the pre- 
erase head preceding the read/write head on the newer 
4 MB floppy drives. The size of the data field is main- 
tained at 512 KBytes standard. The increase in the ca- 
pacity is implemented by increasing the number of sec- 
tors from 18 to 36. Table 1 shows the specifications of 
the various capacity 3.5” drives. 
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PERPENDICULAR MODE COMMAND 


The current 82077AA/SL parts contain the “en- 
hanced” perpendicular mode command as shown in 
Figure 3. This is a two byte command with the first 
byte being the command code (0x12H). The 2nd byte 
contains the parameters required to enable perpendicu- 
lar mode recording. The former command (in the older 
82077 parts) included only the WGATE and GAP bits. 
This command is compatible to the older mode where 
only the two LSBs are written. The enhanced mode 
allows system designers to designate specific drives as 
perpendicular recording drives. The second byte will be 
referenced as the PR[0:7] byte for ease of discussion. 
The following discusses the use of the enhanced perpen- 
dicular recording mode. 
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Figure 2b. Perpendicular 4 MB Format (MFM) 
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PERPENDICULAR MODE COMMAND 
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Command Code 
D1 DO GAP WGATE|PR. 


Figure 3. Perpendicular Mode Command 
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Table 1. Specifications of FDDs 


wines | svete | res | tw | eit 
in > aon Different Kinds of FDDs. 360 KB 720 KB 1 aie MB 2: 2 MB 
Number of | Number of Cylinders 

saa A SO A 


Rotation Speed (rpm) 300 
a 


Track "aa (tpi) | 135 | 


| Recording Density (bp) Density (bpi) —_ 2 8717 a — 


Data Transfer Rate (Mbps) 0.25 
aa 30 


| Gap Length for Read/Write | Length for Read/Write 


Gap Length for Format | es ae a ae 
Density Notation DD/DS HD/DS DD/DS HD/DS ED/DS 


The following describes the various functions of the unique requirements of the 4 MB floppy drives by GAP 
programmed bits in the PR: and WGATE bits of the PR. Table 2 describes the ef- 
OW If this bit is not set high, all PR[2:5] are fects of the GAP and WGATE bits for the perpendicu- 
lar command. 


ignored. In other words, if OW = 0, only 
GAP and WGATE are considered. In order 
to select a drive as perpendicular, it is neces- 


sary to set OW = 1 and select the Dn bit. 82077AA/SL’s PERPENDICULAR 


Dn This refers to the drive specification bits and MODE SUPPORT 


corresponds to PR[2:5]. These bits are con- ——-The 82077AA and 82077SL both support 4 MB record- 
sidered only if OW = 1. During the READ/ ing mode. The 82077SL has power management fea- 
WRITE/FORMAT command, the drive se- tures included as well. Both AA and SL product lines 
lected in these commands is compared to have three versions each out of which two of the ver- 
Dn. If the bits match then perpendicular sions support the 4 MB floppy drives. The 82077AA-1, 
mode will be enabled for that drive. For ex- 399774. A, 82077SL, and 82077SL-1 all support the 
ample, if DO is set then drive 0 will be con- 4 MB floppy drives. A single command puts the 


figured for perpendicular mode. 82077AA/SL into the perpendicular mode. This mode 
GAP This alters the Gap2 length as required by also requires the data rate to be set at 1 Mbps. The 
the perpendicular mode format. FIFO that is unique to Intel’s 82077AA/SL parts may 
WGATE Write gate alters timing of WE to allow for become necessary to remove the host interface bottle- 


neck due to the higher data rate. The 4 MB floppy disk 
drives are downward compatible to 1 MB and 2 MB 
The VCOEN timing and the length of the Gap2 field floppy diskettes. The following discussion explains the 


(explained above) can be altered to accommodate the implications of the new 4 MB combination head and 
the functionality of the perpendicular mode command. 


pre-erase loads in perpendicular drives. 
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Perpendicular 


The implementation of 4 MB drives requires under- 
standing the Gap2 (see Figures 2a and 2b) and VCO 
timing requirements unique to these drives. These new 
requirements are dictated by the design of the “‘combi- 
nation head” in these drives. Rewriting of disks in the 
4 MB drives requires a pre-erase gap to erase the mag- 
netic flux on the disk preceding the writing by the 
read/write gap: The read/write gap in the 4 MB drive 
does not have sufficient penetration (as shown in Figure 
4a) to overwrite the existing data. In the conventional 
drives, the read/write gap had sufficient depth and 
could effectively overwrite the older data as depicted in 
Figure 4b. It must be noted that it is necessary to write 


| RD/WR GAP 
COMBINATION HEAD 
| ID FIELD 


Table 2. Effects of GAP and WGATE Bits 


Portion 
VCO Low of Gap2 pp ae 
Time after Written by for Read 
Index Pulse | Write Data of 
; Operations 
Operation 


so byies | 22 te 24 Byes 


Perpendicular 33 Bytes 22 Bytes 19 Bytes 24 Bytes 
(Data Rate = 500 kbps) 


so byes | 22 Byes 24 byte 
33 Bytes | 41 Bytes | 38 Bytes 43 Bytes 


the conventional 2 MB media in the 4 MB drive at 
500 Kbps perpendicular mode. This ensures proper era- 
sure of existing data and reliable write of the new data. 
The pre-erase gap in the 4 MB floppy drives is activated 
only during format and write commands. Both the pre- 
erase gap and read/write gap are activated at the same 
time. 


As shown in Figure 4a, the pre-erase gap precedes the 
read/write gap by 200mm. This distance translated to 
bytes is about 38 bytes at a data rate of 1 Mbps and 19 
bytes at 500 Kbps. Whenever the read/write gap is en- 
abled by the Write Gate signal the pre-erase gap is acti- 
vated at the same time. 


PRE-ERASE GAP 


20.0.80 20 = a8 0 ererceescececane 


VILLI LLL, 
Pe DATA FIELD 
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Figure 4a. Head Design for the 4 MB Perpendicular Mode 


020.0.0,0,0,0,0,0,0,2,0,0, 0.9.9.0, 0.0, 


CONVENTIONAL HEAD = : 
A A A 22 2 22 : 
se GAP2 = 22 bytes of 4E * 


RD/WR GAP 


CLL LLL LL 
a DATA FIELD 
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Figure 4b. Head Design for the Conventional 2 MB Mode 
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In conventional drives, the Write Gate is asserted at the 
beginning of the sync field, i.e., when the read/write is 
at the beginning of the data field. The controller then 
writes the new sync field, data address mark, data field 
and CRC (see Figure 2a). With the combination head, 
the read/write gap must be activated in the Gap2 field 
to ensure proper write of the new sync field. To accom- 
modate both the distance between the pre-erase gap and 
read/write gap and the head activation and deactiva- 
tion time, the Gap2 field is expanded to a length of 41 
bytes at 1 Mbps (see Figure 2b). Since the bit density is 
proportional to the data rate, 19 bytes will be written in 
the Gap? field at 500 Kbps data rate in the perpendicu- 
lar mode. 


On the read back by the 82077AA/SL, the controller 
must begin the synchronization at the beginning of the 
sync field. For conventional mode, the internal PLL 
VCO is enabled (VCOEN) approximately 24 bytes 
from the start of the Gap2 field. However, at 1 Mbps 
perpendicular mode the VCOEN goes active after 43 
bytes to accomodate the increased Gap? field size. For 
each case, a 2 byte cushion is maintained from the be- 
ginning of the sync field to avoid write splices caused 
by motor speed variation. 


It should be noted that none of the alterations in Gap2 
size, VCO timing or Write Gate timing affect the nor- 
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mal program flow. Once the perpendicular command is 
invoked, 82077AA/SL behaviour from the user stand- 
point is unchanged. 


PROGRAMMING PERPENDICULAR 
MODE 


Figures 5a and 5b show a flowchart on how the perpen- 
dicular recording mode is implemented on _ the 
82077AA/SL. The perpendicular mode command can 
be issued during initialization. As shown in Figure 5a 
the perpendicular command stores the PR value inter- 
nally. This value is used during the data transfer com- 
mands for configuration in order to deal with the per- 
pendicular drives. Table 2 shows how the Gap2 length, 
VCOEN timing or Write Gate timing is affected. The 
OW bit is also tested for in this part of the loop. The 
enhanced perpendicular mode is enabled by setting the 
OW = 1, setting the Dn bits corresponding to the in- 
stalled perpendicular drive high and leaving PR[0:1] = 
“00”. 


As shown in Figure 5b, the Gap2 length is initially set 
to the conventional length of 22 bytes. Next the 
PR[0:1] bits (GAP, WGATE) are checked if they are 


set to ‘00’. If the PR[0:1] bits are set to ‘10’ then, per- 


pendicular mode is disabled and conventional mode is 
retained. If the PR[0:1] = ‘01’ or ‘11’ the VCOEN is 


GET THE FIRST BYTE 
FROM FIFO 


N 
TEST FOR NEXTH, - 
Oe 


FIFO -PR[0:7] 
SAVE PR[0:7] 
INTERNALLY 
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Figure 5a. Perpendicular Command Handling 
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During RD/WR/FRMT 

command execution 

Set Gap2 to std. 22 
bytes 


PR[O:1] ='00'?? 


PR[O:1] ='1'?? 


Get drive information as 
programmed in the current 
read/write/format command 


SET WRITE GAP 


Compare drive indicated 
in the current command 
to bits in PR[0:5] 


PR[O:1] ='11'?? 


Do the bits 
match ?? 


DISABLE PRECOMP 
SET WRITE GAP BIT 


DRATE = 1 Mbps?? 


Y 


SET GAP2 TO 41 BYTES 


CONTINUE WITH 
RD/WR/FRMT/ETC. CMD 


Figure 5b. During Data Transfer Commands 
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set to activate 43 bytes or 24 bytes from the start of the 
Gap? field, depending on the value as shown in Table 2. 
After this, PR[0:1] = ‘11’ is checked; if not true (pro- 
grammed ‘01’) the program is exited with only the 
VCOEN timing being set for perpendicular mode. If 
true, however, the Gap2 length is set up for perpendic- 
ular mode (note: this is done independent of the data 
rate). It must be noted that if the PR[0:1] bits are set to 
‘11’ then it is up to the user to disable precompensation 
before accessing perpendicular drives. The other branch 
of the flowchart refers to setting of PR[0:1] to ‘00’. In 
this case, the perpendicular command will have the fol- 
lowing effect: 


1. If any of the Dn bits in PR[2:5] programmed high, 
then precompensation is automatically disabled 
(0 ns is selected for the specified drive regardless of 
the data rate) and VCOEN is set to activate appro- 
priately. All the bits that are set low will enable the 
82077 to be configured for conventional mode, i.e., 
exit the program without modifications (shown Fig- 
ure 5b). 


2. Next the data rate is checked for 1 Mbps. If the data 
rate is at 1 Mbps, then Gap2 length is set to 41 
bytes, otherwise, the program is exited without set- 
ting up the Gap2 to 41 bytes. 


It must be noted that if PR[2:5] are to be recognized in 
the command the OW bit must be set high. If this bit is 
low, setting of Dn bits will have no effect. Setting the 
OW bit will enable the storage of the Dn bit. Also set- 
ting PR[0:1] to any other value than ‘00’ will override 
anything written in the Dn bits. In other words, setting 
PR[O0:1] to a value other than ‘00’ enables the effect of 
that for all drives. It must be noted that if PR[0:1] bits 
are set to a value other than ‘00’ then it is recommend- 
ed not to use the enhanced command mode, Le., all 
other bits should be zero. Consider the following exam- 
ples: 


a. PR[0:7] = 0x84; This is the way to use the com- 
mand in the enhanced mode. In this case, the OW 
= 1 and DO is set high. During the data transfer 
command, if DO is selected it will be automatically 
configured for perpendicular mode. If D1 is ac- 
cessed, however, it will be configured for conven- 
tional mode. Similarly, if PR[0:7] = 0x88 then D1 
is configured for perpendicular mode and D0 is con- 
figured for conventional mode. Software resets do 
not clear this mode. 
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b. PR[0:7] = 0x03; This is the way to use the com- 
mand in the old mode. If the user decides to use this 
mode, then it must be noted that the command has 
to be issued before every data transfer command. 
Also when used this way, all the drives are config- 
ured for perpendicular mode. The user must also 
remember to disable precompensation and set the 
data rate to 1 Mbps while accessing the perpendicu- 
lar drive in the system. Any software reset clears the 
command. 


c. PR[0:7] = 0x87; In this case, the OW = 1, DO = 1 
and PR[0:1] = 11. This may be called a mixed 
mode and should be refrained from usage. This is 
similar to setting PR[0:7] = 0x03, because setting 
PR[0:1] high overrides automatic configuration. In 
this case the user has to be aware that precompensa- 
tion must be disabled and the data rate must be set 
to 1 Mbps while accessing drive 0. After software 
reset, bits GAP and WGATE will be cleared, but 
OW and DO will retain their previously set values. 
In other words, after software reset, the part will see 
PR[0:7] = 0x84. Evidently, this would cause prob- 
lems and, therefore, it is recommended this mode 
not be used. 


d. PR[0:7] = 0x80; In this case, the OW = 1, Dn = 
O and PR[0:1] = 00. This has the effect of clearing 
the perpendicular mode command without doing a 
hardware reset. Another way to do this would be to 
set PR[0:7] = 0x02; this can then be used to tempo- 
rarily disable perpendicular mode configuration 
without affecting the previously programmed Dn 
values. Software reset following this will reenable 
the previously programmed enhanced mode com- 
mand. 


Using the enhanced perpendicular command removes 
the requirement of issuing the perpendicular command 
for each data transfer command and manually setting 
the perpendicular configuration. 


“Software” RESETs (via DOR or DSR registers) will 
only clear the PR[0:1] values to ‘0’. Dn bits will retain 
their previously programmed values. “Hardware” RE- 
SETs will clear all the programmed bits including OW 
and Dn bits to ‘0’. The status of these bits can be deter- 
mined by issuing the dumpreg command and checking 
the 8th result byte. This byte will contain the pro- 
grammed values of the Dn and PR[0:1] bits as shown 
in Figure 6. The OW bit is not returned in this result 
byte. 
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Data Bus | 
D6 D5 D4 D3 D2 D1 DO Remarks 
DUMPREG COMMAND 
Eighth Result Byte -— 
0 D3 D2 &«iOD1 DO GAP WGATE 


Figure 6. Dumpreg Command 


_INTERFACE BETWEEN 82077AA/SL AND THE DRIVE 


FDC-FDD 
INTERFACE 


There is currently no industry-wide standard for the 
FDC to FDD interface. There are numerous floppy 
drive vendors, each with their own modes and interface 
pins to enable 4 MB perpendicular mode. The drive 
interface not only varies from manufacturer to manu- 
facturer but also within a manufacturer’s product line. 
The differences on the interface mainly originate from 
configuring the floppy drive into the 4 MB mode. De- 
pending on the drive, the differences can create prob- 
lems of daisy-chaining a 4 MB drive with the standard 
1 MB and 2 MB drives. Of course, for laptops this is 
not a problem since most of them use a single floppy 
drive. Lack of an industry standard makes it necessary 


to look at each drive and build a interface for that par- 


ticular drive. _ ; 


The following is a brief discussion about some of the 
floppy drives available in the market and how these can 
be interfaced with the 82077AA/SL. It is important to 
note that although a manufacturer’s name may be given 
in connection with the interface described, Intel does 
not guarantee that the interface discussed will apply to 
~ all the drives from that manufacturer. The main goal is 
introduce to the reader how to interface the 
82077AA/SL with a 4 MB floppy drive. 


Previously, for the conventional 1 MB and 2 MB AT 
mode drives, a single Density Select input was used by 
floppy drives to select between high density and low 
density drives. A high on this input enabled high densi- 
ty operation (500 K-bps) whereas a low enabled low 
density operation (300 Kbps/250 Kbps). This signal 
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was asserted high or low by the floppy disk controller 


depending on the data rate programmed. For the 4 MB 
operation, there are two inputs defined by the floppy 
drive manufacturers. The polarity of these inputs en- 
ables the selected density operation. Implementing this 
requires at least 1 new pin to be defined on the FDC- 
FDD interface. Most floppy vendors have elected to 
take pin 2 (originally density select) and redefine the 
polarity to conform to one of these new density select 
inputs and another pin to be the other density select 
input. However, the new density select on pin 2 is not 
compatible to the old density select input in many of 
the floppy drives. This precludes the user from daisy 
chaining 4 MB drives with conventional drives. Anoth- 
er problem is that the second density select pin varies 
on its location on the FDC-FDD interface from drive 
to drive. 


The way that the BIOS determines what type of disk- 
ette is in what type of drive is by trial and error. The 
system tries to read the diskette at 250 Kbps; if it fails 
then it will set the data rate to higher value and retry. 
The BIOS does this until the right data rate is selected. 
This method will still be implemented for the 4 MB 
drives by some BIOS vendors. However, the 4 MB 


_ drives available today also have two media sense ID 


pins that relate to the user what type of media is pres- 
ent in the floppy drive. This information will also re- 
quire two pins on the FDC-FDD interface. The loca- 
tion of these pins is once again variable from drive to 
drive. 
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Some manufacturers have circumvented the entire stan- 
dardization problem by including an auto configuration 
in the drive. In these cases, the type of floppy put into 
the drive is sensed by the hole (each 4/2/1 MB diskette 
has a hole in different locations identifying it) on the 
diskette. Then the drive automatically sets itself up for 
this mode. The BIOS must obviously set up the floppy 
disk controller for the correct data rate which could be 
done if the media sense ID was read and decoded as to 
the data rate. Due to lack of extra pins on the even side 
of the floppy connector the newer locations of some of 
the functions are migrating to the odd pins (previously 
all grounded). Some drive manufacturers have even 
made this configurable via jumpers. For instance, the 
new TEAC drives have a huge potpourri of configura- 
tions that would satisfy the appetite of some of the most 
finicky system interfaces. 
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The 82077AA/SL currently has two output pins 
DRATEO and DRATE!1 (pins 28 and 29 respectively) 
which directly reflect the data rate programmed in the 
DSR and CCR registers. These two pins can be used to 
select the correct density on the drive. These two can 
also be used with the combination of DENSEL to select 
the correct data rate. At the present time the 
82077AA/SL does not support media sense ID. How- 
ever, the user could easily make it readable directly by 
BIOS. The following is a discussion on what combina- 
tion of DRATEO, DRATE1, and DENSEL could be 
used to interface to some of the currently available flop- 
py drives. 


1. TEAC 235J-600/Toshiba PD-211/Sony (Old Version) 


These were among the first 4 MB drives available in the market. Each of them has a mode select ae on pins 2 and 
6. The polarity required for each different data rate is as shown below: 


1 ee 4 | 4MB 
500 | 500 Kbps_| 2 MB 


Data mean sini DRATE1 | DRATEo | MODSELO | MODSEL1 
<<" eo) ia 


300 Kbps/ 4 MB 
1 fom 


It is clear from the above that DRATEO = MODSELO and MODSEL!1 = DRATE1#. This would mean taking 
the drate signals onto pins 2 and 6 of the FDC-FDD interface. Unfortunately this solution requires an inverting gate. 
TEAC has recently, however, come out with a new version called TEAC 235J-3653. On this drive there are a 
number of possible configurations into which the drive can be put into, however, only the best way to interface to the 
82077AA/SL will be discussed. The requirements are as shown below. This shows that HDIN = DENSEL (original 
signal for conventional drives) and EDIN = DRATEDO. As suggested in the TEAC spec for method 1, the straps 
connected are MSC, HI2 (sets HDIN on pin 2), DC34 and EI6 (sets EDIN on pin 6). Pins 4, 29, and 33 are left 
open. Since pin 2 has the same polarity as the conventional drive requirement and the secondary input is connected 
via pin 6 (no connect on the conventional drives) daisy chaining this TEAC drive with a conventional drive does not 
cause any incompatibility. Figure 7 shows how the TEAC can be connected to the 82077AA/SL. It also shows daisy 
chaining of the TEAC drive with a conventional drive. 


1 ee 
500 | 500 Kbps. | 2 is 


300 Kbps/ 4 MB 
1 ae 
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Figure 7. Interfacing 82077AA/SL to TEAC 235J-3653 
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2. Panasonic JU-259A (New Version) 


This is Panasonic’s new drive and has the HDIN signal on pin 2 and EDIN signal on pin 6. The requirements are 
shown below. This type of interface allows for daisy chaining the Panasonic drive with a conventional drive. The 
DENSEL signal can be connected to pin 2 and the DRATEO should be connected to pin 6. 


a 


1 | 1Mbps 


. eo ie 

sere 
1 Mbps 

[250 Kbps | ima [0 


3. Mitsubishi MF356C (Model 252UG/788UG) 


There are two models of this drive. The 252UG has DENSEL! on pin 2 and DENSELO on pin 33, whereas the 
788UG has DENSELO located on pin 2 and DENSEL! located on pin 6. Via jumpers, it is possible to configure the 
drives to different polarity for the density select line. The following table shows the configuration for the 252UG in 
which jumper setting is 2MS = I/F and 4 MS = I/F. 


ON Oe eG 

fee 
1 aa 

j2sokops| imp | o | 1 | o | o | o | 


The correct connection requirement is: DENSEL (from 82077AA/SL) = DENSELI1 and DRATEO = DENSELO. 
Although there are other configurations, this provides the best one, since daisy chaining is possible without any 
problem. 


4. Epson SMD-1060 


This drive has 3 different modes of operation. Mode B is the best and is similar to Mitsubishi’s drives as described 
above. In this mode, HDI signal is connected to pin 2 and EDI is connected to pin 33. Mode B is enabled by 
inserting jumpers across 3-4 and 7-8 (SSO1 B block) and 1-2 and 3-4 (SS0O3 block) for the drive with the power 
separated type (i.e., a connector for the floppy signals and another one for power supply) of 34-pin connector. 


ss a DENSEL | DRATE1 | DRATEO aioe 


500 ee 2 s— 


300 Kbps/ 4 MB 
1 oy 


250 | 250 Kbps _ 


As demonstrated by the table, HDI = DENSEL and EDI = DRATEO. These connections would ensure daisy 
chaining capability without any problems. 
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5. Sony MP-F40W- 14/15 


The dash 14 and 15 are two drives from Sony that handle 4 MB requirements. The MP-F40W-14 has the DENSITY 
SELECT 1, DENSITY SELECT 0 on pins 2 and 33 respectively, whereas the MP-F40W-15 has the DENSITY 
SELECT 1, DENSITY SELECT 0 on pins 2 and 6 respectively. As it is obvious from the table below, daisy chaining 
is easily done if the 82077AA/SL is connected in the PS/2 mode (by tying IDENT low) with either type of drive, the 
-only difference being the location of DENSITY SELECT 0. 


DENSITY | DENSITY 
DRATE1| DRATEO| SELECT1 SELECTO 


300 Kbps/ 
1 Mbps 


If the drive is used in the PS/2 mode, then DENSITY SELECT1. = DENSEL and DENSITY SELECTO 
DRATEO. To use the drive in AT mode, DENSITY SELECT1 = DRATEI1 and DENSITY SELECTO 
DRATEO, as shown below. However, daisy chaining is not possible. 


DENSEL DENSITY | DENSITY 
PS/2 mode | DRATE1| DRATEO| SELECT1 SELECTO 
(IDENT = 0)| i 


6. Toshiba ND3571 


Toshiba MB drive has the HD mode selection on pin 6 and ED mode selection on pin 2. This causes daisy chaining 
problems with conventional drives as shown in the figure below: 


DENSEL | DRATE1 | DRATEO | ©2 Mode | HD Mode 


300 Kbps/ 


| 4MB 
500 Kbps a 


ana 


The DENSEL from the 82077 is connected to ai 6 and DRATEDO is connected to pin 2. 


4-324 


intel. 


82077SL 4 MB DESIGN 


This section presents a design application of a PC/AT 
compatible floppy disk controller. The 82077SL inte- 
grates the entire PC/AT controller design with the ex- 
ception of the address decode on a single chip. The 
schematic for this solution is shown in Figure 8. The 
chip select for the 82077SL is generated by a 85C220 
uPLD that is programmed to decode addresses 03FOH 
through 03F7H when AEN is low. The programming 
equations for the uPLD is in the Intel’s .ADF format 
and can be processed using the IPLSII compiler (avail- 
able from Intel). 


A floppy disk interface is provided by on-chip output 
buffers with a 40 mA sink capability. The outputs from 
the disk drive are terminated at the floppy disk control- 
ler with a 1 KO resistor pack. The 82077SL disk inter- 
face inputs contain a Schmitt trigger input structure for 
higher noise immunity. The host interface is a similar 
direct connection with on-chip 12 mA sink capable 
buffers on DBO-7, INT and DRQ. 


some] on |g 

Connection 
Tai BROT: DWA request used with DACKT# toallow forDMAtransfers ‘| _Open 
[sa | DACK': DMA acknowledge 1 used with DAG to allow for DMA transfers | Open 
[15 | IRS: Interuptiine 5 used to generate floppy interupts =| Open 
[—a7__| Ave: Address selection (between 9FX and @7X adéressranges) «| ~—Open 
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The schematic shows eleven jumpers numbered Jl 
through J11. The table below describes the functions of 
these jumpers as well as their normal connections. The 
normal connections allow the BIOS to work without 
modification. In the normal mode, the 82077SL re- 
sponds to DRQ2 and DACK2# as well as IRQ6. De- 
pending on the type of drive interfaced to this board, 
the DENOUTO and DENOUTI signals can be tied. 
With the setting to 2—3 on J8 and J9, the default setting 
is DENSEL on DRVDENO and DRATEO on 
DRVDEN1. PIN6/33 SELECT is used to set for pin 6 
as the EDIN input. The J11 should always be closed. It 
can be used to measure the current consumption of 
82077SL. J7 selects between the primary and secondary 
address spaces. There are two resistor packs used for 
pullups on input signals from the floppy drive interface. 
These resistors are rated at 1K. Please note that if using 
older 5.25” drives, the pullup on some of them is 1502. 
Most modem 5.25” drives use a 1K value. In order to 
ensure the correct value please refer to the floppy drive 
specification manual. 


For further information, please contact your local Intel 
sales office. 
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Designer: K. Shah 

Company: Intel Corp. 

Dept: IMD Marketing 

Date: April '92 

Rev.#: 

% The pPLD used in the 82077SL Evaluation board design, Rev.#1.0. % 
85C220 dip package 


OPTIONS: TURBO = ON 
PART: 85C220 


INPUTS: 
SA9@2, % System Address Inputs % 
SA8@3, 
SA7@4, 
SA6@5, 
SA5@6, 
SA4@7, 
SA3@8, 
AEN@9, 


DENOUTO@1, % Maps the DRVDENO and DRVDEN1 to appropriate polarity table % 
DENOUT1@18, % Maps the DRVDENO and DRVDEN1 to appropriate polarity table % 


ADDSEL@1l1, % Selects between primary and secondary address Spaces % 
DRATEO@12, % DRATEO signal from the 82077SL % 


DRATE1@13, % DRATE1 signal from the 82077SL % 
DENSEL@14 % DENSEL signal from the 82077SL % 


OUTPUTS: 
CS_@15, % 82077SL chip select signal % 


DRVDEN1@16, % Drive density signal connected to EDIN of the drive % 
DRVDENO@17 % Drive density signal connected to HDIN of the drive % 


NETWORK : 
% Inputs % 


INP (SAQ9) 
INP (SA8) 
INP (SA7) 
INP (SA6) 
INP (SA5) 
INP (SA4) 
INP (SA3) 
INP (AEN) 
ADDSEL = INP(ADDSEL) 
= INP(DRATEO) 
DRATE1 = INP(DRATE1) 
= INP(DENSEL) 
= INP(DENOUTO) 
DENOUT1 = INP(DENOUT) 


n 
> 
on 
Huta ud an 


% Outputs % 
CS. = CONF(CSeq, Voc) 


DRVDENO 
DRVDEN1 


4-327 


CONF (DENOeq, Vcc) 
CONF (DENleq, Vcc) 


AP-358 | | intel. 


EQUATIONS : 


% CS_is activated for 3F0-SF7 and 570-377 address spaces % 


CSeq = (AEN' * SAO * SA8 * SA7' * SAG * SAS * SA4 * SAS' * ADDSEL' 
+ AEN' * SAQ * SAS * SA7 * SAG * SA5 * SA4 * SAS' * ADDSEL)'; 


% These are the eTenets generated on EHVOEH? and DRVDEN1 for the FDC-FDD 
interface 
DENOUT]. DENOUTO DRVDENO DRVDEN1 
DRATEO 
DRATEO 
DRATEO 
DRATE1 


DENOeq = DENSEL * (DENOUTO' * DENOUT1") + DENSEL' * (DENOUTO * DENOUTL') | 
| + DRATE1 * (DENOUTO' * DENOUT1) + DRATEO * (DENOUTO * DENOUTL) ; 
DENleq = DRATE1 * (DENOUTO * DENOUT1) + DRATEO * (DENOUTO' + DENOUTL') ; 


-82077SL Application Note Revision Summary 


The following changes have been made since revision 001: 
Table 2 kBps was corrected to kbps. 


Page 4-323 3. Mitsubishi MF356C description modified to read: “There are two models of this drive. The 252UG 
a has DENSEL! on pin 2 and DENSELO on pin 33, whereas the 788UG has DENSELO located on pin 2 
‘and DENSEL1 located on pin 6. Via jumpers, it is possible to configure the drives to different polarity 
for the density select lines. The following table shows the configuration for the 252UG in which jumper 
| setting is 2 MS = I/F and 4 MS = I/F.” 
Figure 8 Arrow added to diagram. | 


Page 4-328 Columns corrected to line up properly. 


(4-328 : | 
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Flash Memory 
Components 


intel. 
28F001BX-T/28F001BX-B | 
1M (128K x 8) CMOS FLASH MEMORY 


m@ High Integration Blocked Architecture m High-Performance Read 
— One 8 KB Boot Block w/Lock Out — 70/75 ns, 90 ns, 120 ns, 150 ns 
— Two 4 KB Parameter Blocks Maximum Access Time 
— One 112 KB Main Block — 5.0V + 10% Vcc 
m 100,000 Erase/Program Cycles Per m Hardware Data Protection Feature 
Block — Erase/Write Lockout during Power 
m Simplified Program and Erase | Transitions 
— Automated Algorithms via On-Chip | m Advanced Packaging, JEDEC Pinouts 
Write State Machine (WSM) — 32-Pin PDIP 
m= SRAN-Compatible Write Interface — 32-Lead PLCC, TSOP 


m ETOX Ii Nonvoliatile Flash Technology 
— EPROM-Compatible Process Base 
— High-Volume Manufacturing 

Experience 


m Extended Temperature Options 


m Deep-Powerdown Mode 
— 0.05 uA Icc Typical 
— 0.8 A Ipp Typical 


m 12.0V +5% Vpp 


Intel’s 28FO001BX-B and 28F001BX-T combine the cost-effectiveness of Intel standard flash memory with 
features that simplify write and allow block erase. These devices aid the system designer by combining the 
functions of several components into one, making boot block flash an innovative alternative to EPROM and 
EEPROM or battery-backed static RAM. Many new and existing designs can take advantage of the 
28F001BX’s integration of blocked architecture, automated electrical reprogramming, and standard processor 
interface. 


The 28F001BX-B and 28F001BX-T are 1,048,576 bit nonvolatile memories organized as 131,072 bytes of 
8 bits. They are offered in 32-pin plastic DIP, 32-lead PLCC and 32-lead TSOP packages. Pin assignment 
conform to JEDEC standards for byte-wide EPROMs. These devices use an integrated command port and 
state machine for simplified block erasure and byte reprogramming. The 28F001BX-T’s block locations pro- 
vide compatibility with microprocessors and microcontrollers that boot from high memory, such as Intel’s 
MCS-186 family, 80286, i386™, i486™, i860™ and 80960CA. With exactly the same memory segmentation, 
the 28F001BX-B memory map is tailored for microprocessors and microcontrollers that boot from low memory, 
such as Intel’s MCS-51, MCS-196, 80960KX and 80960SxX families. All other features are identical, and unless 
otherwise noted, the term 28F001BX can refer to either device throughout the remainder of this document. 


The boot block section includes a reprogramming write lock out feature to guarantee data integrity. It is 
designed to contain secure code which will bring up the system minimally and download code to the other 
locations of the 28F001BX. Intel’s 28F001BX employs advanced CMOS circuitry for systems requiring high- 
performance access speeds, low power consumption, and immunity to noise. Its access time provides 
no-WAIT-state performance for a wide range of microprocessors and microcontrollers. A deep-powerdown 
mode lowers power consumption to 0.25 wW typical through Vcc, crucial in laptop computer, handheld instru- 
mentation and other low-power applications. The RP # power control input also provides absolute data protec- 
tion during system powerup or power loss. 


Manufactured on Intel’s ETOX process base, the 28F001BX builds on years of EPROM experience to yield the 
highest levels of quality, reliability, and cost-effectiveness. 


The complete document for this product can be ordered by calling 1-800-548-4725. It is also available on 
Intel’s “Data-on-Demand” CD-ROM product; contact your local Intel field sales office or Intel technical distrib- 
utor. 

November 1994 

Order Number: 290406-006 5-1 


intel. 


28F200BX-T/B, 28F002BX-T/B 


2-MBIT (128K x 16, 256K x 8) BOOT BLOCK 


FLASH MEMORY FAMILY 
m x8/x16 Input/Output Architecture m Hardware Data Protection Feature 
— 28F200BX-T, 28F200BX-B — Erase/Write Lockout during Power 
— For High Performance and High Transitions 
Integration 16-bit and 32-bit CPUs m Very High-Performance Read 
m x8-only Input/Output Architecture — 60/80/120 ns Maximum Access Time 
— 28F002BX-T 28F002BX-B — 30/40/40 ns Maximum Output Enable 
— For Space Constrained 8-bit Time 
Applications Low Power Consumption 
m= Upgradable to Intel’s SmartVoitage — 20 mA Typical Active Read Current 
Products m Reset/Deep Power-Down Input 
mw Optimized High Density Blocked —0.2 wA Icc Typical 
Architecture — Acts as Reset for Boot Operations | 
— One 16-KB Protected Boot Block 
— Two 8-KB Parameter Blocks ™ meas ih oi Operation 
— One 96-KB Main Block 7 
— One 128 KB Main Block m Write Protection for Boot Block | 
— Top or Bottom Boot Locations m Industry Standard Surface Mount 
m Extended Cycling Capability _ Packaging 
— 100,000 Block Erase Cycles — 28F200BX: JEDEC ROM Compatible 
44-Lead PSOP 
m Automated Word/Byte mene, and ee 
BI E 56-Lead TSOP 
ock Erase — 28F002BX: 40-Lead TSOP 
— Command User Interface 
— Status Registers m 12V Word/Byte Write and Block Erase 
— Erase Suspend Capability = ve = bea ey 
m= SRAM-Compatible Write Interface else es Pree 
m= ETOX Ill Flash Technology 
m Automatic Power Savings Feature me 
5V Read 
— 1 mA Typical icc Active Current in | 
Static Operation m@ Independent Software Vendor Support — 


The complete document for this product can be ordered by calling 1-800-548-4725. It is also available on 
Intel’s “Data-on-Demand”’ CD-ROM product; contact your local Intel field sales office or Intel technical distrib- 
utor. 
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28F200BL-T/B, 28FO002BL-T/B 
2-MBIT (128K x 16, 256K x 8) LOW POWER BOOT BLOCK 
FLASH MEMORY FAMILY 


m Low Voltage Operation for Very Low = m SRAM-Compatible Write Interface 


ae Pe a aa m Automatic Power Savings Feature 
ECC, = ees — 0.8 mA Typical Icc Active Current in 


m Expanded Temperature Range Static Operation 
— — 20°C to + 70°C m Very High-Performance Read 
m@ x8/x16 Input/Output Architecture — 150 ns Maximum Access Time 
— 28F200BL-T, 28F200BL-B — 65 ns Maximum Output Enable Time 


— For High Performance and High 
Integration 16-bit and 32-bit CPUs 


m@ x8-only Input/Output Architecture 
— 28F002BL-T, 28FO02BL-B 
— For Space Constrained 8-bit 


‘mg Low Power Consumption 
— 15 mA Typical Active Read Current 


m Reset/Deep Power-Down Input 
— 0.2 vA Icc Typical 
— Acts as Reset for Boot Operations 


mictipaiebs Write P tion for Boot Block 
m Upgradeable to Intel’s SmartVoltage Be Tey nor meee moe 
Products m@ Hardware Data Protection Feature 
= Optimized High Density Blocked Oe ee ern eee ers 
Architecture 
— One 16-KB Protected Boot Block m Industry Standard Surface Mount 
— Two 8-KB Parameter Blocks Packaging 
— One 96-KB Main Block — 28F200BL: JEDEC ROM Compatible 
— One 128-KB Main Block 44-Lead PSOP 
— Top or Bottom Boot Locations 7 56-Lead TSOP 
: a — 28F002BL: 40-Lead TSOP 
m Extended Cycling Capability 
— 10,000 Block Erase Cycles m 12V Word/Byte Write and Block Erase 


od — + ro) 
u Automated Word/Byte Write and Block Vpp = 12V +5% Standard 


Erase m ETOX Ill Flash Technology 
— Command User Interface — 3.3V Read 
— Status Registers m Independent Software Vendor Support 


— Erase Suspend Capability 


The complete document for this product can be ordered by calling 1-800-548-4725. It is also available on 
Intel’s “Data-on-Demand” CD-ROM product; contact your local Intel field sales office or Intel technical distrib- 
utor. 
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28F400BX-T/B, 28F004BX-T/B 
4 MBIT (256K x16, 512K x8) BOOT BLOCK FLASH 
MEMORY FAMILY 


x8/x16 Input/Output Architecture | 
— 28F400BX-T, 28F400BX-B 
— For High Performance and High 
Integration 16-bit and 32-bit CPUs | 


x8-only Input/Output Architecture 

— 28F004BX-T, 28F004BX-B 

— For Space Constrained 8-bit 
Applications 


Upgradeable to Intel’s Smart Voltage 
Products | 


Optimized High Density Blocked 
Architecture 

— One 16-KB Protected Boot Block 
— Two 8-KB Parameter Blocks 

— One 96-KB Main Block 

— Three 128-KB Main Blocks 

— Top or Bottom Boot Locations 


Extended Cycling Capability 
— 100,000 Block Erase Cycles 


Automated Word/Byte Write and Block 
Erase 

— Command User Interface 

— Status Registers 

— Erase Suspend Capability 


SRAM-Compatible Write Interface 


Automatic Power Savings Feature 
— 1 mA Typical Ic¢c Active Current in 
Static Operation 


m Very High- -Performance Read 


— 60/80/120 ns Maximum Access Time 

— 30/40/40 ns Maximum Output Enable 
Time 

Low Power Consumption 

— 20 mA Typical Active Read Current 


Reset/Deep Power-Down Input 
— 0.2 vA Icc Typical 
— Acts as Reset for Boot Operations 


Extended Temperature Operation 
— —40°C to + 85°C 


Write Protection for Boot Block 


Hardware Data Protection Feature 
— Erase/Write Lockout During Power 
Transitions 


Industry Standard Surface Mount 
Packaging 
— 28F400BX: JEDEC ROM Compatible 
44-Lead PSOP 
56-Lead TSOP 
— 28F004BX: 40-Lead TSOP 


12V Word/Byte Write and Block Erase 
—Vpp = 12V +5% Standard 
—Vpp = 12V + 10% Option 


ETOX Ill Flash Technology 
— 5V Read 


The complete document for this product can be ordered by calling 1-800-548-4725. It is also available on 
Intel’s “Data-on-Demand”’ CD-ROM product; contact your local Intel field sales office or Intel technical distrib- 
utor. 
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PRELIMINARY 


_ 28F008SA 
8-MBIT (1-MBIT x 8) FlashFile™’ MEMORY 


Extended Temperature Specifications Included 


m High-Density Symmetrically Blocked m Very High-Performance Read 
Architecture — 85 ns Maximum Access Time 
— Sixteen 64-Kbyte Blocks m= SRAM-Compatible Write Interface 
m Extended Cycling Capability m Hardware Data Protection Feature 
— 100,000 Block Erase Cycles — Erase/Write Lockout during Power 
— 1.6 Million Block Erase Transitions 
Cycles per Chip i duisiry Standard Packaal 
m@ Automated Byte Write and Block Erase ” = aR ea TCOE epee 
— Command User Interface ie | 
— Status Register m@ ETOX Ill Nonvolatile Flash Technology 


— 12V Byte Write/Block Erase 


independent Software Vendor Support 
— Microsoft* Flash File System (FFS) 


m System Performance Enhancements 
— RY/BY# Status Output a 
— Erase Suspend Capability 


m Deep-Powerdown Mode 
— 0.20 nA Icc Typical 


Intel’s 28FOO8SA 8-Mbit FlashFile™’ Memory is the highest density nonvolatile read/write solution for solid 
state storage. The 28FOO8SA’s extended cycling, symmetrically blocked architecture, fast access time, write 
automation and low power consumption provide a more reliable, lower power, lighter weight and higher per- 
formance alternative to traditional rotating disk technology. The 28FOO8SA brings new capabilities to portable 
computing. Application and operating system software stored in resident flash memory arrays provide instant- 
on, rapid execute-in-place and protection from obsolescence through in-system software updates. Resident 
software also extends system battery life and increases reliability by reducing disk drive accesses. 


For high density data acquisition applications, the 28FO08SA offers a more cost-effective and reliable alterna- 
tive to SRAM and battery. Traditional high density embedded applications, such as telecommunications, can 
take advantage of the 28FO08SA’s nonvolatility, blocking and minimal system code requirements for flexible 
firmware and modular software designs. 


The 28F008SA is offered in 40-lead TSOP (standard and reverse) and 44-lead PSOP packages. Pin assign- 
ments simplify board layout when integrating multiple devices in a flash memory array or subsystem. This 
device uses an integrated Command User Interface and state machine for simplified block erasure and byte 
write. The 28FOO8SA memory map consists of 16 separately erasable 64-Kbyte blocks. 


Intel’s 28FOO8SA employs advanced CMOS circuitry for systems requiring low power consumption and noise 
immunity. Its 85 ns access time provides superior performance when compared with magnetic storage media. 
A deep powerdown mode lowers power consumption to 1 wW typical thru Vcc, crucial in portable computing, 
handheld instrumentation and other low-power applications. The RP# power control input also provides 
absolute data protection during system powerup/down. 


Manufactured on Intel’s 0.8 micron ee process, the 28FOO8SA provides the highest levels of quality, 
reliability and cost-effectiveness. 


*Microsoft is a trademark of Microsoft Corporation. 


The complete document for this product can be ordered by calling 1-800-548-4725. It is also available on 
Intel’s “Data-on-Demand”’ CD-ROM product; contact your local Intel field sales office or Intel technical distrib- 
utor. 
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28F016SA 
16 MBIT (1 MBIT x 16, 2 MBIT x 8) | 
FlashFile™ MEMORY 


User-Selectable 3.3V or 5V Vcc m Revolutionary Architecture 


= 

m User-Configurable x8 or x16 Operation — Pipelined Command Execution 
70 ns Maxi A Ti — Write During Erase 

- id eral el ead — Command Superset of Intel 

m 28.6 MB/sec Burst Write Transfer Rate 28F008SA 

m 1 Million Typical Erase Cycles per m 1MmA Typical Icc in Static Mode 

_ Block m 1A Typical Deep Power-Down 

i 56-Lead, 1.2mm x 14mm x 20mm TSOP aes : 
Package m 32 Independently Lockable Blocks 

m 56-Lead, 1.8mm x 16mm x 23.7mm m State-of-the-Art 0.6 .m ETOX™ IV 
SSOP Package Flash Technology 


Intel’s 28FO016SA 16-Mbit FlashFileT™’ memory is a revolutionary architecture which is the ideal choice for 
designing embedded direct-execute code and mass storage data/file flash memory systems. With innovative 
capabilities, low-power, extended temperature operation and high read/write performance, the 28FO016SA 
enables the design of truly mobile, high-performance communications and computing products. 


The 28F016SA is the highest density, highest performance non-volatile read/write solution for solid-state 
storage applications. Its symmetrically blocked architecture (100% compatible with the 28FOO8SA 8-Mbit 
FlashFile memory), extended cycling, extended temperature operation, flexible Vcc, fast write and read per- 
formance and selective block locking provide highly flexible memory components suitable for resident flash 
arrays, high-density memory cards and PCMCIA-ATA flash drives. The 28F016SA dual read voltage enables 
the design of memory cards which can interchangeably be read/written in 3.3V and 5.0V systems. Its x8/x16 
architecture allows optimization of the memory-to-processor interface. Its high read performance and flexible 
block locking enable both storage and execution of operating systems and application software. Manufactured 
on Intel’s 0.6 wm ETOX™ IV process technology, the 28F016SA is the most cost effective, highest density 
monolithic 3.3V FlashFile memory. 


oe ee 
ETOX™ and FlashFile™ are trademarks of Intel Corporation. . 

The complete document for this product can be ordered by calling 1-800-548-4725. It is also available on 
‘Intel’s “Data-on-Demand”’ CD-ROM product; contact your local Intel field sales office or Intel technical distrib- 
utor. 
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Intel486™ — 
Microprocessor 
SmartDie™ Products 


intel . PRELIMINARY 
Intel486™ DX2 MICROPROCESSOR 


SmartDie™ Product Specification 


m SL-Technology for Energy Efficiency m IEEE 1149.1 Boundary Scan 
— Intel’s System Management Mode Compatibility 
— Stop Clock, Auto HALT and Auto Idle a High-Performance Design 
Power Down — 40/50 MHz Core Speed Using 
m Binary-Compatible with Large Software 20/25 MHz Bus Clock at 3.3V 
Base | — RISC Integer Core with Frequent 
— MS-DOS*, OS/2*, Windows* Instructions Executing in One Core 
— UNIX* System V/Intel386™ Clock 
— iIRMX® Software, iRMK Kernels — 64/80 Mbyte/sec Burst Bus 
High Integration Enables On-Chi @40/50 MHz 
= a Kbyte aa and atest — Dynamic Bus Sizing for 8-, 16- and 
32-Bit Buses 


— Floating Point Unit ; ; 
— Paged, Virtual Memory Management — Complete 32-Bit Architecture 
m Multiprocessor Support 
z a ae Test — Cache Consistency Protocols 
— Hardware Debugging Support -— Support for Second-Level Cache 


g Intel SmartDie Product 
— Full AC/DC Testing at Die Level 
— 0°C-80°C (Junction) Temperature 
Range 
— 40 MHz and 50 MHz Core Speeds 
@3.3V 


NOTICE: This document contains preliminary information on new products in production. It is valid for the 
devices indicated in the revision history. This specification is subject to change without notice. Verify with your 
local Intel Sales Office that you have the latest SmartDie product specification before finalizing a design. 


REFERENCE INFORMATION: The information in this document is provided as a supplement to the Standard 


Package Data Sheet on a specific product. Please reference the Standard Package Data Sheet (Order No. 
242202) for additional product information and specifications not found in this document. 


*Other brands and names are the property of their respective owners. 
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in | | PRELIMINARY 
Intelag6m™ SX MICROPROCESSOR 


SmartDie™ Product Specification 


m SL Technology for Energy Efficiency m= Complete 32-Bit Architecture 

— Intel’s System Management Mode — Address and Data Buses 

— Stop Clock, Auto HALT and Auto Idle _ — Registers 

Power Down | — 8-Bit, 16-Bit and 32-Bit Data Types 

= Binary-Compatible with Large Software m Multiprocessor Support 

Base — Multiprocessor Instructions | 

— MS-DOS", OS/2*, Windows* — | — Cache Consistency Protocols 

— UNIX* System V/386 — Support for Second-Level Cache 


— iRMX Software, iRMK Kernels m IEEE 1149.1 Boundary Scan 


g High Integration Enables On-Chip Compatibility 
—& Kbyte Code and Data Cache Intel SmartDie Product 
= raged, Virthal memory manegement — Full AC/DC Testing at Die Level 
m@ Easy to Use —0°C to + 80°C (Junction) 
— Built-In Self Test Temperature Range 
— Hardware Debugging Support — 25 MHz and 33 MHz Speeds @ 3.3V 


— Intel Software Support 
— Extensive Third Party Software 
Support 


m High-Performance Design 
'— Intel486 One Clock Instruction Core 
— 80/100 Mbyte/sec Burst Bus at 
25/33 MHz 
— CHMOS V Process Technology 
— Dynamic Bus Sizing for 8-Bit, 16-Bit 
and 32-Bit Buses 


NOTICE: This document contains preliminary information on new products in production. It is valid for the 
devices indicated in the revision history. This specification is subject to change without notice. Verify with 


your local Intel Sales Office that you have the latest SmartDie product specification before finalizing a 
design. 


~ REFERENCE INFORMATION: The information in this document is provided as a supplement to the Standard 
Package Data Sheet on a specific product. Please reference the Standard Package Data Sheet/Book (Order 
No. 242202) for aciditional product information and specifications not found in this document. 
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Intel486™ Microprocessors and Related Products 


In 1994, Intel enhanced the Intel486™ microprocessor family with the 
IntelDX4™ Processor, the world’s fastest 486, and the Write-Back 
Enhanced IntelIDX2™ Processor. These new processors enable _ 
development of higher performance, lower-cost, entry-level desktop 
and mobile system products. Supporting PCIsets are also available, 
allowing the system designer to support the Peripheral Component 
Interconnect (PCI) bus. PCI allows a glueless interface for high 
performance peripherals to be placed onto a fast local bus. By pro- 
viding multiple performance options, Intel makes desktop and mobile 
computing power affordable for more and more users. 


This databook contains extensive information on Intel486 micro- 
processor families, OverDrive™ processors, supporting PClIsets, 
floppy and hard disk controllers, mobile peripheral products and flash 
memory components for the desktop and mobile family of Intel486 
microprocessors. The datasheets and application notes contained in 
this databook include comprehensive charts, diagrams and instruction 
and hardware information for leading-edge 32-bit system development. 
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